News

NVIDIA and SoftBank Corp. Partners to Quicken Japan’s Transition to a Global AI Powerhouse

NVIDIA and SoftBank have partnered to advance Japan's technological leadership and AI ambitions. SoftBank will use the NVIDIA Blackwell platform for building a powerful AI supercomputer, piloting the world's first combined AI and 5G telecom network, and developing an AI marketplace.

NVIDIA announced a number of partnerships with SoftBank Corp. that will help Japan advance its global technological leadership and accelerate its sovereign AI ambitions, while also opening up billions of dollars in AI income prospects for telecom companies around the world.

NVIDIA founder and CEO Jensen Huang said during his keynote address at the NVIDIA AI Summit Japan that SoftBank is utilizing the NVIDIA Blackwell platform to construct Japan’s most potent AI supercomputer and intends to employ the NVIDIA Grace Blackwell platform for its upcoming supercomputer.

Also Read: Nvidia Eyes Partnership with India for Chip Development, Targeting Expanding AI Market

Key Insights:

NVIDIA also disclosed that SoftBank has successfully piloted the world’s first combined AI and 5G telecom network using the NVIDIA AI Aerial accelerated computing platform. This is a computing breakthrough that opens up AI revenue streams for telecom operators that could be worth billions of dollars.

NVIDIA and SoftBank also revealed that SoftBank plans to develop an AI marketplace that can satisfy the need for safe, local AI computing by utilizing NVIDIA AI Enterprise software. By supporting AI training and edge AI inference, this new service puts SoftBank in a position to become Japan’s AI grid and open up new economic prospects for the development, deployment, and use of AI services by the nation’s consumers, businesses, and industries.

Also Read: Denmark’s First AI Supercomputer Gefion Launched by Jensen Huang of Nvidia & Danish King

Blackwell is First Received by SoftBank, with Plans for Grace Blackwell

The world’s first NVIDIA DGXTM B200 systems, which will form the foundation of SoftBank’s upcoming NVIDIA DGX SuperPODTM supercomputer, are scheduled to arrive.

For its own generative AI development and AI-related operations, as well as that of Japanese universities, research institutes, and companies, SoftBank intends to employ its Blackwell-powered DGX SuperPOD.

AI-RAN Achieves a New Achievement

SoftBank has developed a new type of telecommunications network that can handle AI and 5G workloads simultaneously, referred to by the industry as an artificial intelligence radio access network, or AI-RAN, in close collaboration with NVIDIA. This is a significant technological advancement.

Because it enables operators to turn their base stations from cost centers into AI revenue-producing assets, this new type of infrastructure enjoys widespread ecosystem support from the telecom sector.

Also Read: NVIDIA NVLM 1.0: Know All About Open-Source Multimodal LLM

AI-RAN Is Used for Real-World Inference

SoftBank developed real-world AI inference applications for the trial using NVIDIA AI Enterprise, such as robotics control, multimodal retrieval-automated generation at the edge, and autonomous vehicle remote assistance. SoftBank’s AI-RAN network was able to support all inference workloads with optimal performance.

NVIDIA’s AI computing platform is tailored for SoftBank’s fully software-defined 5G radio stack, which includes SoftBank-enhanced L1 software built on NVIDIA AerialTM CUDA®-accelerated RAN libraries. In the future, SoftBank intends to include NVIDIA Aerial RAN Computer-1 systems into its solution, which it claims can consume 40% less power than conventional 5G network infrastructure.

Red Hat and Fujitsu are two NVIDIA and SoftBank partners that helped with the SoftBank AI-RAN solution trial.

Also Read: Nvidia’s New Llama-3.1-Nemotron AI Model Takes on OpenAI & Anthropic with Superior Performance

How are supply and demand balanced?

SoftBank wants to create an ecosystem that links the supply and demand of AI technology by utilizing its in-house orchestrator and NVIDIA AI Enterprise serverless application programming interfaces. This is because an AI-RAN solution must be able to dynamically spin compute up or down in response to supply and demand without sacrificing carrier-grade performance in real-time. This allows SoftBank to provide localized, low-latency, secure inferencing services by sending external AI inferencing jobs to an AI-RAN server when computing resources are available.

This post was last modified on November 13, 2024 3:34 am

Kumud Sahni Pruthi

A postgraduate in Science with an inclination towards education and technology. She always looks for ways to help people improve their lives by putting complex things into simple words through her writing.

Recent Posts

Top 10 Robotics Skills Required for Engineering Career Growth

Are you looking to advance your engineering career in the field of robotics? Check out…

April 18, 2025

Top 20 Books on AI in 2025: The Ultimate Reading List on Artificial Intelligence

Artificial intelligence is a topic that has recently made internet users all over the world…

April 18, 2025

Top 10 Best AI Communities in 2025

Boost your learning journey with the power of AI communities. The article below highlights the…

April 18, 2025

Artificial Intelligence (AI) Glossary and Terminologies – Complete Cheat Sheet List

Demystify the world of Artificial Intelligence with our comprehensive AI Glossary and Terminologies Cheat Sheet.…

April 18, 2025

Scott Wu Net Worth: Devin AI Software Engineer, CEO of Cognition Labs

Scott Wu is the co-founder and Chief Executive Officer of Cognition Labs, an artificial intelligence…

April 17, 2025

Top 13 Yield Farming Platforms in 2025: Maximize APY with Secure and Trusted Crypto Tools

Discover the 13 best yield farming platforms of 2025, where you can safely maximize your…

April 17, 2025