NVIDIA Supports SoftBank in Building AI Supercomputer, Unveils AI-Driven Telecom Network
TOKYO, Nov. 13, 2024 -- NVIDIA has announced a series of collaborations with SoftBank Corp. designed to accelerate Japan’s sovereign AI initiatives and further its global technology leadership while also unlocking billions of dollars in AI revenue opportunities for telecommunications providers worldwide.
During his keynote at NVIDIA AI Summit Japan, NVIDIA founder and CEO Jensen Huang announced that SoftBank is building Japan’s most powerful AI supercomputer using the NVIDIA Blackwell platform and has plans to use the NVIDIA Grace Blackwell platform for its next supercomputer.
Additionally, NVIDIA revealed that SoftBank, using the NVIDIA AI Aerial accelerated computing platform, has successfully piloted the world’s first combined AI and 5G telecom network — a breakthrough in computing that opens AI revenue streams potentially worth billions of dollars to telecom operators.
NVIDIA and SoftBank also announced that, using NVIDIA AI Enterprise software, SoftBank is aiming to create an AI marketplace that can meet the demand for local, secure AI computing. This new service, which supports AI training and edge AI inference, positions SoftBank to become the AI grid for Japan, facilitating new business opportunities for the creation, distribution and use of AI services across the country’s industries, consumers and enterprises.
“Japan has a long history of pioneering technological innovations with global impact,” said Huang. “With SoftBank’s significant investment in NVIDIA’s full-stack AI, Omniverse and 5G AI-RAN platforms, Japan is leaping into the AI industrial revolution to become a global leader, driving a new era of growth across the telecommunications, transportation, robotics and healthcare industries in ways that will greatly benefit humankind in the age of AI.”
“Countries and regions worldwide are accelerating the adoption of AI for social and economic growth, and society is undergoing significant transformation,” said Junichi Miyakawa, president and CEO of SoftBank. “Through our long collaboration with NVIDIA, SoftBank is leading this transformation from the forefront. With our extremely powerful AI infrastructure and our new, distributed AI-RAN solution ‘AITRAS’ that reinvents 5G networks for AI, we will accelerate innovation across the country and throughout the world.”
SoftBank First to Receive Blackwell, Plans for Grace Blackwell
SoftBank is slated to receive the world’s first NVIDIA DGX B200 systems, which will serve as the building blocks for its new NVIDIA DGX SuperPOD supercomputer. The company plans to use its Blackwell-powered DGX SuperPOD for its own generative AI development and AI-related business, as well as that of universities, research institutions and businesses throughout Japan.
Upon completion, SoftBank’s DGX SuperPOD is expected to be Japan’s most performant to date. Featuring NVIDIA AI Enterprise software and NVIDIA Quantum-2 InfiniBand networking, it is also ideal for the development of large language models.
In addition to its DGX SuperPOD, SoftBank plans to build another NVIDIA-accelerated supercomputer to run extremely compute-intensive workloads. Initial plans for the supercomputer are based on an NVIDIA Grace Blackwell platform design featuring NVIDIA GB200 NVL72 multi-node, liquid-cooled, rack-scale systems that combine NVIDIA Blackwell GPUs with power-efficient Arm-based NVIDIA Grace CPUs.
AI-RAN Reaches New Milestone
Working closely with NVIDIA, SoftBank has achieved a technology milestone — the development of a new kind of telecommunications network that can run AI and 5G workloads at the same time, known by the industry as artificial intelligence radio access network, or AI-RAN. This new breed of infrastructure has broad ecosystem support from the telecom industry, as it offers operators the ability to transform their base stations from cost centers into AI revenue-producing assets.
Through an outdoor trial conducted in the Kanagawa prefecture, SoftBank demonstrated that its NVIDIA-accelerated AI-RAN solution has achieved carrier-grade 5G performance and was able to do so while using the network’s excess capacity to run AI inference workloads concurrently.
Traditional telco networks are designed to handle peak loads and, on average, have used only one-third of that capacity. With the common computing capability provided by AI-RAN, it is expected that telcos now have the opportunity to monetize the remaining two-thirds capacity for AI inference services.
NVIDIA and SoftBank estimate that telco operators can earn roughly $5 in AI inference revenue from every $1 of capex it invests in new AI-RAN infrastructure. Taking into account its opex and capex costs, SoftBank estimates it can achieve a return of up to 219% for every AI-RAN server it adds to its infrastructure.
Real-World Inference Runs on AI-RAN
For the trial, SoftBank used NVIDIA AI Enterprise to build real-world AI inference applications, including autonomous vehicle remote support, robotics control and multimodal retrieval-automated generation at the edge. All inference workloads were able to run optimally on SoftBank’s AI-RAN network.
SoftBank’s fully software-defined 5G radio stack is optimized for NVIDIA’s AI computing platform and includes L1 software enhanced by SoftBank based on NVIDIA Aerial CUDA-accelerated RAN libraries. SoftBank plans to incorporate NVIDIA Aerial RAN Computer-1 systems, which it estimates can use 40% less power than traditional 5G network infrastructure, into its solution moving forward.
NVIDIA and SoftBank partners that contributed to the trial of SoftBank’s AI-RAN solution include Fujitsu and Red Hat.
Matching Supply With Demand
Because an AI-RAN solution needs to spin compute up or down dynamically based on demand and supply without compromising carrier-grade performance in real time, SoftBank aims to build an ecosystem that connects the demand and supply of AI technology by using NVIDIA AI Enterprise serverless application programming interfaces and its in-house developed orchestrator. This enables SoftBank to dispatch external AI inferencing jobs to an AI-RAN server when computing resources are available to deliver localized, low-latency, secure inferencing services.
“Shifting from single-purpose to multi-purpose AI-RAN networks can mean 5x the revenue for every dollar of capex invested,” said Ronnie Vasishta, senior vice president of telecom at NVIDIA. “SoftBank’s live field trial marks a huge step toward AI-RAN commercialization with the validation of technology feasibility, performance and economics.”
“SoftBank’s ‘AITRAS’ is the first AI-RAN solution developed through a five-year collaboration with NVIDIA. It integrates and coordinates AI and RAN workloads through the SoftBank-developed orchestrator, enhancing communication efficiency by running dense cells on a single NVIDIA-accelerated GPU server,” said Ryuji Wakikawa, vice president and head of the Research Institute of Advanced Technology at SoftBank. “We are confident this AI-driven innovation, AITRAS, will pave the way for new business models in telecommunications, serving as a crucial factor in the transformation of mobile operators.”
Learn more about NVIDIA solutions for AI-RAN here.
Source: NVIDIA