Covering Scientific & Technical AI | Monday, December 23, 2024

Dell Generative AI Open Ecosystem with AMD Instinct Accelerators 

Dec. 8, 2023 -- Dell Technologies and AMD are coming together to unveil an expansion to the Dell Generative AI Solutions portfolio, continuing the work of accelerating advanced workloads and offering businesses more choice to continue their unique GenAI journeys. This new technology highlights a pivotal role played by open ecosystems and silicon diversity in empowering customers with simple, trusted and tailored solutions to bring AI to their data.

8x AMD Instinct MI300X accelerator Open Compute Project Accelerator Module (OAM) Universal Baseboard (UBB).

The surging growth of data and the question of what to make of it are central in today’s AI landscape. AI is the latest wave of innovation, and working to help ensure organizations have access to the compute power needed to fuel their innovation engine is key. Together with AMD, Dell can offer customers more choice to power their AI and GenAI workloads than before.

The fastest ramping solution in Dell history, the Dell PowerEdge XE9680, will support the upcoming AMD Instinct MI300X accelerator, offering customers more choice when it comes to AI performance. The PowerEdge XE9680 with AMD Instinct MI300X offers high-performance capabilities for enterprises seeking to unlock the value of their data and differentiate their business with customized large language models (LLMs). With eight MI300X GPUs, 192GB of 5.3 TB/s High Bandwidth Memory (HBM3) per GPU for a total coherent HBM3 capacity of 1.5 TB per server and over 21 petaflops of FP16 performance, the PowerEdge XE9680 with MI300X is poised to further democratize access to GenAI for enterprises eager to train larger models, reduce data center footprints, lower TCO and gain a competitive edge.

Along with the support for the AMD MI300X Instinct Accelerator, Dell is also announcing the next step of Dell Generative AI Solutions, making it easier for organizations to deploy trustworthy GenAI with the new Dell Validated Design for Generative AI with AMD.

This new Dell Validated Design for Generative AI with AMD ROCm powered AI frameworks, available next year, extends the Dell Generative AI Solutions ecosystem and will include open-source large language models. AMD ROCm is an open-source stack of drivers, dev toolkits and APIs for AMD Instinct accelerators. Dell Validated Designs for Generative AI make it simple for Dell customers to build trusted GenAI platforms tailored to their needs by taking the guesswork out of integration, performance and sizing considerations.

Dell’s latest eBook and infographic demonstrate how ROCm-optimized open-source AI and ML frameworks like PyTorch, TensorFlow, ONNX-RT, JAX and OpenAI Triton have native support for the PowerEdge XE9680 with MI300X, enabling simple drop-in support for LLMs based on these popular frameworks. Read Dell's three-part blog series to learn more and explore code samples:

The PowerEdge XE9680 is a powerhouse in performance and a champion of security and simplicity. Dell Open Manage Enterprise software simplifies and accelerates GenAI infrastructure deployment, while CloudIQ software provides intelligent automation from anywhere. The PowerEdge XE9680 is backed by integrated cyber recovery and a broad Zero Trust approach for greater security.

The PowerEdge XE9680 leverages the AMD Instinct Platform powered by eight AMD Instinct MI300X accelerators, enabling near-linear scaling and low latency distributed GenAI training and inferencing with Global Memory Interconnect (xGMI) spanning a cluster of PowerEdge servers interconnecting MI300X GPUs over an Ethernet-based AI fabric using a Dell validated GPU fabric with Dell PowerSwitch Z9664F-ON.

As a member of the Ultra Ethernet Consortium (UEC), Dell continues to advocate for standards-based networking. Dell Technologies is uniquely positioned to realize the full potential of an open approach by implementing it across the entire GenAI solution: compute, fabric and storage. The Dell PowerEdge XE9680 with AMD Instinct MI300X accelerators and Dell Validated Design for Generative AI with AMD will be available in the first half of 2024.

Learn more about the Dell PowerEdge XE family of accelerated servers.

See the latest on Dell’s growing range of acceleration products for AI, including the AMD Instinct MI300X.


Source: Dave Lincoln, Dell Technologies

AIwire