Covering Scientific & Technical AI | Wednesday, November 27, 2024

Author Archives: Doug Eadline

Nvidia Releasing Open-Source Optimized TensorRT-LLM Runtime with Commercial Foundational AI Models to Follow Later This Year

September 14th, 2023
Nvidia's large language models will become generally available later this year, the company confirmed. Organizations rely widely on Nvidia's graphics processors to build AI applications. The company has also created proprietary pre-trained models similar to OpenAI's GPT-4 and Google's PaLM-2. ...

MLPerf Releases Latest Inference Results and New Storage Benchmark

September 14th, 2023
MLCommons this week issued the results of its latest MLPerf Inference (v3.1) benchmark exercise. Nvidia was again the top-performing accelerator, but Intel (Xeon CPU) and Habana (Gaudi1 and 2) performed well. Google provided a peek at its new ...

Nvidia H100: Are 550,000 GPUs Enough for This Year?

August 21st, 2023
The GPU Squeeze continues to place a premium on Nvidia H100 GPUs. In a recent Financial Times article, Nvidia reports that it expects to ship 550,000 of its latest H100 GPUs worldwide in 2023. The appetite for GPUs is obviously coming ...

GigaIO’s New SuperNode Takes Off with Record-Breaking AMD GPU Performance

August 11th, 2023
The HPC user's dream is to keep stuffing GPUs into a rack-mount box and make everything go faster. Some servers offer up to eight GPUs, but the standard server usually offers four GPU slots. Fair ...
AIwire