Covering Scientific & Technical AI | Tuesday, March 18, 2025
Nvidia's large-language models will become generally available later this year, the company confirmed. Organizations widely rely on Nvidia's graphics processors to write AI applications. The company has also created proprietary pre-trained models similar to OpenAI's GPT-4 and Google's PaLM-2. Customers can use their own corpus of data, embed it in Nvidia's pre-trained large language models, ... Full article
MLCommons this week issued the results of its latest MLPerf Inference (v3.1) benchmark exercise. Nvidia was again the top performing accelerator, but Intel (Xeon CPU) and Habana (Gaudi1 and 2) performed well. Google provided a peak at its new TPU (v5e) performance. MLCommons also debuted a new MLPerf Storage (v0.5) benchmark intended to measure storage ... Full article
As the freight train that is generative AI continues barreling down the track to an uncertain destination, we thought it would be good to take some time to stop and ponder where we’re currently at in terms of GenAI adoption. It’s been quite a ride since the launch of ChatGPT in late November 2022 ignited ... Full article
A software development kit for Mojo, a new Python-based language for AI development created by former Google engineers, is now available for download on Linux systems, with support for Mac and Windows coming soon, the company behind Mojo announced today. Mojo is Pythonic language designed to help AI developers get the most performance out of ... Full article
The dominance of Nvidia GPUs has companies scrambling to find non-GPU alternatives, and another mainstream option has emerged with Google’s TPU v5e AI chip. The TPU v5e is also Google’s first AI chip being mainstreamed with a suite of software and tools for large-scale orchestration of AI workloads in virtual environments. The AI chip is ... Full article
Condor Galaxy is an AI system recently debuted by Cerebras Systems and Middle Eastern cloud provider G42. The system has already been busy with training Jais, a 13-billion parameter Arabic large language model trained on a 395-billion-word Arabic and English dataset. Named after Jebel Jais, the UAE’s highest mountain, the Jais LLM is a collaboration ... Full article
Google Cloud introduced several more uses for Duet AI, the AI interface unveiled earlier this year. Data engineers will be singing duets with integrations for BigQuery, AlloyDB, and Cloud Spanner, while data analysts will be in ML harmony with a new Looker integration. Duet AI is also being integrated across Google’s expansive Workspace properties, providing ... Full article
Generative AI is taking the world by storm, as organizations discover the myriad ways it can be used to serve and entice customers. With today’s enhancements to Vertex AI, Google Cloud is giving its customers more GenAI capabilities to choose from. The pace of adoption of GenAI and large language models (LLMs) has been nothing ... Full article
In the world of technology, overnight sensations are rarely born overnight. They are the result of decades of research, development, and perseverance. The recent success of chatGPT, a state-of-the-art language model, is a prime example of this phenomenon. It has turned heads with its capabilities, but the road to this breakthrough was long and winding. ... Full article
OpenAI today announced the launch of ChatGPT Enterprise, a new GPT-4-based service that offers stronger security and privacy protections, support for longer inputs, and new data analysis capabilities, among other new features. OpenAI kicked off the generative AI craze in late 2022 with the launch of ChatGPT, which provided a conversational interface to GPT-3.5, the company’s biggest ... Full article
The world has grown accustomed to the presence of artificial intelligence (AI) in its daily lives. In fact, unless you’ve been asleep for much of 2023, you can see just how AI’s influence on the world is growing with the hype around generative AI. We all know AI has been used for years to recommend ... Full article
Large language models (LLMs) have dominated the data and AI conversation through the first eight months of 2023, courtesy of the whirlwind that is ChatGPT. Despite the consumer success, few companies have concrete plans to put commercial LLMs into production, with concerns about privacy and ethics leading the way. A new report released by Predibase this week ... Full article
Hugging Face announced it has raised $235 million in a Series D funding round led by Salesforce Ventures with participation from a host of tech giants including IBM, Google, Amazon, Nvidia, Intel, AMD, and Qualcomm, as well as Ashton Kutcher’s Sound Ventures. The company is now valued at $4.5 billion. Hugging Face is a developer ... Full article
VMware kicked off its Explore event in Las Vegas with a series of announcements geared toward enabling enterprise generative AI development. VMware and Nvidia extended their partnership to unveil VMware Private AI Foundation with Nvidia, an offering that promises to provide enterprises with the software and compute to fine-tune large language models and run AI-enabled ... Full article
The GPU Squeeze continues to place a premium on Nvidia H100 GPUs. In a recent Financial Times article, Nvidia reports that it expects to ship 550,000 of its latest H100 GPUs worldwide in 2023. The appetite for GPUs is obviously coming from the generative AI boom, but the HPC market is also competing for these accelerators. It ... Full article
According to a 2023 business survey, 62 percent of enterprises have fully implemented artificial intelligence (AI) for cybersecurity or are exploring additional uses for the technology. With advancements in AI technologies, however, come more ways for sensitive information to be misused. Globally, organizations are leveraging AI and implementing automated security measures into their infrastructure to ... Full article
AMD has released findings from a new survey of global IT leaders suggesting some are finding it challenging to keep up during the recent AI boom: Close to half (46%) of respondents say their organizations are not ready to implement AI, and just 19% say they will prioritize AI within the next year. The report ... Full article
The launch of ChatGPT and similar generative AI technologies is reshaping the skills required in the workplace, according to a new report from LinkedIn. “The Future of Work Report: AI at Work” found the pace at which LinkedIn members added AI skills to their profile has nearly doubled since ChatGPT’s debut in November 2022, rising from 7.7% ... Full article
The default method for accelerating Deep Learning projects is increasing the size of a GPU cluster. However, the cost is increasingly prohibitive. According to Andreessen Horowitz, many companies investing in AI ‘spend more than 80% of their total capital raised on compute resources,’ and rightly so. GPUs are the cornerstone of AI infrastructure and as much ... Full article
The HPC user's dream is to keep stuffing GPUs into a rack mount box and make everything go faster. There are some servers that offer up to eight GPUs, but the standard server usually offers four GPU slots. Fair enough, using four modern GPUs offers a significant amount of HPC heft, but can we go ... Full article
AIwire