EnterpriseAI Articles, Trends & News

Nvidia Releasing Open-Source Optimized Tensor RT-LLM Runtime with Commercial Foundational AI Models to Follow Later This Year

Nvidia's large-language models will become generally available later this year, the company confirmed. Organizations widely rely on Nvidia's graphics processors to write AI applications. The company has also created proprietary pre-trained models similar to OpenAI's GPT-4 and Google's PaLM-2. Customers can use their own corpus of data, embed it in Nvidia's pre-trained large language models, ... Full article

MLPerf Releases Latest Inference Results and New Storage Benchmark

MLCommons this week issued the results of its latest MLPerf Inference (v3.1) benchmark exercise. Nvidia was again the top performing accelerator, but Intel (Xeon CPU) and Habana (Gaudi1 and 2) performed well. Google provided a peak at its new TPU (v5e) performance. MLCommons also debuted a new MLPerf Storage (v0.5) benchmark intended to measure storage ... Full article

GenAI Adoption, By the Numbers

As the freight train that is generative AI continues barreling down the track to an uncertain destination, we thought it would be good to take some time to stop and ponder where we’re currently at in terms of GenAI adoption. It’s been quite a ride since the launch of ChatGPT in late November 2022 ignited ... Full article

New Language Mojo Seeks End to AI Framework Sprawl

A software development kit for Mojo, a new Python-based language for AI development created by former Google engineers, is now available for download on Linux systems, with support for Mac and Windows coming soon, the company behind Mojo announced today. Mojo is Pythonic language designed to help AI developers get the most performance out of ... Full article

Google TPU v5e AI Chip Debuts after Controversial Origins

The dominance of Nvidia GPUs has companies scrambling to find non-GPU alternatives, and another mainstream option has emerged with Google’s TPU v5e AI chip. The TPU v5e is also Google’s first AI chip being mainstreamed with a suite of software and tools for large-scale orchestration of AI workloads in virtual environments. The AI chip is ... Full article

Cerebras and G42’s Inception Unveil Jais: A 13B Parameter Arabic LLM Trained on Condor Galaxy

Condor Galaxy is an AI system recently debuted by Cerebras Systems and Middle Eastern cloud provider G42. The system has already been busy with training Jais, a 13-billion parameter Arabic large language model trained on a 395-billion-word Arabic and English dataset. Named after Jebel Jais, the UAE’s highest mountain, the Jais LLM is a collaboration ... Full article

Duet AI Goes Everywhere in Google’s Cloud

Google Cloud introduced several more uses for Duet AI, the AI interface unveiled earlier this year. Data engineers will be singing duets with integrations for BigQuery, AlloyDB, and Cloud Spanner, while data analysts will be in ML harmony with a new Looker integration. Duet AI is also being integrated across Google’s expansive Workspace properties, providing ... Full article

Google Extends Vertex with More GenAI Features

Generative AI is taking the world by storm, as organizations discover the myriad ways it can be used to serve and entice customers. With today’s enhancements to Vertex AI, Google Cloud is giving its customers more GenAI capabilities to choose from. The pace of adoption of GenAI and large language models (LLMs) has been nothing ... Full article

An Overnight Sensation, 30 Years in the Making: The Parallel Paths of AI and Quantum Computing

In the world of technology, overnight sensations are rarely born overnight. They are the result of decades of research, development, and perseverance. The recent success of chatGPT, a state-of-the-art language model, is a prime example of this phenomenon. It has turned heads with its capabilities, but the road to this breakthrough was long and winding. ... Full article

OpenAI Launches ChatGPT Enterprise

OpenAI today announced the launch of ChatGPT Enterprise, a new GPT-4-based service that offers stronger security and privacy protections, support for longer inputs, and new data analysis capabilities, among other new features. OpenAI kicked off the generative AI craze in late 2022 with the launch of ChatGPT, which provided a conversational interface to GPT-3.5, the company’s biggest ... Full article

Why Trusting AI is All a Matter of the Right Data at the Right Time

The world has grown accustomed to the presence of artificial intelligence (AI) in its daily lives. In fact, unless you’ve been asleep for much of 2023, you can see just how AI’s influence on the world is growing with the hype around generative AI. We all know AI has been used for years to recommend ... Full article

Privacy and Ethical Hurdles to LLM Adoption Grow

Large language models (LLMs) have dominated the data and AI conversation through the first eight months of 2023, courtesy of the whirlwind that is ChatGPT. Despite the consumer success, few companies have concrete plans to put commercial LLMs into production, with concerns about privacy and ethics leading the way. A new report released by Predibase this week ... Full article

‘Every Tech Company Will Be an AI Company’: Hugging Face’s $235M Vote of Confidence

Hugging Face announced it has raised $235 million in a Series D funding round led by Salesforce Ventures with participation from a host of tech giants including IBM, Google, Amazon, Nvidia, Intel, AMD, and Qualcomm, as well as Ashton Kutcher’s Sound Ventures. The company is now valued at $4.5 billion. Hugging Face is a developer ... Full article

VMware Unveils New Generative AI Tools, Expands Nvidia Partnership

VMware kicked off its Explore event in Las Vegas with a series of announcements geared toward enabling enterprise generative AI development. VMware and Nvidia extended their partnership to unveil VMware Private AI Foundation with Nvidia, an offering that promises to provide enterprises with the software and compute to fine-tune large language models and run AI-enabled ... Full article

Nvidia H100: Are 550,000 GPUs Enough for This Year?

The GPU Squeeze continues to place a premium on Nvidia H100 GPUs. In a recent Financial Times article, Nvidia reports that it expects to ship 550,000 of its latest H100 GPUs worldwide in 2023. The appetite for GPUs is obviously coming from the generative AI boom, but the HPC market is also competing for these accelerators. It ... Full article

Cybersecurity’s Rising Significance in the World of Artificial Intelligence

According to a 2023 business survey, 62 percent of enterprises have fully implemented artificial intelligence (AI) for cybersecurity or are exploring additional uses for the technology. With advancements in AI technologies, however, come more ways for sensitive information to be misused. Globally, organizations are leveraging AI and implementing automated security measures into their infrastructure to ... Full article

AI’s Unstoppable Momentum Leaves Some Enterprise IT Teams Scrambling: AMD

AMD has released findings from a new survey of global IT leaders suggesting some are finding it challenging to keep up during the recent AI boom: Close to half (46%) of respondents say their organizations are not ready to implement AI, and just 19% say they will prioritize AI within the next year. The report ... Full article

Navigating the AI Skills Revolution in the Age of GenAI: LinkedIn Report

The launch of ChatGPT and similar generative AI technologies is reshaping the skills required in the workplace, according to a new report from LinkedIn. “The Future of Work Report: AI at Work” found the pace at which LinkedIn members added AI skills to their profile has nearly doubled since ChatGPT’s debut in November 2022, rising from 7.7% ... Full article

Maximizing GPU Performance Amidst Today’s Increasing Shortages

The default method for accelerating Deep Learning projects is increasing the size of a GPU cluster. However, the cost is increasingly prohibitive. According to Andreessen Horowitz, many companies investing in AI ‘spend more than 80% of their total capital raised on compute resources,’ and rightly so. GPUs are the cornerstone of AI infrastructure and as much ... Full article

GigaIO’s New SuperNode Takes-off with Record Breaking AMD GPU Performance

The HPC user's dream is to keep stuffing GPUs into a rack mount box and make everything go faster. There are some servers that offer up to eight GPUs, but the standard server usually offers four GPU slots. Fair enough, using four modern GPUs offers a significant amount of HPC heft, but can we go ... Full article

Happening Now

Recent News

Contributors