Oracle and NVIDIA Collaborate to Help Enterprises Accelerate Agentic AI Inference
AUSTIN, Texas, and SAN JOSE, Calif., March 19, 2025 -- Oracle and NVIDIA have announced a first-of-its-kind integration of NVIDIA accelerated computing and inference software with Oracle’s AI infrastructure and generative AI services to help organizations globally speed the creation of agentic AI applications.
The new integration between Oracle Cloud Infrastructure (OCI) and the NVIDIA AI Enterprise software platform will make 160+ AI tools and 100+ NVIDIA NIM microservices natively available through the OCI Console. In addition, Oracle and NVIDIA are collaborating on the no-code deployment of both Oracle and NVIDIA AI Blueprints and on accelerating AI vector search in Oracle Database 23ai with the NVIDIA cuVS library.
“Oracle has become the platform of choice for both AI training and inferencing, and this partnership enhances our ability to help customers achieve greater innovation and business results,” said Safra Catz, CEO, Oracle. “NVIDIA’s offerings, paired with OCI’s flexibility, scalability, performance, and security, will speed AI adoption and help customers get more value from their data.”
“Oracle and NVIDIA are perfect partners for the age of reasoning — an AI and accelerated computing company working with a key player in processing much of the world’s enterprise data,” said Jensen Huang, co-founder and CEO, NVIDIA. “Together, we help enterprises innovate with agentic AI to deliver amazing things for their customers and partners.”
Purpose-Built Solutions to Meet Enterprise AI Needs
To reduce the time it takes to deploy reasoning models, NVIDIA AI Enterprise will be natively available through the OCI Console, giving customers quick and easy access to AI tools, including NVIDIA NIM, a set of 100+ optimized, cloud-native inference microservices for leading AI models such as the latest NVIDIA Llama Nemotron models for advanced AI reasoning. NVIDIA AI Enterprise will be available as a deployment image for OCI bare-metal instances and Kubernetes clusters using OCI Kubernetes Engine. OCI Console customers will benefit from direct billing and customer support through Oracle.
Organizations can deploy OCI’s 150+ AI and cloud services with NVIDIA accelerated computing and NVIDIA AI Enterprise in their data center, the public cloud, or at the edge. This offering provides an integrated AI stack to help address data privacy, sovereign AI, and low-latency requirements.
AI Deployment at Scale with Tailored Blueprints
OCI AI Blueprints provide no-code deployment recipes that enable customers to quickly run AI workloads without having to make decisions about the software stack or manually provision the infrastructure. The blueprints offer clear hardware recommendations for NVIDIA GPUs, NIM microservices, and prepackaged observability tools, helping enterprises cut the time to deploy their AI projects from weeks to minutes.
In addition, NVIDIA Blueprints provide developers with a unified experience across the NVIDIA stack by providing reference workflows for enterprise AI use cases. Using NVIDIA Blueprints, organizations can build and operationalize custom AI applications with NVIDIA AI Enterprise and NVIDIA Omniverse software, application programming interfaces, and microservices. For example, developers can begin with an NVIDIA AI Blueprint for a customer service AI assistant and customize it for their own use.
To simplify the development, deployment, and scale-out of advanced physical AI and simulation applications and workflows, the NVIDIA Omniverse and NVIDIA Isaac Sim development workstations and Omniverse Kit App Streaming are expected to be available on Oracle Cloud Infrastructure Marketplace later this year, preconfigured with bare-metal compute instances accelerated by NVIDIA L40S GPUs.
Real-Time AI Inference with NVIDIA NIM in OCI Data Science
To further accelerate enterprise AI adoption and help enable quick AI deployments with minimal setup, data scientists can access pre-optimized NVIDIA NIM microservices directly in OCI Data Science. This supports real-time AI inference use cases without the complexity of managing infrastructure.
To help maintain data security and compliance, the models run in the customer’s OCI tenancy. Customers can purchase the models through a flexible, pay-as-you-go, hourly pricing model or apply their Oracle Universal Credits.
Organizations can use this integration to deploy inference endpoints with preconfigured, optimized NIM inference engines in minutes, rapidly accelerating time-to-value for use cases such as AI-powered assistants, real-time recommendation engines, and copilots. In addition, this allows customers to start using the integration for smaller workloads and seamlessly scale to enterprise-wide deployments.
NVIDIA Accelerated Computing Platform Turbocharges AI Vector Search in Oracle Database 23ai
Oracle and NVIDIA are working together to accelerate the creation of vector embeddings and vector indexes — compute-intensive portions of AI Vector Search workloads in Oracle Database 23ai — using NVIDIA GPUs and NVIDIA cuVS.
Organizations can enable vector embedding through bulk vectorization of large volumes of input data such as text, images, and videos, as well as the fast creation and maintenance of vector indexes. With NVIDIA-accelerated AI Vector Search, Oracle Database customers can significantly improve the performance of their AI pipelines to help support high-volume AI vector workloads.
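For readers unfamiliar with vector search, the short Python sketch below shows the core operation in its simplest form: ranking stored embeddings by cosine similarity to a query embedding. It is an illustration only and does not use the Oracle Database 23ai or NVIDIA cuVS APIs; the function names, clip identifiers, and three-dimensional vectors are hypothetical. Real workloads involve far larger, higher-dimensional vector sets, which is why embedding generation and index creation are compute-intensive.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine similarity of two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as similar to nothing
    return dot / (norm_a * norm_b)


def top_k(query, vectors, k=2):
    """Brute-force search: score every stored vector, return the k best keys."""
    scored = [(cosine_similarity(query, vec), key) for key, vec in vectors.items()]
    scored.sort(reverse=True)  # highest similarity first
    return [key for _, key in scored[:k]]


# Hypothetical toy embeddings; a production system would store millions of
# high-dimensional vectors and use an index instead of a full scan.
EMBEDDINGS = {
    "clip_a": [1.0, 0.0, 0.0],
    "clip_b": [0.9, 0.1, 0.0],
    "clip_c": [0.0, 1.0, 0.0],
}

nearest = top_k([1.0, 0.0, 0.0], EMBEDDINGS, k=2)
```

This full scan compares the query against every stored vector, so its cost grows with the size of the data set. A vector index avoids that full scan at query time, at the price of the index-building work that the collaboration aims to accelerate on GPUs.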
DeweyVision provides advanced computer vision and artificial intelligence capabilities to turn media into data, making it accessible, searchable, discoverable, retrievable, and actionable. DeweyVision uses Oracle Database 23ai on Oracle Autonomous Database for its AI-powered, no-code warehousing tools. These tools enable production professionals to connect their workflows and edit video footage quickly by cataloging footage in minutes and providing intuitive search capabilities.
“Oracle Database 23ai with AI Vector Search can significantly increase Dewey’s search performance while increasing the scalability of the DeweyVision platform,” said Majid Bemanian, CEO, DeweyVision. “Using NVIDIA GPUs to create the vector embeddings that we load into Oracle Database accelerates the speed at which we can ingest new data, while Autonomous Database and the converged capabilities of Oracle Database 23ai will help reduce our operational costs as we grow and open new opportunities. We believe that the combination of DeweyVision, Oracle Database 23ai, and NVIDIA GPUs running in OCI will help us achieve our goal of becoming Hollywood’s data warehouse.”
NVIDIA Blackwell on OCI Enables AI Anywhere
Oracle and NVIDIA continue to evolve AI infrastructure with new NVIDIA GPU types across OCI’s public regions, Government Clouds, sovereign clouds, OCI Dedicated Region, Oracle Alloy, OCI Compute Cloud@Customer, and OCI Roving Edge Devices.
This includes NVIDIA Quantum-2 InfiniBand cluster network environments, NVIDIA Spectrum Ethernet switches, and optimized NVIDIA NVLink and NVLink Switch functionality for some of the largest AI superclusters in the market. In addition, OCI will offer NVIDIA GB200 NVL72 systems on OCI Supercluster — generally available soon with up to 131,072 NVIDIA GPUs — and is taking orders for one of the largest AI supercomputers in the cloud with NVIDIA Blackwell Ultra GPUs.
OCI will be among the first cloud service providers to offer the next generation of the NVIDIA Blackwell accelerated computing platform. Built on the Blackwell architecture introduced a year ago, Blackwell Ultra includes the NVIDIA GB300 NVL72 rack-scale solution and the NVIDIA HGX B300 NVL16 system. The GB300 NVL72 delivers 1.5x more AI performance than the NVIDIA GB200 NVL72.
About Oracle
Oracle offers integrated suites of applications plus secure, autonomous infrastructure in the Oracle Cloud. For more information about Oracle (NYSE: ORCL), please visit us at oracle.com.
Source: Oracle