Oracle and NVIDIA have announced an expanded collaboration to deliver sovereign artificial intelligence (AI) solutions to customers worldwide. Oracle’s distributed cloud, AI infrastructure, and generative AI services, combined with NVIDIA’s accelerated computing and generative AI software, are enabling governments and businesses to deploy AI factories.
These AI factories can run services locally in the cloud and within a country or organization’s secure facilities with a variety of operational controls, supporting sovereign goals of diversifying and driving economic growth.
“As AI reshapes businesses, industry, and policies worldwide, countries and organizations need to strengthen their digital sovereignty to protect their most valuable data,” said Safra Catz, CEO of Oracle. “Our ongoing collaboration with NVIDIA and our unique ability to deploy cloud regions quickly and locally will ensure societies can harness AI without compromising their security.”
Jensen Huang, founder and CEO of NVIDIA, noted that “in an era where innovation will be driven by generative AI, data sovereignty is a cultural and economic imperative.” “Oracle’s integrated cloud applications and infrastructure, combined with NVIDIA’s accelerated computing and generative AI services, provide the flexibility and security that nations and regions require to control their own destiny,” he added.
The combination of NVIDIA’s full-stack AI platform with Oracle’s enterprise AI, deployable in OCI Dedicated Region, Oracle Alloy, Oracle EU Sovereign Cloud, and Oracle Government Cloud, offers customers a cutting-edge AI solution that provides greater control over operations, location, and security to help support digital sovereignty.
Countries worldwide are increasingly investing in AI infrastructure that can support their cultural and economic ambitions. Through 66 cloud regions in 26 countries, customers can access over 100 cloud and AI services spanning infrastructure and applications to support IT migration, modernization, and innovation.
The combined offerings from the companies can be deployed through public cloud or in a customer’s data center at specific locations, with flexible operational controls. Oracle is the only hyperscaler capable of offering full AI and cloud services locally, anywhere. OCI services and pricing are consistent across all deployment types to simplify planning, portability, and management.
Oracle’s cloud services leverage a variety of NVIDIA stack, including NVIDIA’s accelerated computing infrastructure and NVIDIA AI Enterprise software platform, including the recently announced NVIDIA NIM inference microservices, based on NVIDIA inference software such as NVIDIA TensorRT, NVIDIA TensorRT-LLM, and NVIDIA Triton Inference Server.
Several pioneering companies in sovereign AI are already leveraging these solutions. Avaloq, a leader in wealth management technology, selected OCI Dedicated Region to bring a full OCI cloud region to its own data center. TEAM IM, a leading New Zealand information management service provider, chose Oracle Alloy to build the country’s first hyper-scalable cloud owned and operated locally, known as TEAM Cloud. And e& UAE, the telecommunications arm of the e& Group, is collaborating with Oracle to enhance its AI capabilities and plans to deploy NVIDIA H100 Tensor Core GPU clusters within its OCI Dedicated Region.
To help customers address the growing needs of AI models, Oracle plans to leverage the latest NVIDIA Grace Blackwell computing platform, announced today at GTC, in OCI Supercluster and OCI Compute. OCI Supercluster will become significantly faster with new OCI Compute bare metal instances, ultra-low latency RDMA networking, and high-performance storage. OCI Compute will adopt both the NVIDIA GB200 Grace Blackwell Superchip and the NVIDIA Blackwell B200 Tensor Core GPU.
In addition, NVIDIA NIM microservices and CUDA-X, including NVIDIA NeMo Retriever for augmented generation retrieval (RAG) inference deployments, will also help OCI customers provide more insights and accuracy to their generative AI copilots and other productivity tools using their own data.
To meet customers’ increasing demands for more complex AI models, the companies are adding NVIDIA Grace Blackwell to NVIDIA DGX Cloud in OCI. Customers will be able to access new instances based on GB200 NVL72 via this supercomputing service co-designed for energy-efficient training and inference in a era of trillion-parameter LLMs.
The complete DGX Cloud cluster build will include over 20,000 GB200 accelerators and NVIDIA CX8 InfiniBand networking, providing highly scalable and high-performance cloud infrastructure. The cluster will consist of 72 GPU Blackwell NVL72 and 36 CPU Grace with fifth-generation NVLink.
In summary, the expanded collaboration between Oracle and NVIDIA aims to deliver cutting-edge sovereign AI solutions to customers worldwide, combining the strengths of both companies in cloud, infrastructure, generative AI services, and accelerated computing. This will enable governments and businesses to harness the potential of AI while respecting their digital sovereignty goals.