Oracle Expands AI Capabilities with New Nvidia GPU Offerings

Oracle has announced a significant expansion of its partnership with Nvidia, introducing new GPU options and AI infrastructure services on Oracle Cloud Infrastructure (OCI). This move reflects the maturing artificial intelligence market and aims to provide greater flexibility for businesses of all sizes seeking to leverage AI capabilities.

New GPU Options

The centerpiece of Oracle’s announcement is the addition of Nvidia L40S GPUs to OCI’s compute offerings, along with new virtual machine options for Nvidia H100 Tensor Core GPUs. These additions expand OCI’s Nvidia GPU lineup, catering to a wide range of AI computing needs from entry-level to high-performance options.

L40S GPU: A Versatile AI Accelerator

The L40S GPU instances are positioned as versatile options for various AI workloads, including:

Inference
Training of smaller models
Graphics-intensive applications (e.g., digital twins)

Dave Salvator, director of accelerated computing products at Nvidia, described the L40S GPU as a “universal AI accelerator” capable of handling traditional AI tasks, with a focus on inference and the ability to train small models. It also offers visual capabilities for 3D rendering and video processing.

Deployment Flexibility

Oracle is offering these new GPU options in both bare metal and virtual machine configurations, providing customers with more choices in deploying AI workloads. Leo Leung, VP of OCI and Oracle Tech, emphasized the importance of bare metal offerings, stating, “With bare metal, there’s no debate. You’re going to get all the resource available for the customer. And that’s critical for that first stage of AI, where people want the maximum performance.”

OCI Supercluster: Supporting Massive AI Models

Oracle has also updated its “OCI Supercluster” service, which now supports up to 65,000 NVIDIA GPUs. This massive scale is designed for organizations training the largest AI models with hundreds of billions of parameters.Salvator highlighted the importance of scale, noting, “Scale really matters. That’s a combination of compute, as well as really capable networking. The faster you can get deployed, the faster you can get to inferencing and putting your application out there and begin getting value from it.”

Strategic Implications

Industry analysts view this expansion as a strategic move by Oracle to compete more aggressively in the AI cloud market, which is currently dominated by Amazon Web Services, Microsoft Azure, and Google Cloud. By leveraging its partnership with Nvidia, Oracle is positioning itself as a serious contender for enterprises looking to deploy large-scale AI workloads.The partnership also benefits Nvidia by providing another major cloud platform to showcase its latest GPU technologies and expand its reach in the enterprise market.

Expanding AI Access Across Business Sizes

As AI continues to transform industries, the competition among cloud providers to offer the most powerful and flexible AI infrastructure is intensifying. Oracle’s latest offerings demonstrate its commitment to staying competitive in this rapidly evolving landscape.

For businesses, these new options present opportunities to right-size their AI infrastructure investments, potentially lowering barriers to entry for smaller organizations while providing the necessary scale for the most demanding AI workloads.

Leung summarized Oracle’s approach, stating, “As a cloud provider, from our perspective, we want to serve all those types of customers,” from tech giants hosting massive models to small engineering teams working on specialized applications.

With this announcement, Oracle has made a clear statement about its AI ambitions, setting the stage for increased competition in the cloud AI market and providing businesses of all sizes with more options for leveraging AI technologies.

Prachi Kothiyal

What Is a Monorepo? Benefits for Full‑Stack Development Teams

Kashif Khan July 31, 2025 11:14 am No Comments

Modern software development often involves multiple applications, shared libraries, backend services, frontend UIs, and deployment pipelines—all maintained by a full-stack team. Managing these components across separate repositories (a “polyrepo” structure)

Illustration comparing SRE and DevOps roles, highlighting their key differences and synergy

SRE vs DevOps: What’s the Difference and How They Collaborate

Neel Vithlani July 31, 2025 9:24 am No Comments

Software teams must push code to users without breaking running services. Two professional disciplines shape that objective: DevOps and Site Reliability Engineering. Engineers weighing career paths often compare SRE vs

Spotify Backstage developer portal interface showcasing unified tools and services

Spotify’s AI Music Dilemma Raises Questions of Ethics and Trust

Prachi Kothiyal July 29, 2025 5:29 am No Comments

In recent months, Spotify has become the epicenter of a heated debate over the ethics of artificial intelligence in music. The world’s leading streaming service faced strong backlash after investigations