Technology

NVIDIA Unveils Generative AI Microservices for Enhanced Developer Experiences and Wide Deployment

Published March 18, 2024

NVIDIA, a leader in accelerated computing, has introduced a suite of enterprise-grade generative AI microservices, allowing businesses to easily build and launch custom applications. These new offerings are part of NVIDIA's strategy to empower developers to innovate and deploy AI copilots seamlessly across a wide range of platforms.

Expansive Reach of NVIDIA's AI Ecosystem

The launch features a new catalog of GPU-accelerated NVIDIA NIM microservices and cloud endpoints designed for pretrained AI models. These services are finely tuned to operate efficiently on the vast landscape of CUDA-enabled GPUs found in various settings, from cloud environments to individual PCs. The goal is to offer a standardized and straightforward pathway to run custom AI solutions at scale.

NIM and CUDA-X Microservices

NIM microservices offer prebuilt containers which drastically reduce deployment times from weeks to mere minutes. They also include industry-standard APIs to facilitate rapid development across numerous domains such as natural language processing, drug discovery, and speech recognition. For more horizontal needs, NVIDIA offers CUDA-X microservices. These provide the foundational tools necessary for AI development tasks such as data preprocessing, model customization, and training accelerations.

Adoption by Leading Enterprises

Several high-profile application platform providers, including Adobe, Cadence, CrowdStrike, SAP, and ServiceNow, are among the first to leverage NVIDIA's generative AI microservices. This broad adoption highlights the significant impact NVIDIA's technologies are having across various industries.

Flexible Deployment Options and Access

With NVIDIA AI Enterprise 5.0, customers can deploy these microservices across a range of infrastructures, including on-premises and in the cloud, on systems certified by NVIDIA. Developers interested in experimenting with NVIDIA's offerings can do so at no cost, and those seeking production-grade solutions can find them readily available on certified platforms and leading cloud services.

NVIDIA, AI, Microservices