Universal data center GPU powered by the NVIDIA Ada Lovelace architecture. Combines AI inference with ray-traced graphics for generative AI, video processing, and visualization workloads.
Flexible pricing options to match your workload requirements.
Pay as you go with no commitment.
NVIDIA Ada Lovelace architecture with 4th-gen Tensor Cores and 3rd-gen RT Cores.
Native FP8 support for up to 2x inference throughput on transformer models relative to FP16.
Deploy LLMs, Stable Diffusion, and other generative models efficiently.
Ray-traced rendering and real-time visualization with 3rd-gen RT Cores.
AI inference and graphics in one GPU.
Save 15% with a monthly commitment.
Maximum savings with an annual commitment.
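To make the pricing tiers above concrete, here is a minimal sketch of the arithmetic behind the 15% monthly-commitment discount. The hourly rate, the 730-hour month, and the assumption that a commitment bills every hour of the month are all hypothetical; the page states none of them. The annual discount is not published here, so it is left out.

```python
# Hypothetical on-demand price in USD per GPU-hour; the page does not list one.
ON_DEMAND_RATE = 1.00
# "Save 15% with monthly commitment" is the only discount figure given on the page.
MONTHLY_DISCOUNT = 0.15
# Assumed average number of hours in a month (8760 / 12).
HOURS_PER_MONTH = 730.0


def monthly_commit_rate(on_demand_rate: float,
                        discount: float = MONTHLY_DISCOUNT) -> float:
    """Effective hourly rate after the monthly-commitment discount."""
    return on_demand_rate * (1.0 - discount)


def break_even_hours(hours_in_month: float = HOURS_PER_MONTH,
                     discount: float = MONTHLY_DISCOUNT) -> float:
    """Hours of use per month above which a commitment is cheaper.

    Assumes the commitment bills every hour of the month at the
    discounted rate, while pay-as-you-go bills only the hours used.
    """
    return hours_in_month * (1.0 - discount)


if __name__ == "__main__":
    print(f"Committed rate: ${monthly_commit_rate(ON_DEMAND_RATE):.2f}/hour")
    print(f"Break-even usage: {break_even_hours():.1f} hours/month")
```

Under these assumptions, a commitment only pays off when the GPU is in use for roughly 85% of the month or more; below that, pay as you go is cheaper.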
Combines AI inference, graphics, and video processing in one GPU.
Hardware support for AI-powered frame generation and super sampling.
Video analytics, encoding, and AI-powered video processing.
Build and run NVIDIA Omniverse applications at scale.