NVIDIA® GB200 NVL72

Introducing the cutting-edge NVIDIA GB200 compute tray, leveraging the full potential of NVIDIA’s innovative MGX design.

The world’s most powerful GPU
NVIDIA® GB200 GPUs AVAILABLE SOON

GB200 NVL72 is a rack-scale, liquid-cooled solution connecting 36 Grace CPUs and 72 Blackwell GPUs, enabling a single 72-GPU NVLink domain that delivers 30X faster real-time trillion-parameter LLM inference.

With GB200 SXM you get:

  • Real-Time Inference for Trillion-Parameter LLMs
  • Massive LLM Training at High Speed

Top 4 Use Cases

Groundbreaking Blackwell Architecture

NVIDIA Blackwell architecture sets a new benchmark for accelerated computing with unparalleled performance, efficiency, and scalability, featuring 208 billion transistors on a custom TSMC 4NP process.

Breakthrough CPU Performance

NVIDIA Grace CPU revolutionizes data centre computing with outstanding performance and memory bandwidth, offering 2X energy efficiency and unprecedented speed for AI, cloud, and HPC applications.

Seamless Interconnectivity

The fifth-generation NVIDIA NVLink unlocks exascale computing and trillion-parameter AI models, enabling swift and seamless communication between every GPU in your server cluster for accelerated performance.

High-Performance Networking

NVIDIA Quantum-X800 InfiniBand, NVIDIA Spectrum X800 Ethernet, and NVIDIA BlueField®-3 DPUs enable efficient scalability across hundreds and thousands of Blackwell GPUs for optimal application performance.

Tech Specs

Form Factor GB200 NVL72
Configuration 36 Grace CPU : 72 Blackwell GPUs
FP4 Tensor Core 1,440 PFLOPS
FP8/FP6 Tensor Core 720 PFLOPS
INT8 Tensor Core 720 POPS
FP16/BF16 Tensor Core 360 PFLOPS
TF32 Tensor Core 180 PFLOPS
FP32 6,480 TFLOPS
FP64 3,240 TFLOPS
FP64 Tensor Core 3,240 TFLOPS
GPU Memory | Bandwidth Up to 13.5 TB HBM3e | 576 TB/s
NVLink Bandwidth 130TB/s
CPU Core Count 2,592 Arm® Neoverse V2 cores
CPU Memory | Bandwidth Up to 17 TB LPDDR5X | Up to 18.4 TB/s