NVIDIA GB300 NVL72


Designed for AI Reasoning Performance

The NVIDIA GB300 NVL72 features a fully liquid-cooled, rack-scale design that unifies 72 NVIDIA Blackwell Ultra GPUs and 36 Arm®-based NVIDIA Grace™ CPUs in a single platform optimized for test-time scaling inference. AI factories powered by the GB300 NVL72, using NVIDIA Quantum-X800 InfiniBand or Spectrum™-X Ethernet paired with ConnectX®-8 SuperNICs, provide 50x higher output for reasoning model inference compared to the NVIDIA Hopper™ platform.

NVIDIA GB300 NVL72 Specifications
Configuration: 72 NVIDIA Blackwell Ultra GPUs, 36 NVIDIA Grace CPUs
NVLink Bandwidth: 130 TB/s
Fast Memory: Up to 40 TB
GPU Memory | Bandwidth: Up to 21 TB | Up to 576 TB/s
CPU Memory | Bandwidth: Up to 18 TB SOCAMM with LPDDR5X | Up to 14.3 TB/s
CPU Core Count: 2,592 Arm Neoverse V2 cores
FP4 Tensor Core: 1,400 | 1,100² PFLOPS
FP8/FP6 Tensor Core: 720 PFLOPS
INT8 Tensor Core: 23 PFLOPS
FP16/BF16 Tensor Core: 360 PFLOPS
TF32 Tensor Core: 180 PFLOPS
FP32: 6 PFLOPS
FP64 / FP64 Tensor Core: 100 TFLOPS
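
The specifications above are rack-level totals. As a rough illustration, the Python sketch below divides a few of them by the device counts to estimate per-GPU and per-CPU figures. It assumes resources are split evenly across devices and that 1 TB = 1,000 GB. The dictionary keys and the per_device helper are hypothetical names introduced for this example. Because the table lists rounded "up to" totals, the results are approximations, not official per-device specifications.

```python
# Rack-level totals copied from the specification table above.
# Key names are illustrative, not an official schema.
RACK = {
    "gpus": 72,
    "cpus": 36,
    "nvlink_tb_s": 130,        # NVLink bandwidth, TB/s
    "gpu_mem_tb": 21,          # GPU memory, "up to" value, TB
    "gpu_mem_bw_tb_s": 576,    # GPU memory bandwidth, "up to" value, TB/s
    "cpu_mem_tb": 18,          # CPU memory, "up to" value, TB
    "cpu_cores": 2592,         # Arm Neoverse V2 cores
}


def per_device(rack=RACK):
    """Divide rack totals evenly across devices (assumption: uniform split)."""
    gpus, cpus = rack["gpus"], rack["cpus"]
    return {
        "nvlink_tb_s_per_gpu": rack["nvlink_tb_s"] / gpus,
        "gpu_mem_gb_per_gpu": rack["gpu_mem_tb"] * 1000 / gpus,  # 1 TB = 1000 GB assumed
        "gpu_mem_bw_tb_s_per_gpu": rack["gpu_mem_bw_tb_s"] / gpus,
        "cpu_mem_gb_per_cpu": rack["cpu_mem_tb"] * 1000 / cpus,
        "cpu_cores_per_cpu": rack["cpu_cores"] // cpus,
    }


if __name__ == "__main__":
    for name, value in per_device().items():
        print(f"{name}: {value:.2f}")
```

Under these assumptions, the totals work out to roughly 1.8 TB/s of NVLink bandwidth and 8 TB/s of memory bandwidth per GPU, and 72 cores per Grace CPU.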