Nvidia GB300 NVL72
Nvidia GB300 NVL72
Designed for AI Reasoning Performance
The NVIDIA GB300 NVL72 features a fully liquid-cooled, rack-scale design that unifies 72 NVIDIA Blackwell Ultra GPUs and 36 Arm®-based NVIDIA Grace™ CPUs in a single platform optimized for test-time scaling inference. AI factories powered with the GB300 NVL72 using NVIDIA Quantum-X800 InfiniBand or Spectrum™-X Ethernet paired with ConnectX®-8 SuperNICS provide a 50x higher output for reasoning model inference compared to the NVIDIA Hopper™ platform.
| NVIDIA GB300 NVL72 Specifications | |
|---|---|
| Configuration | 72 NVIDIA Blackwell Ultra GPUs, 36 NVIDIA Grace CPUs |
| NVLink Bandwidth | 130 TB/s |
| Fast Memory | Up to 40 TB |
| GPU Memory | Bandwidth | Up to 21 TB | Up to 576 TB/s |
| CPU Memory | Bandwidth | Up to 18 TB SOCAMM with LPDDR5X | Up to 14.3 TB/s |
| CPU Core Count | 2,592 Arm Neoverse V2 cores |
| FP4 Tensor Core | 1,400 | 1,100² PFLOPS |
| FP8/FP6 Tensor Core | 720 PFLOPS |
| INT8 Tensor Core | 23 PFLOPS |
| FP16/BF16 Tensor Core | 360 PFLOPS |
| TF32 Tensor Core | 180 PFLOPS |
| FP32 | 6 PFLOPS |
| FP64 / FP64 Tensor Core | 100 TFLOPS |