GB200 NVL72
Ultra-Large AI·HPC GPU Cluster
Designed for AI inference performance and efficiency

Designed for AI inference performance and efficiency
| GB200 NVL72 | GB200 Grace Blackwell Superchip | |
|---|---|---|
| Configuration | 36 Grace CPU : 72 Blackwell GPUs | 1 Grace CPU : 2 Blackwell GPU |
| FP4 Core² | 1,440 PFLOPS | 40 PFLOPS |
| FP8/FP6 Core² | 720 PFLOPS | 20 PFLOPS |
| INT8 Tensor Core² | 720 POPS | 20 POPS |
| FP16/BF16 Tensor Core² | 360 PFLOPS | 10 PFLOPS |
| TF32 Tensor Core² | 180 PFLOPS | 5 PFLOPS |
| FP32 | 6,480 TFLOPS | 180 TFLOPS |
| FP64/FP64 Tensor Core | 3,240 TFLOPS | 90 TFLOPS |
| GPU memory bandwidth | Max 13.5TB HBM3e | 576TB/s | Max 384GB HBM3e | 16TB/s |
| NVLink memory bandwidth | 130TB/s | 3.6TB/s |
| CPU Core | 2592 Arm® Neoverse V2 Core | 72 Arm® Neoverse V2 Core |
| CPU memory Bandwith | Max 17TB LPDDR5X | Max 18.4TB/s | Max 480GB LPDDR5X | Max 512GB/s |