Compare Cloud Provider Prices
The lowest GH200 listing is 69% cheaper than the highest: you can pay up to $6.50/hr, but the market floor is currently $1.99/hr per GPU (on-demand).
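As a rough check on the 69% figure, the sketch below recomputes the spread from the four on-demand prices listed on this page. Measuring the gap relative to the highest price is an assumption; the page does not state which formula it uses.

```python
# On-demand $/GPU/hr for the four GH200 listings shown on this page.
prices = {"Vultr": 1.99, "Lambda Labs": 2.29, "Sesterce": 2.52, "CoreWeave": 6.50}

def price_spread(prices):
    """Return (low, high, spread), where spread = (high - low) / high.

    Dividing by the highest price is an assumption about how the
    page derives its percentage.
    """
    low = min(prices.values())
    high = max(prices.values())
    return low, high, (high - low) / high

low, high, spread = price_spread(prices)
print(f"${low:.2f} to ${high:.2f}: lowest is {spread:.0%} below the highest")
```

Measured against the lowest price instead, the same gap would be far larger than 69%, so the choice of baseline matters when comparing spreads across GPUs.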
How we sort results
Default ranking
Our algorithm weighs five factors to find the most relevant matches for you:
- Price: We blend total hourly price with price-per-GPU to balance affordability and value.
- Specs: We favor offers with higher CPU, RAM, and GPU Memory.
- Billing: We favor on-demand billing for simplicity and flexibility over spot instances, reservations, and custom quotes.
- Location: We blend both datacenter proximity and provider HQ location. Datacenter location matters for latency, while HQ location matters for compliance and support.
- Provider diversity: Each time a provider appears in the list, their subsequent offerings are ranked a little lower, so one provider's offerings don't crowd out the top positions.
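To make the provider-diversity rule concrete, here is a minimal sketch of a greedy ranking that lowers a provider's remaining offers each time one of its offers is placed. The `Offer` fields, the scoring formula, and the `DIVERSITY_PENALTY` value are all hypothetical; the actual weights are not published, and the location factor is left out for brevity.

```python
from dataclasses import dataclass

@dataclass
class Offer:
    provider: str
    price_per_gpu: float  # $/GPU/hr
    vcpus: int
    ram_gb: int
    on_demand: bool

# Hypothetical penalty applied per earlier appearance of the same provider.
DIVERSITY_PENALTY = 0.1

def base_score(o: Offer) -> float:
    """Higher is better: cheaper, larger, on-demand offers score higher.

    The weights here are illustrative placeholders, not the site's values.
    """
    price = 1.0 / o.price_per_gpu
    specs = o.vcpus / 100 + o.ram_gb / 1000
    billing = 0.2 if o.on_demand else 0.0
    return price + specs + billing

def rank(offers):
    """Pick the best remaining offer repeatedly, penalizing repeat providers."""
    remaining = list(offers)
    seen = {}  # provider -> number of offers already placed
    ranked = []
    while remaining:
        best = max(
            remaining,
            key=lambda o: base_score(o) - DIVERSITY_PENALTY * seen.get(o.provider, 0),
        )
        ranked.append(best)
        remaining.remove(best)
        seen[best.provider] = seen.get(best.provider, 0) + 1
    return ranked

# Two offers from hypothetical provider "A" and one from "B", identical specs.
offers = [
    Offer("A", 2.00, 64, 432, True),
    Offer("A", 2.05, 64, 432, True),
    Offer("B", 2.10, 64, 432, True),
]
order = [o.provider for o in rank(offers)]
print(order)
```

In this example the second "A" offer is slightly cheaper than "B", but the penalty moves it below "B", which is the crowding-out behavior the list describes.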
Sorting and filtering
Click any column header to sort by that column. Use the filters above the table to narrow results by billing type, GPU count, vCPUs, or RAM. Custom sorting resets the default relevance ranking.
Transparency and funding
Ads and sponsors: Any paid placements are fixed at the top and clearly labeled as sponsored content.
Affiliates: Any affiliate links will be indicated to you as well. We may earn a commission if you click them, but this never influences the ranking order.
| Provider | GPUs | Total VRAM | vCPUs | RAM | Billing | $/GPU/h | Total/h | Availability |
|---|---|---|---|---|---|---|---|---|
| Vultr | 1x GH200 96GB | 96GB | 72 | 480GB | On-Demand | $1.99 | $1.99 | Unknown |
| Lambda Labs | 1x GH200 96GB | 96GB | 64 | 432GB | On-Demand | $2.29 | $2.29 | Unknown |
| Sesterce | 1x GH200 96GB | 96GB | 64 | 432GB | On-Demand | $2.52 | $2.52 | Available |
| CoreWeave | 1x GH200 96GB NVL | 96GB | 72 | 480GB | On-Demand | $6.50 | $6.50 | Available |

On-Demand: pay-as-you-go pricing with no term commitments.
Heads up: We do our best to keep these prices accurate. However, cloud costs may fluctuate based on region, usage, and other factors not listed here. These are estimates based on common setups and are for informational purposes only. Always verify current rates with the provider before provisioning.
Frequently Asked Questions
Why choose the GH200?
The Grace Hopper superchip combines an Arm CPU with an H100 GPU and either 96GB of HBM3 or 144GB of HBM3e. The CPU and GPU share a unified memory architecture over a 900 GB/s NVLink-C2C link, which makes the GH200 efficient for AI workloads that benefit from tight CPU-GPU integration.
When is the GH200 not a good fit?
The Arm CPU ecosystem may cause compatibility issues with some software stacks. For standard GPU cloud workloads, a regular H100 is simpler to deploy and more widely available.
Are GH200 prices going up or down?
On-demand pricing has decreased by about 3% since June 2025, dropping from $3.72 to $3.59/hr per GPU.
What size AI models can the GH200 run?
With 96GB of VRAM, the GH200 is well suited to 30B-class models in FP16, and 70B-class models in 4-bit or 8-bit quantized form.
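The sizing guidance above follows from simple arithmetic on weight storage. The sketch below estimates weight memory only; it assumes 1 GB = 10^9 bytes and ignores KV cache, activations, and framework overhead, all of which need additional headroom in practice.

```python
VRAM_GB = 96  # GH200 HBM capacity quoted on this page

def weight_memory_gb(params_billions: float, bits_per_param: int) -> float:
    """Approximate memory for model weights alone, in GB (1 GB = 1e9 bytes)."""
    return params_billions * bits_per_param / 8

for params, bits in [(30, 16), (70, 16), (70, 8), (70, 4)]:
    need = weight_memory_gb(params, bits)
    verdict = "fits" if need < VRAM_GB else "does not fit"
    print(f"{params}B at {bits}-bit: {need:.0f} GB of weights, {verdict} in {VRAM_GB} GB")
```

A 70B model in FP16 needs roughly 140 GB for weights alone, which is why it only fits on a single 96GB GH200 after quantization.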
How much VRAM does the GH200 have?
The GH200 has 96GB of VRAM. Multi-GPU setups increase total memory, but that memory is not automatically pooled across GPUs.
What is the GH200's memory bandwidth?
The GH200 has 4,000 GB/s of memory bandwidth. Higher bandwidth helps with faster data transfer between GPU memory and compute cores.
What data types does the GH200 support?
The GH200 supports 7 precision formats. Training: BF16, FP16, TF32, FP32. Inference: FP8, INT8. Scientific: FP64.
Does the GH200 support NVLink?
Yes. The GH200 supports NVLink with 900 GB/s of bidirectional bandwidth. This helps accelerate multi-GPU communication.
How much does the GH200 cost per hour?
GH200 pricing currently ranges from $1.99/hr to $6.50/hr per GPU, depending on the provider, instance type, and billing model.
How much does the GH200 cost per month?
At 720 hours per month, one GH200 can cost between $1,432.80 and $4,680.00 per month, depending on the provider. Reserved and spot pricing can lower that further.
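The monthly figures are the hourly rates multiplied by 720 hours (30 days of continuous use). A small sketch of that arithmetic, using `Decimal` to avoid floating-point rounding on currency; the `monthly_cost` helper is illustrative only.

```python
from decimal import Decimal

HOURS_PER_MONTH = 720  # 30 days x 24 hours, as used in the answer above

def monthly_cost(hourly_rate: str, gpus: int = 1) -> Decimal:
    """Cost of running `gpus` GPUs continuously for one 720-hour month."""
    return Decimal(hourly_rate) * HOURS_PER_MONTH * gpus

print(monthly_cost("1.99"))  # 1432.80
print(monthly_cost("6.50"))  # 4680.00
```

Actual bills will be lower if the instance does not run around the clock, since on-demand billing charges only for the hours used.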
Which cloud providers offer the GH200?
The GH200 is available from 4 cloud providers: CoreWeave, Lambda Labs, Sesterce, and Vultr. Pricing and availability vary by region and billing model.
Can I rent the GH200 in the cloud?
Yes. We currently track 4 GH200 listings across 4 cloud providers:
| Billing type | Listings | Avg $/GPU/hr |
|---|---|---|
| On-demand | 4 | $3.32/hr |
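The on-demand average can be reproduced from the four listed rates. This sketch assumes the figure is a plain arithmetic mean of the per-GPU prices; how the page rounds the result is not stated.

```python
from decimal import Decimal

# On-demand $/GPU/hr for the four listings on this page.
rates = [Decimal("1.99"), Decimal("2.29"), Decimal("2.52"), Decimal("6.50")]

average = sum(rates) / len(rates)
print(average)  # 3.325
```

The exact mean is $3.325/hr, which the summary shows as $3.32/hr.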
Technical Specifications
| Feature | GH200 | GH200 NVL2 |
|---|---|---|
| CPU core count | 72 Arm Neoverse V2 cores | 144 Arm Neoverse V2 cores |
| L1 cache | 64KB i-cache + 64KB d-cache | 64KB i-cache + 64KB d-cache |
| L2 cache | 1MB per core | 1MB per core |
| L3 cache | 114MB | 228MB |
| Base frequency / all-core single instruction, multiple data (SIMD) frequency | 3.1GHz / 3.0GHz | 3.1GHz / 3.0GHz |
| LPDDR5X size | 480GB, 120GB, 240GB | 960GB, 240GB, 480GB |
| LPDDR5X memory bandwidth | Up to 384GB/s, Up to 512GB/s | Up to 768GB/s, Up to 1024GB/s |
| PCIe links | Up to 4x PCIe x16 (Gen5) | Up to 8x PCIe x16 (Gen5) |
| FP64 | 34 teraFLOPS | 68 teraFLOPS |
| FP64 Tensor Core | 67 teraFLOPS | 134 teraFLOPS |
| FP32 | 67 teraFLOPS | 134 teraFLOPS |
| TF32 Tensor Core | 989 teraFLOPS*, 494 teraFLOPS | 1,979 teraFLOPS*, 990 teraFLOPS |
| BFLOAT16 Tensor Core | 1,979 teraFLOPS*, 990 teraFLOPS | 3,958 teraFLOPS*, 1,979 teraFLOPS |
| FP16 Tensor Core | 1,979 teraFLOPS*, 990 teraFLOPS | 3,958 teraFLOPS*, 1,979 teraFLOPS |
| FP8 Tensor Core | 3,958 teraFLOPS*, 1,979 teraFLOPS | 7,916 teraFLOPS*, 3,958 teraFLOPS |
| INT8 Tensor Core | 3,958 TOPS*, 1,979 TOPS | 7,916 TOPS*, 3,958 TOPS |
| High-bandwidth memory (HBM) size | 96GB HBM3 | 144GB HBM3e, Up to 288GB HBM3e |
| HBM memory bandwidth | Up to 4TB/s, Up to 4.9TB/s | Up to 9.8TB/s |
| NVIDIA NVLink-C2C CPU-to-GPU bandwidth | 900 GB/s | 900 GB/s |
| Power | Configurable 450 to 1000W (Memory + CPU + GPU) | Configurable 900W to 2000W (Memory + CPU + GPU) |
| Thermal solution | Air cooled or liquid cooled | Air cooled or liquid cooled |
* with sparsity
Source: official Nvidia GH200 datasheet.