Nvidia GH200

Nvidia GH200

Grace Hopper superchip combining ARM CPU with H100 GPU.

Compare vs other GPUs →
Aggregating historical prices...

Key Specifications

Architecture
Grace Hopper
Memory
96GB HBM3 + 480GB LPDDR5X
Memory Bandwidth
4,000 GB/s + 512 GB/s
Release date
Q2 2023

Compare Cloud Provider Prices

There is a 69% difference between the highest and lowest listings for the GH200. You can pay up to $6.50/hr, but the market floor is currently $1.99/hr per GPU (on-demand).

of shown

Default ranking

Our algorithm weighs five factors to find the most relevant matches for you:

  1. Price: We blend total hourly price with price-per-GPU to balance affordability and value.
  2. Specs: We favor offers with higher CPU, RAM, and GPU Memory.
  3. Billing: We favor on-demand billing for simplicity and flexibility over spot instances, reservations, and custom quotes.
  4. Location: We blend both datacenter proximity and provider HQ location. Datacenter location matters for latency, while HQ location matters for compliance and support.
  5. Provider diversity: Each time a provider appears in the list, their subsequent offerings are ranked a little lower, so one provider's offerings don't crowd out the top positions.

Sorting and filtering

Click any column header to sort by that column. Use the filters above the table to narrow results by billing type, GPU count, vCPUs, or RAM. Custom sorting resets the default relevance ranking.

Transparency and funding

Ads and sponsors: Any paid placements are fixed at the top and clearly labeled as sponsored content.

Affiliates: Any affiliate links will be indicated to you as well. We may earn a commission if you click them, but this never influences the ranking order.

Provider GPUs Total VRAM vCPUs RAM Billing $/GPU/h Total/h Availability
Vultr logo Vultr 1x GH200 1x GH200 96GB 96GB 72 480GB On-Demand Pay-as-you-go pricing. No term commitments. $1.99 $1.99 Unknown View
Lambda Labs logo Lambda Labs 1x GH200 1x GH200 96GB 96GB 64 432GB On-Demand Pay-as-you-go pricing. No term commitments. $2.29 $2.29 Unknown View
Sesterce logo Sesterce 1x GH200 1x GH200 96GB 96GB 64 432GB On-Demand Pay-as-you-go pricing. No term commitments. $2.52 $2.52 Available View
CoreWeave logo CoreWeave 1x GH200 NVL 1x GH200 96GB NVL 96GB 72 480GB On-Demand Pay-as-you-go pricing. No term commitments. $6.50 $6.50 Available View
No offerings matching your filters.

Heads up: We do our best to keep these prices accurate. However, cloud costs may fluctuate based on region, usage, and other factors not listed here. These are estimates based on common setups and are for informational purposes only. Always verify current rates with the provider before provisioning.

Frequently Asked Questions

Why choose the GH200?

Grace Hopper superchip combines ARM CPU with H100 GPU and 96GB or 144GB HBM3e. Unified memory architecture with 900 GB/s NVLink. Efficient for AI workloads that benefit from tight CPU-GPU integration.

When is the GH200 not a good fit?

ARM CPU ecosystem may have compatibility issues with some software stacks. For standard GPU cloud workloads, a regular H100 is simpler to deploy and more widely available.

Are GH200 prices going up or down?

On-demand pricing has decreased by about 3% since June 2025, dropping from $3.72 to $3.59/hr per GPU.

What size AI models can the GH200 run?

With 96GB of VRAM, the GH200 is well suited to 30B-class models in FP16, and 70B-class models in 4-bit or 8-bit quantized form.

How much VRAM does the GH200 have?

The GH200 has 96GB of VRAM. Multi-GPU setups increase total memory, but that memory is not automatically pooled across GPUs.

What is the GH200's memory bandwidth?

The GH200 has 4,000 GB/s of memory bandwidth. Higher bandwidth helps with faster data transfer between GPU memory and compute cores.

What data types does the GH200 support?

The GH200 supports 7 precision formats. Training: BF16, FP16, TF32, FP32. Inference: FP8, INT8. Scientific: FP64.

Does the GH200 support NVLink?

Yes. The GH200 supports NVLink with 900 GB/s of bidirectional bandwidth. This helps accelerate multi-GPU communication.

How much does the GH200 cost per hour?

GH200 pricing currently ranges from $1.99/hr to $6.50/hr per GPU, depending on the provider, instance type, and billing model.

How much does the GH200 cost per month?

At 720 hours per month, one GH200 can cost between $1,432.80 to $4,680.00 per month, depending on the provider. Reserved and spot pricing can lower that further.

Which cloud providers offer the GH200?

The GH200 is available from 4 cloud providers, including CoreWeave, Sesterce, Vultr. Pricing and availability vary by region and billing model.

Can I rent the GH200 in the cloud?

Yes. We currently track 4 GH200 listings across 4 cloud providers:

Billing type Listings Avg $/GPU/hr
On-demand 4 $3.32/hr

Technical Specifications

Feature GH200 GH200 NVL2
CPU core count 72 Arm Neoverse V2 cores 144 Arm Neoverse V2 cores
L1 cache 64KB i-cache + 64KB d-cache 64KB i-cache + 64KB d-cache
L2 cache 1MB per core 1MB per core
L3 cache 114MB 228MB
Base frequency | all-core single instruction, multiple data (SIMD) frequency 3.1GHz 3.0GHz
LPDDR5X size 480GB, 120GB, 240GB 960GB, 240GB, 480GB
Memory bandwidth Up to 384GB/s, Up to 512GB/s Up to 768GB/s, Up to 1024GB/s
PCIe links Up to 4x PCIe x16 (Gen5) Up to 8x PCIe x16 (Gen5)
FP64 34 teraFLOPS 68 teraFLOPS
FP64 Tensor Core 67 teraFLOPS 134 teraFLOPS
FP32 67 teraFLOPS 134 teraFLOPS
TF32 Tensor Core 989 teraFLOPS*, 494 teraFLOPS 1,979 teraFLOPS*, 990 teraFLOPS
BFLOAT16 Tensor Core 1,979 teraFLOPS*, 990 teraFLOPS 3,958 teraFLOPS*, 1,979 teraFLOPS
FP16 Tensor Core 1,979 teraFLOPS*, 990 teraFLOPS 3,958 teraFLOPS*, 1,979 teraFLOPS
FP8 Tensor Core 3,958 teraFLOPS*, 1,979 teraFLOPS 7,916 teraFLOPS*, 3,958 teraFLOPS
INT8 Tensor Core 3,958 TOPS*, 1,979 TOPS 7,916 TOPS*, 3,958 TOPS
High-bandwidth memory (HBM) size 96GB HBM3 144GB HBM3e, Up to 288GB HBM3e
Memory bandwidth Up to 4TB/s, Up to 4.9TB/s Up to 9.8TB/s
NVIDIA NVLink-C2C CPU-to-GPU bandwidth 900 GB/s 900 GB/s
Power Configurable 450 to 1000W (Memory + CPU + GPU) Configurable 900W to 2000W (Memory + CPU + GPU)
Thermal solution Air cooled or liquid cooled Air cooled or liquid cooled

* with sparsity

Source: official Nvidia GH200 datasheet.

Alternatives to Nvidia GH200

Last updated