Nvidia B100

Nvidia B100

Blackwell data center GPU for next-generation AI training and inference.

Compare vs other GPUs →
Aggregating historical prices...

Key Specifications

Architecture
Blackwell
Memory
Up to 192GB HBM3e
Memory Bandwidth
8,000 GB/s
Release date
Q4 2024

Compare Cloud Provider Prices

The B100 is listed by 1 cloud provider, but none offer public on-demand pricing.

of shown

Default ranking

Our algorithm weighs five factors to find the most relevant matches for you:

  1. Price: We blend total hourly price with price-per-GPU to balance affordability and value.
  2. Specs: We favor offers with higher CPU, RAM, and GPU Memory.
  3. Billing: We favor on-demand billing for simplicity and flexibility over spot instances, reservations, and custom quotes.
  4. Location: We blend both datacenter proximity and provider HQ location. Datacenter location matters for latency, while HQ location matters for compliance and support.
  5. Provider diversity: Each time a provider appears in the list, their subsequent offerings are ranked a little lower, so one provider's offerings don't crowd out the top positions.

Sorting and filtering

Click any column header to sort by that column. Use the filters above the table to narrow results by billing type, GPU count, vCPUs, or RAM. Custom sorting resets the default relevance ranking.

Transparency and funding

Ads and sponsors: Any paid placements are fixed at the top and clearly labeled as sponsored content.

Affiliates: Any affiliate links will be indicated to you as well. We may earn a commission if you click them, but this never influences the ranking order.

Provider GPUs Total VRAM vCPUs RAM Billing $/GPU/h Total/h Availability
CUDO Compute logo CUDO 1x B100 1x B100 192GB 192GB -- -- Custom Custom terms negotiated with the provider. Custom Custom Unknown View
No offerings matching your filters.

Heads up: We do our best to keep these prices accurate. However, cloud costs may fluctuate based on region, usage, and other factors not listed here. These are estimates based on common setups and are for informational purposes only. Always verify current rates with the provider before provisioning.

Frequently Asked Questions

Why choose the B100?

192GB HBM3e with Blackwell architecture FP4 Tensor Cores. 1800 GB/s NVLink for massive multi-GPU scaling. Significant inference throughput improvement over H100 with FP4 precision.

When is the B100 not a good fit?

New generation with limited provider availability. High cost per hour. For inference on models under 70B or workloads that don't benefit from FP4, the H100 or L40S is more cost-effective.

What size AI models can the B100 run?

With 192GB of VRAM, the B100 can usually run 70B-class models with headroom, and may handle much larger models in 4-bit quantized form depending on runtime overhead, KV cache, and context length.

How much VRAM does the B100 have?

The B100 has 192GB of VRAM. Multi-GPU setups increase total memory, but that memory is not automatically pooled across GPUs.

What is the B100's memory bandwidth?

The B100 has 8,000 GB/s of memory bandwidth. Higher bandwidth helps with faster data transfer between GPU memory and compute cores.

What data types does the B100 support?

The B100 supports 9 precision formats. Training: BF16, FP16, TF32, FP32. Inference: FP4, FP6, FP8, INT8. Scientific: FP64.

Does the B100 support NVLink?

Yes. The B100 supports NVLink with 1800 GB/s of bidirectional bandwidth. This helps accelerate multi-GPU communication.

Which cloud providers offer the B100?

The B100 is available from 1 cloud provider: CUDO Compute.

Can I rent the B100 in the cloud?

Yes. We currently track 1 B100 listings across 1 cloud providers:

Billing type Listings Avg $/GPU/hr
Custom contract 1 Custom

Technical Specifications

Form Factor 8x NVIDIA B100 SXM
FP4 Tensor Core¹ 112 PFLOPS
FP8/FP6 Tensor Core¹ 56 PFLOPS
INT8 Tensor Core¹ 56 POPS
FP16/BF16 Tensor Core¹ 28 PFLOPS
TF32 Tensor Core¹ 14 PFLOPS
FP32 480 TFLOPS
FP64 240 TFLOPS
FP64 Tensor Core 240 TFLOPS
Memory Up to 1.5TB
NVLink Fifth generation
NVIDIA NVSwitch™ Fourth generation
NVSwitch GPU-to-GPU Bandwidth 1.8TB/s
Total Aggregate Bandwidth 14.4TB/s

¹ With sparsity.

Source: official Nvidia B100 datasheet.

Alternatives to Nvidia B100

Last updated