Key Specifications
Compare Cloud Provider Prices
L40 listings reach as high as $1.64/hr per GPU, often reflecting a premium for high availability. Spot instances, however, can be found from as low as $0.47/hr per GPU.
How we sort results
Default ranking
Our algorithm weighs five factors to find the most relevant matches for you:
- Price: We blend total hourly price with price-per-GPU to balance affordability and value.
- Specs: We favor offers with higher CPU, RAM, and GPU Memory.
- Billing: We favor on-demand billing for simplicity and flexibility over spot instances, reservations, and custom quotes.
- Location: We blend both datacenter proximity and provider HQ location. Datacenter location matters for latency, while HQ location matters for compliance and support.
- Provider diversity: Each time a provider appears in the list, their subsequent offerings are ranked a little lower, so one provider's offerings don't crowd out the top positions.
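The provider-diversity factor can be illustrated with a small greedy re-ranking sketch. This is not our production code: the `Offer` class, the pre-computed `relevance` score, and the `penalty` value of 0.1 are all hypothetical, chosen only to show how repeated appearances by one provider could be ranked a little lower.

```python
from dataclasses import dataclass


@dataclass
class Offer:
    provider: str
    relevance: float  # hypothetical score already combining price, specs, billing, location


def rank_with_diversity(offers, penalty=0.1):
    """Greedy re-ranking: every placed offer lowers the effective score
    of that provider's remaining offers by `penalty` (an assumed value)."""
    remaining = list(offers)
    placed = {}  # provider -> number of offers already ranked
    ranked = []
    while remaining:
        best = max(
            remaining,
            key=lambda o: o.relevance - penalty * placed.get(o.provider, 0),
        )
        ranked.append(best)
        remaining.remove(best)
        placed[best.provider] = placed.get(best.provider, 0) + 1
    return ranked


offers = [Offer("A", 0.90), Offer("A", 0.85), Offer("B", 0.80), Offer("A", 0.78)]
ranked = rank_with_diversity(offers)
for o in ranked:
    print(o.provider, o.relevance)
```

In this toy input, provider B's single offer moves ahead of provider A's second offer, even though its raw score is lower.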
Sorting and filtering
Click any column header to sort by that column. Use the filters above the table to narrow results by billing type, GPU count, vCPUs, or RAM. Sorting by a column overrides the default relevance ranking.
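If you export listings and want to reproduce a filter-then-sort step offline, a minimal sketch might look like the following. The tuple layout and the `filter_and_sort` helper are hypothetical; the sample rows are a few of the L40 on-demand listings from this page.

```python
# (provider, gpu_count, vcpus, ram_gb, price_per_gpu_hr): sample rows only
listings = [
    ("Runpod", 1, 9, 50, 0.69),
    ("Sesterce", 1, 26, 192, 0.97),
    ("Runpod", 2, 18, 100, 0.69),
    ("Massed Compute", 1, 14, 72, 0.86),
    ("Sesterce", 2, 50, 384, 1.09),
]


def filter_and_sort(rows, gpu_count=None, min_vcpus=0, min_ram_gb=0):
    """Keep rows matching the filters, then sort by price per GPU-hour."""
    kept = [
        r for r in rows
        if (gpu_count is None or r[1] == gpu_count)
        and r[2] >= min_vcpus
        and r[3] >= min_ram_gb
    ]
    return sorted(kept, key=lambda r: r[4])


single_gpu = filter_and_sort(listings, gpu_count=1, min_vcpus=10)
for row in single_gpu:
    print(row)
```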
Transparency and funding
Ads and sponsors: Any paid placements are fixed at the top and clearly labeled as sponsored content.
Affiliates: Any affiliate links will be indicated to you as well. We may earn a commission if you click them, but this never influences the ranking order.
| Provider | GPUs | Total VRAM | vCPUs | RAM | Billing | $/GPU/h | Total/h | Availability |
|---|---|---|---|---|---|---|---|---|
| Thunder Compute (our sponsor) | 1x L40 48GB (Prototyping) | 48GB | 4 | 32GB | On-Demand | $0.89 | $0.89 | Available |
| Runpod | 1x L40 48GB (community-cloud) | 48GB | 9 | 50GB | On-Demand | $0.69 | $0.69 | Available |
| CoreWeave | 1x L40 48GB | 48GB | -- | -- | On-Demand | $1.25 | $1.25 | Available |
| Runpod | 1x L40 48GB (secure-cloud) | 48GB | 9 | 50GB | On-Demand | $0.82 | $0.82 | Available |
| Together | 1x L40 48GB | 48GB | -- | -- | On-Demand | $1.49 | $1.49 | Unknown |
| Sesterce | 1x L40 48GB | 48GB | 26 | 192GB | On-Demand | $0.97 | $0.97 | Available |
| Hyperstack | 1x L40 48GB | 48GB | 28 | 120GB | On-Demand | $1.00 | $1.00 | Unknown |
| Runpod | 2x L40 48GB (community-cloud) | 96GB | 18 | 100GB | On-Demand | $0.69 | $1.38 | Available |
| Oblivus | 1x L40 48GB | 48GB | 28 | 58GB | On-Demand | $1.05 | $1.05 | Unknown |
| Thunder Compute | 2x L40 48GB (Prototyping) | 96GB | 8 | 64GB | On-Demand | $0.89 | $1.78 | Available |
| Sesterce | 1x L40 48GB | 48GB | 28 | 58GB | On-Demand | $1.10 | $1.10 | Available |
| Runpod | 2x L40 48GB (secure-cloud) | 96GB | 18 | 100GB | On-Demand | $0.82 | $1.64 | Available |
| Thunder Compute | 1x L40 48GB (Production) | 48GB | 10 | 80GB | On-Demand | $1.39 | $1.39 | Available |
| Massed Compute | 1x L40 48GB | 48GB | 14 | 72GB | On-Demand | $0.86 | $0.86 | Available |
| Runcrate | 1x L40 48GB PCIe | 48GB | 26 | 192GB | On-Demand | $0.97 | $0.97 | Available |
| Sesterce | 2x L40 48GB | 96GB | 50 | 384GB | On-Demand | $1.09 | $2.18 | Available |
| Vast.ai | 1x L40 46GB (27979081) | 46GB | 128 | 128GB | On-Demand | $1.00 | $1.00 | Available |
| TensorDock | 1x L40 48GB | 48GB | 16 | 32GB | On-Demand (starting price; marketplace rates vary) | $1.06 | $1.06 | Unknown |
| Hyperstack | 1x L40 48GB | 48GB | -- | -- | Reserved | $0.70 | $0.70 | Unknown |
| Sesterce | 2x L40 48GB | 96GB | 60 | 116GB | On-Demand | $1.10 | $2.20 | Available |

Billing types: On-Demand is pay-as-you-go pricing with no term commitments. Reserved is reserved capacity with term commitments.
Heads up: We do our best to keep these prices accurate. However, cloud costs may fluctuate based on region, usage, and other factors not listed here. These are estimates based on common setups and are for informational purposes only. Always verify current rates with the provider before provisioning.
Frequently Asked Questions
Why choose the L40?
48GB GDDR6 with Ada Lovelace architecture. Good for visualization, rendering, and inference workloads. FP8 Tensor Core support for efficient AI inference.
When is the L40 not a good fit?
PCIe-only. Lower inference throughput than L40S for pure AI workloads. If you don't need the visualization features, the L40S is typically a better value.
Are L40 prices going up or down?
On-demand pricing has increased by about 11% since June 2025, from $1.01 to $1.12/hr per GPU.
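The percentage follows from the two quoted prices. A quick check, using a hypothetical `percent_change` helper:

```python
def percent_change(old, new):
    """Relative change from old to new, in percent."""
    return (new - old) / old * 100


change = percent_change(1.01, 1.12)
print(f"{change:.1f}%")  # rounds to about 11%
```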
What size AI models can the L40 run?
With 48GB of VRAM, the L40 can typically run models up to about 20B parameters in FP16 (weights take roughly 2 bytes per parameter, so 24B is the weights-only ceiling), or 70B-class models in 4-bit quantized form for inference.
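A rough way to sanity-check model sizes is to multiply parameter count by bytes per parameter. The sketch below is a weights-only estimate under simple assumptions (2 bytes per parameter for FP16, 1 for INT8, 0.5 for 4-bit, and 1 GB = 1e9 bytes). It ignores KV cache, activations, and framework overhead, so real limits are lower. The helper names are hypothetical.

```python
BYTES_PER_PARAM = {"fp16": 2.0, "int8": 1.0, "int4": 0.5}  # assumed storage sizes
L40_VRAM_GB = 48.0


def weight_memory_gb(params_billion, fmt):
    """Weights-only footprint in GB for a model with `params_billion` parameters."""
    return params_billion * BYTES_PER_PARAM[fmt]


def max_params_billion(fmt, vram_gb=L40_VRAM_GB):
    """Largest model (in billions of parameters) whose weights alone fit in VRAM."""
    return vram_gb / BYTES_PER_PARAM[fmt]


print(weight_memory_gb(70, "int4"))  # weights of a 70B model at 4-bit
print(max_params_billion("fp16"))    # weights-only ceiling in FP16
```

Under these assumptions, a 70B model at 4-bit needs about 35 GB for weights, which leaves roughly 13 GB of the L40's 48 GB for everything else.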
How much VRAM does the L40 have?
The L40 has 48GB of VRAM. Multi-GPU setups increase total memory, but that memory is not automatically pooled across GPUs.
What is the L40's memory bandwidth?
The L40 has 864 GB/s of memory bandwidth. Higher bandwidth helps with faster data transfer between GPU memory and compute cores.
What data types does the L40 support?
The L40 supports 6 precision formats. Training: BF16, FP16, TF32, FP32. Inference: FP8, INT8.
Does the L40 support NVLink?
No. The L40 is a PCIe-only GPU with no NVLink, so it is better suited to single-GPU inference and smaller-scale workloads than large distributed training jobs.
How much does the L40 cost per hour?
L40 pricing currently ranges from $0.47/hr to $1.64/hr per GPU, depending on the provider, instance type, and billing model.
How much does the L40 cost per month?
At 720 hours per month, one L40 can cost between $337.32 and $1,180.08 per month, depending on the provider. Reserved and spot pricing can lower that further.
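To estimate a monthly bill for your own setup, multiply the hourly rate by GPU count and hours. The sketch below uses a hypothetical `monthly_cost` helper with the rounded hourly rates quoted on this page. Rounded rates give totals about a dollar off the monthly figures above, which presumably come from unrounded rates.

```python
HOURS_PER_MONTH = 720  # 30 days x 24 hours


def monthly_cost(rate_per_hour, gpus=1, hours=HOURS_PER_MONTH):
    """Estimated cost in dollars for running `gpus` GPUs for `hours` hours."""
    return round(rate_per_hour * gpus * hours, 2)


low = monthly_cost(0.47)
high = monthly_cost(1.64)
print(f"${low:,.2f} to ${high:,.2f} per month")
```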
Which cloud providers offer the L40?
The L40 is available from 11 cloud providers, including Runpod, Sesterce, and CoreWeave. Pricing and availability vary by region and billing model.
Can I rent the L40 in the cloud?
Yes. We currently track 52 L40 listings across 11 cloud providers:
| Billing type | Listings | Avg $/GPU/hr |
|---|---|---|
| On-demand | 33 | $1.07/hr |
| Reserved | 10 | $0.77/hr |
| Spot | 9 | $0.69/hr |
Technical Specifications
| Specification | Value |
|---|---|
| GPU Architecture | NVIDIA Ada Lovelace architecture |
| GPU Memory | 48GB GDDR6 |
| Memory Bandwidth | 864GB/s |
| Interconnect Interface | PCIe Gen4 x16 (64GB/s bi-directional) |
| CUDA Cores | 18,176 |
| Third-Generation RT Cores | 142 |
| Fourth-Generation Tensor Cores | 568 |
| RT Core Performance TFLOPS | 209 |
| FP32 TFLOPS | 90.5 |
| TF32 Tensor Core TFLOPS | 90.5 |
| BFLOAT16 Tensor Core TFLOPS | 181.05 |
| FP16 Tensor Core TFLOPS | 181.05 |
| FP8 Tensor Core TFLOPS | 362 |
| Peak INT8 Tensor TOPS | 362 |
| Peak INT4 Tensor TOPS | 724 |
| Form Factor | 4.4" (H) x 10.5" (L) - dual slot |
| Display Ports | 4x DisplayPort 1.4a |
| Max Power Consumption | 300W |
| NVLink Support | No |
Source: official Nvidia L40 datasheet.
Alternatives to Nvidia L40