48GB GDDR6 with Ada Lovelace architecture. Good for visualization, rendering, and inference workloads. FP8 Tensor Core support for efficient AI inference.

When is the L40 not a good fit?

PCIe-only. Lower inference throughput than L40S for pure AI workloads. If you don't need the visualization features, the L40S is typically a better value.

Are L40 prices going up or down?

On-demand pricing has decreased by about 16% since July 2025, dropping from $1.07 to $0.89/hr per GPU.

What size AI models can the L40 run?

With 48GB of VRAM, the L40 can typically run models up to about 30B parameters in FP16, or 70B-class models in 4-bit quantized form for inference.

How much VRAM does the L40 have?

The L40 has 48GB of VRAM. Multi-GPU setups increase total memory, but that memory is not automatically pooled across GPUs.

What is the L40's memory bandwidth?

The L40 has 864 GB/s of memory bandwidth. Higher bandwidth helps with faster data transfer between GPU memory and compute cores.

What data types does the L40 support?

The L40 supports 6 precision formats. Training: BF16, FP16, TF32, FP32. Inference: FP8, INT8.

Does the L40 support NVLink?

No. The L40 is a PCIe-only GPU with no NVLink, so it is better suited to single-GPU inference and smaller-scale workloads than large distributed training jobs.

How much does the L40 cost per hour?

L40 pricing currently ranges from $0.39/hr to $1.64/hr per GPU, depending on the provider, instance type, and billing model.

How much does the L40 cost per month?

At 720 hours per month, one L40 can cost between $279.65 to $1,180.08 per month, depending on the provider. Reserved and spot pricing can lower that further.

Which cloud providers offer the L40?

The L40 is available from 11 cloud providers, including Sesterce, Hyperstack, CoreWeave. Pricing and availability vary by region and billing model.

Can I rent the L40 in the cloud?

Yes. We currently track 37 L40 listings across 11 cloud providers: Billing type Listings Avg $/GPU/hr On-demand 28 $0.96/hr Reserved 4 $0.71/hr Spot 5 $0.58/hr

Nvidia L40

Data center GPU for combined AI inference and visualization.

Compare vs other GPUs →

Aggregating historical prices...

Key Specifications

Architecture

Ada Lovelace

Memory

48GB GDDR6

Memory Bandwidth

864 GB/s

Release date

Q4 2022

Compare Cloud Provider Prices

Listings for the L40 reach $1.64/hr, often reflecting a premium for high availability. However, you might be able to find available instances from as low as $0.39/hr per GPU (spot instance).

Billing

Max price/hr

Min GPUs

Min vCPUs

Min RAM (GB)

of shown

Default ranking

Our algorithm weighs five factors to find the most relevant matches for you:

Price: We blend total hourly price with price-per-GPU to balance affordability and value.
Specs: We favor offers with higher CPU, RAM, and GPU Memory.
Billing: We favor on-demand billing for simplicity and flexibility over spot instances, reservations, and custom quotes.
Location: We blend both datacenter proximity and provider HQ location. Datacenter location matters for latency, while HQ location matters for compliance and support.
Provider diversity: Each time a provider appears in the list, their subsequent offerings are ranked a little lower, so one provider's offerings don't crowd out the top positions.

Sorting and filtering

Click any column header to sort by that column. Use the filters above the table to narrow results by billing type, GPU count, vCPUs, or RAM. Custom sorting resets the default relevance ranking.

Transparency and funding

Ads and sponsors: Any paid placements are fixed at the top and clearly labeled as sponsored content.

Affiliates: Any affiliate links will be indicated to you as well. We may earn a commission if you click them, but this never influences the ranking order.

Provider	GPUs	Total VRAM	vCPUs	RAM	Billing	$/GPU/h	Total/h	Availability
Thunder Compute Our sponsor	1x L40 1x L40 48GB	48GB	6	48GB	On-Demand Pay-as-you-go pricing. No term commitments.	$0.79	$0.79	Available Last checked <24h ago	View
Runpod	1x L40 1x L40 48GB (community-cloud)	48GB	9	50GB	On-Demand Pay-as-you-go pricing. No term commitments.	$0.69	$0.69	Available Last checked <15m ago	View
Runpod	1x L40 1x L40 48GB (secure-cloud)	48GB	9	50GB	On-Demand Pay-as-you-go pricing. No term commitments.	$0.82	$0.82	Low stock Last checked <15m ago	View
Sesterce	1x L40 1x L40 48GB	48GB	26	192GB	On-Demand Pay-as-you-go pricing. No term commitments.	$0.97	$0.97	Available Last checked <15m ago	View
Hyperstack	1x L40 1x L40 48GB	48GB	28	120GB	On-Demand Pay-as-you-go pricing. No term commitments.	$1.00	$1.00	Unknown	View
Thunder Compute	2x L40 2x L40 48GB	96GB	12	96GB	On-Demand Pay-as-you-go pricing. No term commitments.	$0.79	$1.58	Available Last checked <24h ago	View
Oblivus	1x L40 1x L40 48GB	48GB	28	58GB	On-Demand Pay-as-you-go pricing. No term commitments.	$1.05	$1.05	Unknown	View
Sesterce	1x L40 1x L40 48GB	48GB	28	58GB	On-Demand Pay-as-you-go pricing. No term commitments.	$1.10	$1.10	Available Last checked <15m ago	View
Vast.ai	1x L40 1x L40 46GB	46GB	--	--	On-Demand Pay-as-you-go pricing. No term commitments.	$0.53	$0.53	Available Last checked <15m ago	View
Massed Compute	1x L40 1x L40 48GB	48GB	14	72GB	On-Demand Pay-as-you-go pricing. No term commitments.	$0.86	$0.86	Available Last checked <24h ago	View
Runcrate	1x L40 PCIe 1x L40 48GB PCIe	48GB	26	192GB	On-Demand Pay-as-you-go pricing. No term commitments.	$0.97	$0.97	Available Last checked <1h ago	View
Sesterce	2x L40 2x L40 48GB	96GB	50	384GB	On-Demand Pay-as-you-go pricing. No term commitments.	$1.09	$2.18	Available Last checked <15m ago	View
TensorDock	1x L40 1x L40 48GB	48GB	16	32GB	On-Demand Starting price, marketplace rates vary	$1.06	$1.06	Unknown	View
Hyperstack	1x L40 1x L40 48GB	48GB	28	120GB	Reservation Reserved capacity with term commitments.	$0.70	$0.70	Unknown	View
Thunder Compute	4x L40 4x L40 48GB	192GB	24	192GB	On-Demand Pay-as-you-go pricing. No term commitments.	$0.99	$3.96	Available Last checked <24h ago	View
Vast.ai	2x L40 2x L40 46GB	92GB	--	--	On-Demand Pay-as-you-go pricing. No term commitments.	$0.58	$1.15	Available Last checked <15m ago	View
CoreWeave	8x L40 8x L40 48GB	384GB	128	1.0TB	On-Demand Pay-as-you-go pricing. No term commitments.	$1.25	$10.00	Unknown	View
Sesterce	2x L40 2x L40 48GB	96GB	60	116GB	On-Demand Pay-as-you-go pricing. No term commitments.	$1.10	$2.20	Available Last checked <15m ago	View
Massed Compute	2x L40 2x L40 48GB	96GB	26	144GB	On-Demand Pay-as-you-go pricing. No term commitments.	$0.86	$1.72	Available Last checked <24h ago	View
Hyperstack	1x L40 1x L40 48GB	48GB	--	--	Spot Variable spot pricing with potential interruptions.	$0.80	$0.80	Unknown	View
No offerings matching your filters.

Heads up: We do our best to keep these specs & prices accurate. However, cloud costs may fluctuate based on region, usage, and other factors not listed here. These are estimates based on common setups and are for informational purposes only. Always verify current rates & exact specs with the provider before provisioning.

Frequently Asked Questions

Why choose the L40?: 48GB GDDR6 with Ada Lovelace architecture. Good for visualization, rendering, and inference workloads. FP8 Tensor Core support for efficient AI inference.
When is the L40 not a good fit?: PCIe-only. Lower inference throughput than L40S for pure AI workloads. If you don't need the visualization features, the L40S is typically a better value.
Are L40 prices going up or down?: On-demand pricing has decreased by about 16% since July 2025, dropping from $1.07 to $0.89/hr per GPU.
What size AI models can the L40 run?: With 48GB of VRAM, the L40 can typically run models up to about 30B parameters in FP16, or 70B-class models in 4-bit quantized form for inference.
How much VRAM does the L40 have?: The L40 has 48GB of VRAM. Multi-GPU setups increase total memory, but that memory is not automatically pooled across GPUs.
What is the L40's memory bandwidth?: The L40 has 864 GB/s of memory bandwidth. Higher bandwidth helps with faster data transfer between GPU memory and compute cores.
What data types does the L40 support?: The L40 supports 6 precision formats. Training: BF16, FP16, TF32, FP32. Inference: FP8, INT8.
Does the L40 support NVLink?: No. The L40 is a PCIe-only GPU with no NVLink, so it is better suited to single-GPU inference and smaller-scale workloads than large distributed training jobs.
How much does the L40 cost per hour?: L40 pricing currently ranges from $0.39/hr to $1.64/hr per GPU, depending on the provider, instance type, and billing model.
How much does the L40 cost per month?: At 720 hours per month, one L40 can cost between $279.65 to $1,180.08 per month, depending on the provider. Reserved and spot pricing can lower that further.
Which cloud providers offer the L40?: The L40 is available from 11 cloud providers, including Sesterce, Hyperstack, CoreWeave. Pricing and availability vary by region and billing model.
Can I rent the L40 in the cloud?: Yes. We currently track 37 L40 listings across 11 cloud providers:

Billing type Listings Avg $/GPU/hr

On-demand 28 $0.96/hr

Reserved 4 $0.71/hr

Spot 5 $0.58/hr

Billing type	Listings	Avg $/GPU/hr
On-demand	28	$0.96/hr
Reserved	4	$0.71/hr
Spot	5	$0.58/hr

Technical Specifications


GPU Architecture	NVIDIA Ada Lovelace architecture
GPU Memory	48GB GDDR6
Memory Bandwidth	864GB/s
Interconnect Interface	PCIe Gen4 x16 (64GB/s bi-directional)
CUDA Cores	18,176
Third-Generation RT Cores	142
Fourth-Generation Tensor Cores	568
RT Core Performance TFLOPS	209
FP32 TFLOPS	90.5
TF32 Tensor Core TFLOPS	90.5
BFLOAT16 Tensor Core TFLOPS	181.05
FP16 Tensor Core TFLOPS	181.05
FP8 Tensor Core TFLOPS	362
Peak INT8 Tensor TOPS	362
Peak INT4 Tensor TOPS	724
Form Factor	4.4" (H) x 10.5" (L) - dual slot
Display Ports	4x DisplayPort 1.4a
Max Power Consumption	300W
NVLink Support	No