Nvidia A16 - GPU Price Comparison

Introduced in 2021, the Nvidia A16 is optimized for virtualized environments, supporting multi-instance AI inference and cloud-based deployments. Its design is aimed at enabling efficient scaling of inference workloads in multi-user setups.

Provider	GPUs	VRAM	vCPUs	RAM	Price/h
Vultr	1x A16	16GB	6	64GB	$0.51	Source
Sesterce	1x A16	16GB	6	--	$0.56	Source
Vultr	2x A16	96GB	12	128GB	$1.02	Source
Sesterce	2x A16	32GB	12	--	$1.12	Source
Vultr	4x A16	192GB	24	256GB	$2.05	Source
Sesterce	4x A16	64GB	24	--	$2.26	Source
Vultr	8x A16	384GB	48	486GB	$4.09	Source
Sesterce	8x A16	128GB	48	--	$4.50	Source
Vultr	16x A16	768GB	96	960GB	$9.19	Source
Sesterce	16x A16	256GB	96	--	$10.11	Source

Note: Prices are subject to change and may vary by region and other factors not listed here.

Compare Nvidia A16 against other GPUs →

Nvidia A16 specs


GPU Architecture	NVIDIA Ampere architecture
GPU Memory	4x 16 GB GDDR6
Memory Bandwidth	4x 200 GB/s
Error-Correcting Code (ECC)	Yes
NVIDIA Ampere architecture-based CUDA Cores	4x 1280
NVIDIA Third-Generation Tensor Cores	4x 40
NVIDIA Second-Generation RT Cores	4x 10
FP32 \| TF32 \| TF32' (TFLOPS)	4x 4.5, 4x 9, 4x 18
FP16 \| FP16' (TFLOPS)	4x 17.9, 4x 35.9
INT8 \| INT8' (TOPS)	4x 35.9, 4x 71.8
System Interface	PCIe Gen4 (x16)
Max Power Consumption	250W
Thermal Solution	Passive
Form Factor	Full height, full length (FHFL) Dual Slot
Power Connector	8-pin CPU
Encode/Decode Engines	4 NVENC, 8 NVDEC (includes AV1 decode)
Secure and Measured Boot with Hardware Root of Trust for GPU	Yes (optional)
vGPU Software Support	NVIDIA Virtual PC (vPC), NVIDIA Virtual Applications (vApps), NVIDIA RTX Virtual Workstation (vWS), NVIDIA AI Enterprise, NVIDIA Virtual Compute Server (vCS)
Graphics APIs	DirectX 12.07, Shader Model 5.17, OpenGL 4.68, Vulkan 1.18
Compute APIs	CUDA, DirectCompute, OpenCL™, OpenACC®
MIG Support	No

Source: official Nvidia A16 datasheet.

Runpod

Sponsor

Spin up a GPU in seconds across 30+ regions

Managed containers with monitoring built-in

Autoscales from 0 to thousands of containers

Learn more →