Nvidia A16

Compare prices for Nvidia A16 across cloud providers

Dec. 11, 2024 (updated)

Introduced in 2021, the Nvidia A16 is optimized for virtualized environments, supporting multi-instance AI inference and cloud-based deployments. Its design is aimed at enabling efficient scaling of inference workloads in multi-user setups.

Provider GPUs VRAM vCPUs RAM Price/h
Vultr logo Vultr 1x A16 16GB 6 64GB $0.51 Launch
Vultr logo Vultr 2x A16 96GB 12 128GB $1.02 Launch
Vultr logo Vultr 4x A16 192GB 24 256GB $2.05 Launch
Vultr logo Vultr 8x A16 384GB 48 486GB $4.09 Launch
Vultr logo Vultr 16x A16 768GB 96 960GB $9.19 Launch

Note: Prices are subject to change and may vary by region and other factors not listed here. For some GPUs, I include links to Shadeform (the sponsor) so you can check if they're available right now. I don’t earn a commission when you click on these links, but their monthly sponsorship helps me keep the site running.

Nvidia A16 specs

GPU Architecture NVIDIA Ampere architecture
GPU Memory 4x 16 GB GDDR6
Memory Bandwidth 4x 200 GB/s
Error-Correcting Code (ECC) Yes
NVIDIA Ampere architecture-based CUDA Cores 4x 1280
NVIDIA Third-Generation Tensor Cores 4x 40
NVIDIA Second-Generation RT Cores 4x 10
FP32 | TF32 | TF32' (TFLOPS) 4x 4.5, 4x 9, 4x 18
FP16 | FP16' (TFLOPS) 4x 17.9, 4x 35.9
INT8 | INT8' (TOPS) 4x 35.9, 4x 71.8
System Interface PCIe Gen4 (x16)
Max Power Consumption 250W
Thermal Solution Passive
Form Factor Full height, full length (FHFL) Dual Slot
Power Connector 8-pin CPU
Encode/Decode Engines 4 NVENC, 8 NVDEC (includes AV1 decode)
Secure and Measured Boot with Hardware Root of Trust for GPU Yes (optional)
vGPU Software Support NVIDIA Virtual PC (vPC), NVIDIA Virtual Applications (vApps), NVIDIA RTX Virtual Workstation (vWS), NVIDIA AI Enterprise, NVIDIA Virtual Compute Server (vCS)
Graphics APIs DirectX 12.07, Shader Model 5.17, OpenGL 4.68, Vulkan 1.18
Compute APIs CUDA, DirectCompute, OpenCL™, OpenACC®
MIG Support No

Source: official Nvidia A16 datasheet.