Llama 3.2 11B Vision

View website

Meta's smallest multimodal Llama, released September 2024 at 11B parameters. Adds image understanding via cross-attention adapters, suitable for single-GPU deployment.

At a glance

Context window: 128K tokens
Max output: 128K tokens
Knowledge cutoff: Sep 2024
Modalities: Text Text Image Image → Text Text

Pricing by provider

Provider	Input / 1M tokens	Output / 1M tokens
Together	$0.18	$0.18	View
Meta	Self-hosted	Self-hosted	View

Heads up: We do our best to keep these specs & prices accurate. However, cloud costs may fluctuate based on region, usage, and other factors not listed here. These are estimates based on common setups and are for informational purposes only. Always verify current rates & exact specs with the provider before provisioning.

Compare with other models

Estimated prices shown. Actual costs may vary based on context length, batch size, caching, and provider-specific pricing tiers.