Command A Vision - Specs & Pricing [2026]

Cohere's first multimodal model, released July 2025. Targets enterprise visual tasks like chart analysis, OCR, and document Q&A across 6 languages.

At a glance

Context window

128K tokens

Max output

8K tokens

Knowledge cutoff

Jun 2024

Modalities

Text Image → Text

Capabilities

Structured output

Return responses in structured formats like JSON.

Pricing by provider

Provider	Input / 1M tokens	Output / 1M tokens
Cohere	Self-hosted	Self-hosted	View

Provider

Input / 1M tokens

Output / 1M tokens

Cohere

Self-hosted

View

Heads up: We do our best to keep these specs & prices accurate. However, cloud costs may fluctuate based on region, usage, and other factors not listed here. These are estimates based on common setups and are for informational purposes only. Always verify current rates & exact specs with the provider before provisioning.