Cohere logo Command A Vision

View website

Cohere's first multimodal model, released July 2025. Targets enterprise visual tasks like chart analysis, OCR, and document Q&A across 6 languages.

At a glance

Context window
128K tokens
Max output
8K tokens
Knowledge cutoff
Jun 2024
Modalities
Text Text Image Image Text Text

Capabilities

Structured output

Structured output

Return responses in structured formats like JSON.

Pricing by provider

Provider Input / 1M tokens Output / 1M tokens
Cohere logo Cohere Self-hosted Self-hosted

Heads up: We do our best to keep these specs & prices accurate. However, cloud costs may fluctuate based on region, usage, and other factors not listed here. These are estimates based on common setups and are for informational purposes only. Always verify current rates & exact specs with the provider before provisioning.

Compare with other models

Estimated prices shown. Actual costs may vary based on context length, batch size, caching, and provider-specific pricing tiers.