Meta logo Llama 4 Behemoth

View website

Meta's largest announced Llama model, a ~2T MoE (288B active, 16 experts). Announced April 2025 but not yet publicly released.

At a glance

Context window
1M tokens
Max output
1M tokens
Knowledge cutoff
Apr 2025
Modalities
Text Text Image Image Text Text

Capabilities

Function calling

Function calling

Connect to external tools, APIs, and systems.

Pricing by provider

Provider Input / 1M tokens Output / 1M tokens
Meta logo Meta Self-hosted Self-hosted

Heads up: We do our best to keep these specs & prices accurate. However, cloud costs may fluctuate based on region, usage, and other factors not listed here. These are estimates based on common setups and are for informational purposes only. Always verify current rates & exact specs with the provider before provisioning.

Compare with other models

Estimated prices shown. Actual costs may vary based on context length, batch size, caching, and provider-specific pricing tiers.