Llama 4 Maverick - Specs & Pricing [2026]

Released in April 2025, Llama 4 Maverick is a 400B-parameter mixture-of-experts (MoE) model with 17B active parameters and 128 experts that fits on a single H100 node. It belongs to Meta's first MoE-based Llama generation, trained on 30T tokens.

At a glance

Context window

1M tokens

Max output

1M tokens

Knowledge cutoff

Aug 2024

Modalities

Text, Image → Text

Capabilities

Function calling

Connect to external tools, APIs, and systems.
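As a rough illustration of what function calling involves on the client side, the sketch below describes one tool and dispatches a tool call to local code. It assumes the JSON-schema tool format used by OpenAI-compatible chat endpoints; this page does not state which format each provider uses, so check the provider's documentation. The `get_weather` tool, the `REGISTRY` mapping, and the simulated `example_call` are hypothetical and exist only for this example.

```python
import json

# Hypothetical tool for illustration; not part of any provider's API.
def get_weather(city: str) -> dict:
    """Return a canned weather report (stand-in for a real API call)."""
    return {"city": city, "forecast": "sunny", "temp_c": 21}

# Tool description in the JSON-schema style used by OpenAI-compatible
# chat endpoints (an assumption; confirm with your provider's docs).
TOOLS = [
    {
        "type": "function",
        "function": {
            "name": "get_weather",
            "description": "Get the current weather for a city.",
            "parameters": {
                "type": "object",
                "properties": {"city": {"type": "string"}},
                "required": ["city"],
            },
        },
    }
]

# Maps tool names the model may request to local Python callables.
REGISTRY = {"get_weather": get_weather}

def dispatch_tool_call(tool_call: dict) -> str:
    """Run the tool named in a model-emitted tool call and return JSON text."""
    name = tool_call["function"]["name"]
    if name not in REGISTRY:
        raise ValueError(f"unknown tool: {name}")
    # In this format, arguments arrive as a JSON-encoded string.
    args = json.loads(tool_call["function"]["arguments"])
    return json.dumps(REGISTRY[name](**args))

# Simulated model output; a real tool call would come from the provider.
example_call = {
    "id": "call_0",
    "type": "function",
    "function": {"name": "get_weather", "arguments": '{"city": "Paris"}'},
}
result = dispatch_tool_call(example_call)
print(result)
```

In a real exchange, `TOOLS` would be sent with the chat request and the string returned by `dispatch_tool_call` would be passed back to the model as a tool message so it can compose the final answer.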

Pricing by provider

Provider	Input / 1M tokens	Output / 1M tokens
Replicate	$0.25	$0.95
Together	$0.27	$0.85
Novita	$0.27	$0.85
Meta	Self-hosted	Self-hosted
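To turn the per-million-token prices into a per-request figure, a minimal sketch follows. The prices are copied from the table above and may be out of date; the `estimate_cost` helper and the token counts in the usage line are illustrative assumptions. The self-hosted Meta option is omitted because it has no per-token price.

```python
# USD per 1M tokens as (input, output), copied from the pricing table.
PRICES = {
    "Replicate": (0.25, 0.95),
    "Together": (0.27, 0.85),
    "Novita": (0.27, 0.85),
}

def estimate_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request from its token counts."""
    input_price, output_price = PRICES[provider]
    # Prices are quoted per 1M tokens, so scale the total down accordingly.
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000

# Example: a request with 2,000 input tokens and 500 output tokens.
for provider in PRICES:
    print(f"{provider}: ${estimate_cost(provider, 2_000, 500):.6f}")
```

Because output tokens cost roughly three to four times as much as input tokens at these rates, the cheapest provider depends on the input-to-output ratio of the workload.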

