Qwen3-Next-80B-A3B Thinking
Reasoning variant of Qwen3-Next-80B-A3B, released September 2025. It uses the same hybrid attention architecture as the base model (80B total parameters, 3B active, 512 experts) and adds chain-of-thought reasoning.
At a glance
- Context window: 131K tokens
- Max output: 66K tokens
- Knowledge cutoff: Jun 2025
- Modalities: Text → Text
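The context window and max output figures together bound how long a completion can be for a given prompt. A minimal sketch of that arithmetic follows, assuming the rounded limits listed here (131K and 66K tokens) and assuming that prompt and output tokens share one context window, which this page does not state. The function name `max_completion_budget` is hypothetical.

```python
# Rounded limits taken from the spec list; exact provider limits may differ.
CONTEXT_WINDOW = 131_000  # tokens (assumed to cover prompt + output)
MAX_OUTPUT = 66_000       # tokens


def max_completion_budget(prompt_tokens: int) -> int:
    """Return the largest output budget that fits, or 0 if the prompt is too long."""
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    # The budget is capped by both the output limit and the space left in the window.
    return max(0, min(MAX_OUTPUT, remaining))
```

Under these assumptions, a 100,000-token prompt leaves room for at most 31,000 output tokens, while a short prompt is limited only by the 66K output cap.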
Capabilities
Function calling
Connect to external tools, APIs, and systems.
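As an illustration of what function calling involves on the application side, the sketch below dispatches a tool call to a local Python function. It assumes the model's tool call arrives as a dictionary with a `name` and a JSON-encoded `arguments` string, a shape used by many OpenAI-compatible APIs but not confirmed by this page. The `get_weather` tool and `dispatch_tool_call` helper are hypothetical.

```python
import json


def get_weather(city: str) -> dict:
    """Hypothetical tool; a real one would query an external service."""
    return {"city": city, "forecast": "sunny"}


# Registry mapping tool names the model may request to local callables.
TOOLS = {"get_weather": get_weather}


def dispatch_tool_call(tool_call: dict) -> str:
    """Run the requested tool and return its result as a JSON string."""
    name = tool_call["name"]
    if name not in TOOLS:
        raise KeyError(f"unknown tool: {name}")
    # Assumed format: arguments are delivered as a JSON-encoded string.
    args = json.loads(tool_call["arguments"])
    return json.dumps(TOOLS[name](**args))


example_call = {"name": "get_weather", "arguments": '{"city": "Paris"}'}
result_json = dispatch_tool_call(example_call)
```

The returned JSON string would typically be sent back to the model as the tool result so it can compose its final answer.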
Structured output
Return responses in structured formats like JSON.
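Even when a model is asked for JSON, it is prudent to validate the reply before using it. The sketch below parses a raw reply and checks a small set of required fields. The field names (`title`, `score`) and the helper `parse_structured_reply` are hypothetical and chosen only for illustration.

```python
import json

# Hypothetical schema: field name -> accepted Python type(s).
REQUIRED_FIELDS = {"title": str, "score": (int, float)}


def parse_structured_reply(raw: str) -> dict:
    """Parse a JSON reply and verify that the required fields have the expected types."""
    data = json.loads(raw)  # raises json.JSONDecodeError (a ValueError) on bad JSON
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")
    for field, expected in REQUIRED_FIELDS.items():
        if field not in data:
            raise ValueError(f"missing field: {field}")
        if not isinstance(data[field], expected):
            raise ValueError(f"wrong type for field: {field}")
    return data
```

Every failure mode raises `ValueError`, so a caller can catch a single exception type and retry the request or fall back to a default.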
Pricing by provider
| Provider | Input / 1M tokens | Output / 1M tokens |
|---|---|---|
| | $0.15 | $1.50 |
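To turn the per-million-token rates into a per-request estimate, multiply each token count by its rate and divide by one million. The sketch below does this with the listed rates ($0.15 input, $1.50 output). It ignores caching, batching, and tiered pricing, and the function name `estimate_cost_usd` is hypothetical.

```python
# Listed rates in USD per 1M tokens; verify with the provider before relying on them.
INPUT_USD_PER_M = 0.15
OUTPUT_USD_PER_M = 1.50


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request from its input and output token counts."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000
```

For example, a request with 20,000 input tokens and 4,000 output tokens comes to roughly $0.009. Because a reasoning model's chain-of-thought typically counts as output, the output rate often dominates the bill.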
Heads up: We do our best to keep these specs & prices accurate. However, cloud costs may fluctuate based on region, usage, and other factors not listed here. These are estimates based on common setups and are for informational purposes only. Always verify current rates & exact specs with the provider before provisioning.
Estimated prices shown. Actual costs may vary based on context length, batch size, caching, and provider-specific pricing tiers.