GPT-OSS-120B - Specs & Pricing [2026]

Released August 2025, OpenAI's first open-weight model. A 117B MoE (5.1B active) fitting on a single 80 GB GPU. Matches o4-mini on reasoning benchmarks. Apache 2.0.

At a glance

Context window

131K tokens

Max output

131K tokens

Knowledge cutoff

Jun 2024

Modalities

Text → Text

Capabilities

Function calling

Connect to external tools, APIs, and systems.

Structured output

Return responses in structured formats like JSON.

Pricing by provider

Provider	Input / 1M tokens	Output / 1M tokens
Novita	$0.05	$0.25	View
Hyperstack	$0.10	$0.40	View
Azure	$0.15	$0.60	View

Provider

Input / 1M tokens

Output / 1M tokens

Novita

$0.05

$0.25

View

Hyperstack

$0.10

$0.40

View

Azure

$0.15

$0.60

View

Heads up: We do our best to keep these specs & prices accurate. However, cloud costs may fluctuate based on region, usage, and other factors not listed here. These are estimates based on common setups and are for informational purposes only. Always verify current rates & exact specs with the provider before provisioning.

At a glance

Capabilities

Function calling

Structured output

Pricing by provider

Compare with other models