Gemma 3 12B Instruct - Specs & Pricing [2026]

Released in March 2025, Gemma 3 12B Instruct is a 12B-parameter instruction-tuned model from Google's open-weight Gemma 3 family. It runs on a single consumer GPU and accepts image input alongside text.

At a glance

Context window

128K tokens

Max output

8,192 tokens

Knowledge cutoff

Aug 2024

Modalities

Text, Image → Text

Capabilities

Function calling

Connect to external tools, APIs, and systems.
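As a rough illustration of what function calling involves on the application side, here is a minimal Python sketch of a tool dispatcher. It assumes the model has been prompted to emit a JSON object with `name` and `arguments` fields. That format, the `get_weather` tool, and the `dispatch` helper are all hypothetical, and the actual call format depends on the prompt template or provider you use.

```python
import json

# Hypothetical tool for illustration; a real deployment would call an actual API.
def get_weather(city: str) -> str:
    return f"Sunny in {city}"

# Registry mapping tool names the model may request to Python callables.
TOOLS = {"get_weather": get_weather}

def dispatch(model_output: str) -> str:
    """Parse a JSON tool call (assumed format) and run the matching function."""
    call = json.loads(model_output)
    func = TOOLS.get(call["name"])
    if func is None:
        raise ValueError(f"unknown tool: {call['name']}")
    return func(**call.get("arguments", {}))

# Hand-written stand-in for model output, not a real model response.
print(dispatch('{"name": "get_weather", "arguments": {"city": "Paris"}}'))
```

In practice the tool result would be sent back to the model as a follow-up message so it can compose the final answer.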

Pricing by provider

Provider	Input / 1M tokens	Output / 1M tokens
Novita	$0.05	$0.10
Google Cloud	Self-hosted	Self-hosted
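To turn the per-1M-token rates into a per-workload figure, a small calculation like the following can help. It is a sketch using the Novita rates listed above. The function name and the example token counts are illustrative assumptions, and actual billing may differ.

```python
# Per-1M-token rates in USD, copied from the Novita row of the table above.
INPUT_USD_PER_M = 0.05
OUTPUT_USD_PER_M = 0.10

def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate cost in USD for a workload at the listed Novita rates."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000

# Hypothetical workload: 2M input tokens and 500K output tokens.
print(f"${estimate_cost_usd(2_000_000, 500_000):.2f}")  # prints $0.15
```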

Heads up: We do our best to keep these specs & prices accurate. However, cloud costs may fluctuate based on region, usage, and other factors not listed here. These are estimates based on common setups and are for informational purposes only. Always verify current rates & exact specs with the provider before provisioning.