Mistral NeMo
View website
A 12B model built with NVIDIA, released July 2024 under Apache 2.0. Runs on a single GPU with 11-language support.
At a glance
- Context window
- 128K tokens
- Max output
- 33K tokens
- Knowledge cutoff
- Jul 2024
- Modalities
- Text Text → Text Text
Pricing by provider
| Provider | Input / 1M tokens | Output / 1M tokens | |
|---|---|---|---|
|
|
$0.04 | $0.17 | |
|
|
$0.15 | $0.15 |
Heads up: We do our best to keep these specs & prices accurate. However, cloud costs may fluctuate based on region, usage, and other factors not listed here. These are estimates based on common setups and are for informational purposes only. Always verify current rates & exact specs with the provider before provisioning.
Compare with other models
Estimated prices shown. Actual costs may vary based on context length, batch size, caching, and provider-specific pricing tiers.