DeepSeek R1 Distill Llama 70B

Released in January 2025, this is a 70B-parameter reasoning model distilled from DeepSeek-R1 into the Llama 3.3 architecture. The weights are open and distributed under the Llama 3.3 license.

At a glance

Context window: 131K tokens
Max output: 33K tokens
Knowledge cutoff: Jul 2024
Modalities: Text
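
As a rough illustration of how the context window and max output limits interact, the sketch below computes how many output tokens a request could still ask for given a prompt of a certain size. It assumes that prompt and output tokens share the same context window, and it treats "131K" and "33K" as 131,000 and 33,000; the exact limits and the accounting rule are assumptions, so check the provider's documentation. The function name `max_completion_tokens` is hypothetical.

```python
# Nominal limits read off the spec card; exact values are an assumption.
CONTEXT_WINDOW_TOKENS = 131_000
MAX_OUTPUT_TOKENS = 33_000


def max_completion_tokens(prompt_tokens: int) -> int:
    """Return the largest output budget that fits both limits.

    Assumes prompt and output tokens count against one shared context window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW_TOKENS - prompt_tokens
    # Capped by the output limit, and never negative if the prompt overflows.
    return max(0, min(MAX_OUTPUT_TOKENS, remaining))


if __name__ == "__main__":
    for prompt in (10_000, 120_000, 200_000):
        print(prompt, max_completion_tokens(prompt))
```

Under these assumptions, a short prompt leaves the full output limit available, while a prompt near the context limit leaves only the remainder.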

Pricing by provider

Provider | Input / 1M tokens | Output / 1M tokens
Novita | $0.80 | $0.80
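
To make the per-million-token rates concrete, here is a minimal sketch that estimates the cost of a single request at the Novita rates listed above. The function name `estimate_cost_usd` and the example token counts are hypothetical, and the calculation ignores caching, tiers, or any other provider-specific adjustments.

```python
# Listed Novita rates, in USD per 1M tokens.
INPUT_USD_PER_M = 0.80
OUTPUT_USD_PER_M = 0.80


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated request cost in USD at the listed rates."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: 10,000 prompt tokens and 2,000 generated tokens.
    print(f"${estimate_cost_usd(10_000, 2_000):.4f}")  # prints $0.0096
```

Because reasoning models tend to produce long outputs, the output term often dominates in practice even when input and output rates are equal.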

Heads up: We do our best to keep these specs & prices accurate. However, cloud costs may fluctuate based on region, usage, and other factors not listed here. These are estimates based on common setups and are for informational purposes only. Always verify current rates & exact specs with the provider before provisioning.

Estimated prices shown. Actual costs may vary based on context length, batch size, caching, and provider-specific pricing tiers.