R1 Distill Llama 70B

Reasoning Open Weights

DeepSeek R1 Distill Llama 70B is a distilled large language model based on [Llama-3.3-70B-Instruct](/meta-llama/llama-3.3-70b-instruct), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). The model combines advanced distillation techniques to achieve high performance across...

Providers 2

Released Jan 23, 2025

Input Modalities text

Output Modalities text

Benchmarks

Available Providers (2)

Provider	Model ID	Input Cost	Output Cost	Context	Max Output	Docs
Kilo Gateway	`deepseek/deepseek-r1-distill-llama-70b`	$0.70/MTok	$0.80/MTok	131.1K	16.4K
OpenRouter	`deepseek/deepseek-r1-distill-llama-70b`	$0.80/MTok	$0.80/MTok	128K	8.2K

Capabilities

Reasoning

Tool Calling

Attachments

Open Weights

Structured Output