All Models

DeepSeek R1 Distill Llama 70B

deepseek-thinking Reasoning Tool Calling Open Weights Structured Output

DeepSeek R1 Distill Llama 70B is a distilled large language model based on [Llama-3.3-70B-Instruct](/meta-llama/llama-3.3-70b-instruct), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). The model combines advanced distillation techniques to achieve high performance across multiple benchmarks, including: - AIME 2024 pass@1: 70.0 - MATH-500 pass@1: 94.5 - CodeForces Rating: 1633 The model leverages fine-tuning from DeepSeek R1's outputs, enabling competitive performance comparable to larger frontier models.

Providers 11
Released Jan 1, 2025
Input Modalities text
Output Modalities text
Tarsk Use coding

Available Providers (11)

Provider Model ID Input Cost Output Cost Context Max Output Docs
OpenRouter deepseek/deepseek-r1-distill-llama-70b $0.00/MTok $0.00/MTok 8.2K 8.2K
FastRouter deepseek-ai/deepseek-r1-distill-llama-70b $0.03/MTok $0.14/MTok 131.1K 131.1K
Helicone deepseek-r1-distill-llama-70b $0.03/MTok $0.13/MTok 128K 4.1K
Chutes deepseek-ai/DeepSeek-R1-Distill-Llama-70B $0.03/MTok $0.11/MTok 131.1K 131.1K
Alibaba (China) deepseek-r1-distill-llama-70b $0.29/MTok $0.86/MTok 32.8K 16.4K
Kilo Gateway deepseek/deepseek-r1-distill-llama-70b $0.70/MTok $0.80/MTok 131.1K 16.4K
OVHcloud AI Endpoints deepseek-r1-distill-llama-70b $0.74/MTok $0.74/MTok 131.1K 131.1K
Groq deepseek-r1-distill-llama-70b $0.75/MTok $0.99/MTok 131.1K 8.2K
NovitaAI deepseek/deepseek-r1-distill-llama-70b $0.80/MTok $0.80/MTok 8.2K 8.2K
Scaleway deepseek-r1-distill-llama-70b $0.90/MTok $0.90/MTok 32K 8.2K
Vultr DeepSeek-R1-Distill-Llama-70B $2.00/MTok $2.00/MTok 130K 4.1K

Capabilities

Reasoning
Tool Calling
Attachments
Open Weights
Structured Output