39 Models
14 Families
10M Max Context
$0.02–$16.50 Input Cost/MTok
$0.03–$82.50 Output Cost/MTok
Capabilities
Reasoning
Tool Calling
Attachments
Open Weights
Setup
Set the following environment variable to use Deep Infra:
DEEPINFRA_API_KEY
Models (39)
| Model | Family | Model ID | Input Cost | Output Cost | Context | Capabilities |
|---|---|---|---|---|---|---|
| Llama 3.1 8B Turbo | llama | meta-llama/Llama-3.1-8B-Instruct-Turbo | $0.02/MTok | $0.03/MTok | 131.1K | Tools, Open |
| Llama 3.1 8B | llama | meta-llama/Llama-3.1-8B-Instruct | $0.02/MTok | $0.05/MTok | 131.1K | Tools, Open |
| GPT OSS 20B | gpt-oss | openai/gpt-oss-20b | $0.03/MTok | $0.14/MTok | 131.1K | Reasoning, Tools, Open |
| GPT OSS 120B | gpt-oss | openai/gpt-oss-120b | $0.05/MTok | $0.24/MTok | 131.1K | Reasoning, Tools, Open |
| GLM-4.7-Flash | glm-flash | zai-org/GLM-4.7-Flash | $0.06/MTok | $0.40/MTok | 202.8K | Reasoning, Tools, Open |
| Gemma 4 26B A4B IT | gemma | google/gemma-4-26B-A4B-it | $0.07/MTok | $0.34/MTok | 262.1K | Reasoning, Tools, Open |
| Llama 4 Scout 17B | llama | meta-llama/Llama-4-Scout-17B-16E-Instruct | $0.08/MTok | $0.30/MTok | 10M | Tools, Open |
| Llama 3.3 70B Turbo | llama | meta-llama/Llama-3.3-70B-Instruct-Turbo | $0.10/MTok | $0.32/MTok | 131.1K | Tools, Open |
| Gemma 4 31B IT | gemma | google/gemma-4-31B-it | $0.13/MTok | $0.38/MTok | 262.1K | Reasoning, Tools, Open |
| DeepSeek V4 Flash | deepseek-flash | deepseek-ai/DeepSeek-V4-Flash | $0.14/MTok | $0.28/MTok | 1M | Reasoning, Tools, Open |
| Llama 4 Maverick 17B FP8 | llama | meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | $0.15/MTok | $0.60/MTok | 1M | Tools, Open |
| Qwen3.6 35B A3B | qwen | Qwen/Qwen3.6-35B-A3B | $0.20/MTok | $1/MTok | 262.1K | Reasoning, Tools, Open |
| Qwen 3.5 35B A3B | qwen | Qwen/Qwen3.5-35B-A3B | $0.20/MTok | $0.95/MTok | 262.1K | Reasoning, Tools, Open |
| MiniMax M2 | minimax | MiniMaxAI/MiniMax-M2 | $0.25/MTok | $1.02/MTok | 262.1K | Reasoning, Tools, Open |
| DeepSeek-V3.2 | | deepseek-ai/DeepSeek-V3.2 | $0.26/MTok | $0.38/MTok | 163.8K | Reasoning, Tools |
| MiniMax M2.5 | minimax | MiniMaxAI/MiniMax-M2.5 | $0.27/MTok | $0.95/MTok | 204.8K | Reasoning, Tools, Open |
| MiniMax M2.1 | | MiniMaxAI/MiniMax-M2.1 | $0.28/MTok | $1.20/MTok | 196.6K | Reasoning, Tools, Open |
| GLM-4.6V | glm | zai-org/GLM-4.6V | $0.30/MTok | $0.90/MTok | 204.8K | Reasoning, Tools, Open |
| Qwen3 Coder 480B A35B Instruct Turbo | qwen | Qwen/Qwen3-Coder-480B-A35B-Instruct-Turbo | $0.30/MTok | $1.20/MTok | 262.1K | Tools, Open |
| Kimi K2 0905 | kimi | moonshotai/Kimi-K2-Instruct-0905 | $0.40/MTok | $2/MTok | 262.1K | Tools, Open |
| Llama 3.1 70B | llama | meta-llama/Llama-3.1-70B-Instruct | $0.40/MTok | $0.40/MTok | 131.1K | Tools, Open |
| Llama 3.1 70B Turbo | llama | meta-llama/Llama-3.1-70B-Instruct-Turbo | $0.40/MTok | $0.40/MTok | 131.1K | Tools, Open |
| Qwen3 Coder 480B A35B Instruct | qwen | Qwen/Qwen3-Coder-480B-A35B-Instruct | $0.40/MTok | $1.60/MTok | 262.1K | Tools, Open |
| MiMo-V2.5 | mimo | xiaomi/mimo-v2.5 | $0.40/MTok | $2/MTok | 262.1K | Reasoning, Tools, Open |
| GLM-4.7 | glm | zai-org/GLM-4.7 | $0.43/MTok | $1.75/MTok | 202.8K | Reasoning, Tools, Open |
| GLM-4.6 | glm | zai-org/GLM-4.6 | $0.43/MTok | $1.74/MTok | 204.8K | Reasoning, Tools, Open |
| DeepSeek V4 Pro | deepseek-thinking | deepseek-ai/DeepSeek-V4-Pro | $0.43/MTok | $0.87/MTok | 65.5K | Reasoning, Tools, Open |
| Kimi K2 Thinking | kimi-thinking | moonshotai/Kimi-K2-Thinking | $0.47/MTok | $2/MTok | 131.1K | Reasoning, Tools, Open |
| Kimi K2.5 | kimi | moonshotai/Kimi-K2.5 | $0.50/MTok | $2.80/MTok | 262.1K | Reasoning, Tools, Open |
| Kimi K2 | kimi | moonshotai/Kimi-K2-Instruct | $0.50/MTok | $2/MTok | 131.1K | Tools, Open |
| DeepSeek-R1-0528 | | deepseek-ai/DeepSeek-R1-0528 | $0.50/MTok | $2.15/MTok | 163.8K | Reasoning |
| Qwen 3.5 397B A17B | qwen | Qwen/Qwen3.5-397B-A17B | $0.54/MTok | $3.40/MTok | 262.1K | Reasoning, Tools, Open |
| GLM-4.5 | glm | zai-org/GLM-4.5 | $0.60/MTok | $2.20/MTok | 131.1K | Tools, Open |
| Kimi K2.6 | kimi | moonshotai/Kimi-K2.6 | $0.75/MTok | $3.50/MTok | 262.1K | Reasoning, Tools, Open |
| GLM-5 | glm | zai-org/GLM-5 | $0.80/MTok | $2.56/MTok | 202.8K | Reasoning, Tools, Open |
| MiMo-V2.5-Pro | mimo | xiaomi/mimo-v2.5-pro | $1/MTok | $3/MTok | 1M | Reasoning, Tools, Open |
| GLM-5.1 | glm | zai-org/GLM-5.1 | $1.40/MTok | $4.40/MTok | 202.8K | Reasoning, Tools, Open |
| Claude Sonnet 3.7 (Latest) | claude-sonnet | anthropic/claude-3-7-sonnet-latest | $3.30/MTok | $16.50/MTok | 200K | Reasoning, Tools |
| Claude Opus 4 | claude-opus | anthropic/claude-4-opus | $16.50/MTok | $82.50/MTok | 200K | Reasoning, Tools |
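Prices in the table are quoted per million tokens (MTok), so a rough per-request cost can be estimated from token counts. The sketch below is illustrative only: `Pricing`, `PRICES`, and `estimate_cost` are hypothetical names, the three entries are copied from the table, and the calculation assumes billing is strictly linear in input and output tokens, ignoring anything else the provider may charge for.

```python
from dataclasses import dataclass

TOKENS_PER_MTOK = 1_000_000  # "MTok" in the table means one million tokens


@dataclass(frozen=True)
class Pricing:
    """Hypothetical container for one row's prices, in USD per MTok."""
    input_per_mtok: float
    output_per_mtok: float


# A few rows copied from the table; extend with other Model IDs as needed.
PRICES = {
    "meta-llama/Llama-3.1-8B-Instruct-Turbo": Pricing(0.02, 0.03),
    "openai/gpt-oss-120b": Pricing(0.05, 0.24),
    "anthropic/claude-4-opus": Pricing(16.50, 82.50),
}


def estimate_cost(model_id: str, input_tokens: int, output_tokens: int) -> float:
    """Return an estimated cost in USD, assuming purely linear per-token billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    price = PRICES[model_id]  # raises KeyError for models not listed above
    total = (input_tokens * price.input_per_mtok
             + output_tokens * price.output_per_mtok)
    return total / TOKENS_PER_MTOK


if __name__ == "__main__":
    # Compare the same workload (2,000 input / 500 output tokens) across models.
    for model_id in PRICES:
        cost = estimate_cost(model_id, 2_000, 500)
        print(f"{model_id}: ${cost:.6f}")
```

Keeping the prices in a plain dictionary keyed by Model ID makes it easy to compare the same workload across rows. The listed figures may change, so treat any result as an estimate.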