All Providers
39 Models
14 Families
10M Max Context
$0.02–$16.50 Input Cost/MTok
$0.03–$82.50 Output Cost/MTok

Capabilities

Reasoning
Tool Calling
Attachments
Open Weights

Setup

Set the following environment variable to use Deep Infra:

DEEPINFRA_API_KEY

Models (39)

Model Model ID Input Cost Output Cost Context Capabilities
Llama 3.1 8B Turbo llama meta-llama/Llama-3.1-8B-Instruct-Turbo $0.02/MTok $0.03/MTok 131.1K Tools Open
Llama 3.1 8B llama meta-llama/Llama-3.1-8B-Instruct $0.02/MTok $0.05/MTok 131.1K Tools Open
GPT OSS 20B gpt-oss openai/gpt-oss-20b $0.03/MTok $0.14/MTok 131.1K Reasoning Tools Open
GPT OSS 120B gpt-oss openai/gpt-oss-120b $0.05/MTok $0.24/MTok 131.1K Reasoning Tools Open
GLM-4.7-Flash glm-flash zai-org/GLM-4.7-Flash $0.06/MTok $0.40/MTok 202.8K Reasoning Tools Open
Gemma 4 26B A4B IT gemma google/gemma-4-26B-A4B-it $0.07/MTok $0.34/MTok 262.1K Reasoning Tools Files Open
Llama 4 Scout 17B llama meta-llama/Llama-4-Scout-17B-16E-Instruct $0.08/MTok $0.30/MTok 10M Tools Open
Llama 3.3 70B Turbo llama meta-llama/Llama-3.3-70B-Instruct-Turbo $0.10/MTok $0.32/MTok 131.1K Tools Open
Gemma 4 31B IT gemma google/gemma-4-31B-it $0.13/MTok $0.38/MTok 262.1K Reasoning Tools Files Open
DeepSeek V4 Flash deepseek-flash deepseek-ai/DeepSeek-V4-Flash $0.14/MTok $0.28/MTok 1M Reasoning Tools Open
Llama 4 Maverick 17B FP8 llama meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 $0.15/MTok $0.60/MTok 1M Tools Open
Qwen3.6 35B A3B qwen Qwen/Qwen3.6-35B-A3B $0.20/MTok $1/MTok 262.1K Reasoning Tools Files Open
Qwen 3.5 35B A3B qwen Qwen/Qwen3.5-35B-A3B $0.20/MTok $0.95/MTok 262.1K Reasoning Tools Files Open
MiniMax M2 minimax MiniMaxAI/MiniMax-M2 $0.25/MTok $1.02/MTok 262.1K Reasoning Tools Open
DeepSeek-V3.2 deepseek-ai/DeepSeek-V3.2 $0.26/MTok $0.38/MTok 163.8K Reasoning Tools
MiniMax M2.5 minimax MiniMaxAI/MiniMax-M2.5 $0.27/MTok $0.95/MTok 204.8K Reasoning Tools Open
MiniMax M2.1 MiniMaxAI/MiniMax-M2.1 $0.28/MTok $1.20/MTok 196.6K Reasoning Tools Open
GLM-4.6V glm zai-org/GLM-4.6V $0.30/MTok $0.90/MTok 204.8K Reasoning Tools Files Open
Qwen3 Coder 480B A35B Instruct Turbo qwen Qwen/Qwen3-Coder-480B-A35B-Instruct-Turbo $0.30/MTok $1.20/MTok 262.1K Tools Open
Kimi K2 0905 kimi moonshotai/Kimi-K2-Instruct-0905 $0.40/MTok $2/MTok 262.1K Tools Open
Llama 3.1 70B llama meta-llama/Llama-3.1-70B-Instruct $0.40/MTok $0.40/MTok 131.1K Tools Open
Llama 3.1 70B Turbo llama meta-llama/Llama-3.1-70B-Instruct-Turbo $0.40/MTok $0.40/MTok 131.1K Tools Open
Qwen3 Coder 480B A35B Instruct qwen Qwen/Qwen3-Coder-480B-A35B-Instruct $0.40/MTok $1.60/MTok 262.1K Tools Open
MiMo-V2.5 mimo xiaomi/mimo-v2.5 $0.40/MTok $2/MTok 262.1K Reasoning Tools Files Open
GLM-4.7 glm zai-org/GLM-4.7 $0.43/MTok $1.75/MTok 202.8K Reasoning Tools Open
GLM-4.6 glm zai-org/GLM-4.6 $0.43/MTok $1.74/MTok 204.8K Reasoning Tools Open
DeepSeek V4 Pro deepseek-thinking deepseek-ai/DeepSeek-V4-Pro $0.43/MTok $0.87/MTok 65.5K Reasoning Tools Open
Kimi K2 Thinking kimi-thinking moonshotai/Kimi-K2-Thinking $0.47/MTok $2/MTok 131.1K Reasoning Tools Open
Kimi K2.5 kimi moonshotai/Kimi-K2.5 $0.50/MTok $2.80/MTok 262.1K Reasoning Tools Files Open
Kimi K2 kimi moonshotai/Kimi-K2-Instruct $0.50/MTok $2/MTok 131.1K Tools Open
DeepSeek-R1-0528 deepseek-ai/DeepSeek-R1-0528 $0.50/MTok $2.15/MTok 163.8K Reasoning
Qwen 3.5 397B A17B qwen Qwen/Qwen3.5-397B-A17B $0.54/MTok $3.40/MTok 262.1K Reasoning Tools Files Open
GLM-4.5 glm zai-org/GLM-4.5 $0.60/MTok $2.20/MTok 131.1K Tools Open
Kimi K2.6 kimi moonshotai/Kimi-K2.6 $0.75/MTok $3.50/MTok 262.1K Reasoning Tools Files Open
GLM-5 glm zai-org/GLM-5 $0.80/MTok $2.56/MTok 202.8K Reasoning Tools Open
MiMo-V2.5-Pro mimo xiaomi/mimo-v2.5-pro $1/MTok $3/MTok 1.0M Reasoning Tools Open
GLM-5.1 glm zai-org/GLM-5.1 $1.40/MTok $4.40/MTok 202.8K Reasoning Tools Open
Claude Sonnet 3.7 (Latest) claude-sonnet anthropic/claude-3-7-sonnet-latest $3.30/MTok $16.50/MTok 200K Reasoning Tools Files
Claude Opus 4 claude-opus anthropic/claude-4-opus $16.50/MTok $82.50/MTok 200K Reasoning Tools Files