All Models
| Model | Model ID | Input Cost | Output Cost | Context | Max Output | Capabilities |
|---|---|---|---|---|---|---|
| GLM-4.7-Flash glm | hf:zai-org/GLM-4.7-Flash | $0.06/MTok | $0.40/MTok | 196.6K | 65.5K | Reasoning Tools Open |
| GPT OSS 120B gpt-oss | hf:openai/gpt-oss-120b | $0.10/MTok | $0.10/MTok | 128K | 32.8K | Reasoning Tools Open |
| Llama-4-Scout-17B-16E-Instruct llama | hf:meta-llama/Llama-4-Scout-17B-16E-Instruct | $0.15/MTok | $0.60/MTok | 328K | 4.1K | Tools Open |
| Llama-3.1-8B-Instruct llama | hf:meta-llama/Llama-3.1-8B-Instruct | $0.20/MTok | $0.20/MTok | 128K | 32.8K | Reasoning Tools Open |
| Qwen 3 235B Instruct qwen | hf:Qwen/Qwen3-235B-A22B-Instruct-2507 | $0.20/MTok | $0.60/MTok | 256K | 32K | Tools Open |
| Llama-4-Maverick-17B-128E-Instruct-FP8 llama | hf:meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | $0.22/MTok | $0.88/MTok | 524K | 4.1K | Tools Open |
| DeepSeek V3.2 deepseek | hf:deepseek-ai/DeepSeek-V3.2 | $0.27/MTok | $0.40/MTok | 162.8K | 8K | Reasoning Tools Open |
| MiniMax-M2 minimax | hf:MiniMaxAI/MiniMax-M2 | $0.55/MTok | $2.19/MTok | 196.6K | 131K | Reasoning Tools Open |
| MiniMax-M2.1 minimax | hf:MiniMaxAI/MiniMax-M2.1 | $0.55/MTok | $2.19/MTok | 204.8K | 131.1K | Reasoning Tools Open |
| DeepSeek R1 deepseek-thinking | hf:deepseek-ai/DeepSeek-R1 | $0.55/MTok | $2.19/MTok | 128K | 128K | Reasoning Tools Open |
| Kimi K2.5 kimi | hf:moonshotai/Kimi-K2.5 | $0.55/MTok | $2.19/MTok | 262.1K | 65.5K | Reasoning Tools Open |
| Kimi K2 Thinking kimi-thinking | hf:moonshotai/Kimi-K2-Thinking | $0.55/MTok | $2.19/MTok | 262.1K | 262.1K | Reasoning Tools Open |
| Kimi K2.5 (NVFP4) kimi | hf:nvidia/Kimi-K2.5-NVFP4 | $0.55/MTok | $2.19/MTok | 262.1K | 65.5K | Reasoning Tools Open |
| GLM 4.6 glm | hf:zai-org/GLM-4.6 | $0.55/MTok | $2.19/MTok | 200K | 64K | Reasoning Tools Open |
| GLM 4.7 glm | hf:zai-org/GLM-4.7 | $0.55/MTok | $2.19/MTok | 200K | 64K | Reasoning Tools Open |
| DeepSeek V3.1 deepseek | hf:deepseek-ai/DeepSeek-V3.1 | $0.56/MTok | $1.68/MTok | 128K | 128K | Reasoning Tools |
| MiniMax-M2.5 minimax | hf:MiniMaxAI/MiniMax-M2.5 | $0.60/MTok | $3.00/MTok | 191.5K | 65.5K | Reasoning Tools Open |
| Qwen3 235B A22B Thinking 2507 qwen | hf:Qwen/Qwen3-235B-A22B-Thinking-2507 | $0.65/MTok | $3.00/MTok | 256K | 32K | Reasoning Tools Open |
| Qwen2.5-Coder-32B-Instruct qwen | hf:Qwen/Qwen2.5-Coder-32B-Instruct | $0.80/MTok | $0.80/MTok | 32.8K | 32.8K | Open |
| Llama-3.1-70B-Instruct llama | hf:meta-llama/Llama-3.1-70B-Instruct | $0.90/MTok | $0.90/MTok | 128K | 32.8K | Reasoning Tools Open |
| Llama-3.3-70B-Instruct llama | hf:meta-llama/Llama-3.3-70B-Instruct | $0.90/MTok | $0.90/MTok | 128K | 32.8K | Reasoning Tools Open |
| DeepSeek V3 (0324) deepseek | hf:deepseek-ai/DeepSeek-V3-0324 | $1.20/MTok | $1.20/MTok | 128K | 128K | Tools |
| DeepSeek V3.1 Terminus deepseek | hf:deepseek-ai/DeepSeek-V3.1-Terminus | $1.20/MTok | $1.20/MTok | 128K | 128K | Reasoning Tools |
| Kimi K2 0905 kimi | hf:moonshotai/Kimi-K2-Instruct-0905 | $1.20/MTok | $1.20/MTok | 262.1K | 32.8K | Tools Open |
| DeepSeek V3 deepseek | hf:deepseek-ai/DeepSeek-V3 | $1.25/MTok | $1.25/MTok | 128K | 128K | Reasoning Tools Open |
| Qwen 3 Coder 480B qwen | hf:Qwen/Qwen3-Coder-480B-A35B-Instruct | $2.00/MTok | $2.00/MTok | 256K | 32K | Tools Open |
| DeepSeek R1 (0528) deepseek-thinking | hf:deepseek-ai/DeepSeek-R1-0528 | $3.00/MTok | $8.00/MTok | 128K | 128K | Reasoning Tools |
| Llama-3.1-405B-Instruct llama | hf:meta-llama/Llama-3.1-405B-Instruct | $3.00/MTok | $3.00/MTok | 128K | 32.8K | Reasoning Tools Open |