All Models
| Model | Model ID | Input Cost | Output Cost | Context | Max Output | Capabilities |
|---|---|---|---|---|---|---|
| FLUX.1-dev | black-forest-labs/flux-dev | $0.00/MTok | $0.00/MTok | 77 | — | Open |
| FLUX.1-schnell | black-forest-labs/flux-schnell | $0.00/MTok | $0.00/MTok | 77 | — | Open |
| BGE-ICL text-embedding | BAAI/bge-en-icl | $0.01/MTok | $0.00/MTok | 32.8K | — | Open |
| bge-multilingual-gemma2 text-embedding | BAAI/bge-multilingual-gemma2 | $0.01/MTok | $0.00/MTok | 8.2K | — | Open |
| e5-mistral-7b-instruct text-embedding | intfloat/e5-mistral-7b-instruct | $0.01/MTok | $0.00/MTok | 32.8K | — | Open |
| Qwen3-Embedding-8B text-embedding | Qwen/Qwen3-Embedding-8B | $0.01/MTok | $0.00/MTok | 32.8K | — | Open |
| Gemma-2-2b-it | google/gemma-2-2b-it | $0.02/MTok | $0.06/MTok | 8.2K | 4.1K | Open |
| Meta-Llama-3.1-8B-Instruct | meta-llama/Meta-Llama-3.1-8B-Instruct | $0.02/MTok | $0.06/MTok | 128K | 4.1K | Tools Open |
| Llama-Guard-3-8B | meta-llama/Llama-Guard-3-8B | $0.02/MTok | $0.06/MTok | 8.2K | 1.0K | Open |
| Gemma-2-9b-it (Fast) | google/gemma-2-9b-it-fast | $0.03/MTok | $0.09/MTok | 8.2K | 4.1K | Open |
| Meta-Llama-3.1-8B-Instruct (Fast) | meta-llama/Meta-Llama-3.1-8B-Instruct-fast | $0.03/MTok | $0.09/MTok | 128K | 4.1K | Tools Open |
| Qwen2.5-Coder-7B (Fast) | Qwen/Qwen2.5-Coder-7B-fast | $0.03/MTok | $0.09/MTok | 128K | 8.2K | Tools Open |
| gpt-oss-20b | openai/gpt-oss-20b | $0.05/MTok | $0.20/MTok | 128K | 4.1K | Tools Open |
| Nemotron-3-Nano-30B-A3B | nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B | $0.06/MTok | $0.24/MTok | 32K | 4.1K | Tools Open |
| Nemotron-Nano-V2-12b | nvidia/Nemotron-Nano-V2-12b | $0.07/MTok | $0.20/MTok | 32K | 4.1K | Tools Open |
| Gemma-3-27b-it | google/gemma-3-27b-it | $0.10/MTok | $0.30/MTok | 110K | 8.2K | Tools Open |
| Qwen3-30B-A3B-Instruct-2507 | Qwen/Qwen3-30B-A3B-Instruct-2507 | $0.10/MTok | $0.30/MTok | 128K | 8.2K | Tools Open |
| Qwen3-32B | Qwen/Qwen3-32B | $0.10/MTok | $0.30/MTok | 128K | 8.2K | Tools Open |
| Qwen3-30B-A3B-Thinking-2507 | Qwen/Qwen3-30B-A3B-Thinking-2507 | $0.10/MTok | $0.30/MTok | 128K | 16.4K | Reasoning Tools Open |
| Qwen3-Coder-30B-A3B-Instruct | Qwen/Qwen3-Coder-30B-A3B-Instruct | $0.10/MTok | $0.30/MTok | 128K | 8.2K | Tools Open |
| Hermes-4-70B | NousResearch/Hermes-4-70B | $0.13/MTok | $0.40/MTok | 128K | 8.2K | Reasoning Tools Open |
| Llama-3.3-70B-Instruct | meta-llama/Llama-3.3-70B-Instruct | $0.13/MTok | $0.40/MTok | 128K | 8.2K | Tools Open |
| Qwen3-Next-80B-A3B-Thinking | Qwen/Qwen3-Next-80B-A3B-Thinking | $0.15/MTok | $1.20/MTok | 128K | 16.4K | Reasoning Tools Open |
| gpt-oss-120b | openai/gpt-oss-120b | $0.15/MTok | $0.60/MTok | 128K | 8.2K | Reasoning Tools Open |
| GLM-4.5-Air | zai-org/GLM-4.5-Air | $0.20/MTok | $1.20/MTok | 128K | 4.1K | Tools |
| INTELLECT-3 | PrimeIntellect/INTELLECT-3 | $0.20/MTok | $1.10/MTok | 128K | 8.2K | Tools Open |
| Gemma-3-27b-it (Fast) | google/gemma-3-27b-it-fast | $0.20/MTok | $0.60/MTok | 110K | 8.2K | Tools Open |
| Qwen3 235B A22B Thinking 2507 qwen | Qwen/Qwen3-235B-A22B-Thinking-2507 | $0.20/MTok | $0.80/MTok | 262.1K | 8.2K | Reasoning Tools |
| Qwen3-32B (Fast) | Qwen/Qwen3-32B-fast | $0.20/MTok | $0.60/MTok | 128K | 8.2K | Tools Open |
| Qwen3 235B A22B Instruct 2507 qwen | Qwen/Qwen3-235B-A22B-Instruct-2507 | $0.20/MTok | $0.60/MTok | 262.1K | 8.2K | Reasoning Tools |
| Llama-3.3-70B-Instruct (Fast) | meta-llama/Llama-3.3-70B-Instruct-fast | $0.25/MTok | $0.75/MTok | 128K | 8.2K | Tools Open |
| Qwen2.5-VL-72B-Instruct | Qwen/Qwen2.5-VL-72B-Instruct | $0.25/MTok | $0.75/MTok | 128K | 8.2K | Tools Open |
| Nemotron-3-Super-120B-A12B | nvidia/nemotron-3-super-120b-a12b | $0.30/MTok | $0.90/MTok | 256K | 32.8K | Reasoning Tools Open |
| MiniMax-M2.1 | MiniMaxAI/MiniMax-M2.1 | $0.30/MTok | $1.20/MTok | 128K | 8.2K | Reasoning Tools Open |
| DeepSeek-V3.2 | deepseek-ai/DeepSeek-V3.2 | $0.30/MTok | $0.45/MTok | 163K | 16.4K | Reasoning Tools Open |
| GLM-4.7 (FP8) | zai-org/GLM-4.7-FP8 | $0.40/MTok | $2.00/MTok | 128K | 4.1K | Tools |
| Qwen3 Coder 480B A35B Instruct qwen | Qwen/Qwen3-Coder-480B-A35B-Instruct | $0.40/MTok | $1.80/MTok | 262.1K | 66.5K | Tools |
| DeepSeek-V3-0324 | deepseek-ai/DeepSeek-V3-0324 | $0.50/MTok | $1.50/MTok | 128K | 8.2K | Tools Open |
| Kimi-K2-Instruct | moonshotai/Kimi-K2-Instruct | $0.50/MTok | $2.40/MTok | 200K | 8.2K | Tools |
| Kimi-K2.5-fast kimi | moonshotai/Kimi-K2.5-fast | $0.50/MTok | $2.50/MTok | 256K | 8.2K | Reasoning Tools Open |
| Kimi-K2.5 kimi | moonshotai/Kimi-K2.5 | $0.50/MTok | $2.50/MTok | 256K | 8.2K | Reasoning Tools Open |
| GLM-4.5 | zai-org/GLM-4.5 | $0.60/MTok | $2.20/MTok | 128K | 4.1K | Tools |
| Llama-3.1-Nemotron-Ultra-253B-v1 | nvidia/Llama-3_1-Nemotron-Ultra-253B-v1 | $0.60/MTok | $1.80/MTok | 128K | 4.1K | Tools Open |
| Kimi-K2-Thinking | moonshotai/Kimi-K2-Thinking | $0.60/MTok | $2.50/MTok | 128K | 16.4K | Reasoning Tools Open |
| DeepSeek-V3-0324 (Fast) | deepseek-ai/DeepSeek-V3-0324-fast | $0.75/MTok | $2.25/MTok | 128K | 8.2K | Tools Open |
| DeepSeek-R1-0528 | deepseek-ai/DeepSeek-R1-0528 | $0.80/MTok | $2.40/MTok | 128K | 32.8K | Reasoning Tools Open |
| GLM-5 | zai-org/GLM-5 | $1.00/MTok | $3.20/MTok | 200K | 16.4K | Reasoning Tools |
| Hermes-4-405B | NousResearch/Hermes-4-405B | $1.00/MTok | $3.00/MTok | 128K | 8.2K | Reasoning Tools Open |
| DeepSeek R1 0528 Fast deepseek | deepseek-ai/DeepSeek-R1-0528-fast | $2.00/MTok | $6.00/MTok | 131.1K | 8.2K | Reasoning Tools Open |