All Providers
93 Models
21 Families
1.0M Max Context
$0–$0.43 Input Cost/MTok
$0–$0.87 Output Cost/MTok
Capabilities
Reasoning
Tool Calling
Attachments
Open Weights
Setup
Set the following environment variable to use Nvidia:
NVIDIA_API_KEY Models (93)
| Model | Model ID | Input Cost | Output Cost | Context | Capabilities |
|---|---|---|---|---|---|
| GLM-4.7 glm | z-ai/glm4.7 | $0/MTok | $0/MTok | 204.8K | Reasoning Tools Open |
| GLM-5.1 glm | z-ai/glm-5.1 | $0/MTok | $0/MTok | 131.1K | Reasoning Tools Open |
| solar-10.7b-instruct | upstage/solar-10_7b-instruct | $0/MTok | $0/MTok | 128K | Tools Open |
| sarvam-m | sarvamai/sarvam-m | $0/MTok | $0/MTok | 128K | Tools Open |
| Magistral Small 2506 | mistralai/magistral-small-2506 | $0/MTok | $0/MTok | 32.8K | |
| Mistral Large 3 675B Instruct 2512 mistral-large | mistralai/mistral-large-3-675b-instruct-2512 | $0/MTok | $0/MTok | 262.1K | Tools Open |
| mistral-nemotron | mistralai/mistral-nemotron | $0/MTok | $0/MTok | 128K | Tools Open |
| Mistral: Mixtral 8x7B Instruct | mistralai/mixtral-8x7b-instruct | $0/MTok | $0/MTok | 32.8K | Tools Open |
| Mistral-7B-Instruct-v0.3 | mistralai/mistral-7b-instruct-v03 | $0/MTok | $0/MTok | 65.5K | Tools Open |
| Mistral: Mixtral 8x22B Instruct | mistralai/mixtral-8x22b-instruct | $0/MTok | $0/MTok | 65.5K | Tools Open |
| Mistral Medium 3 mistral-medium | mistralai/mistral-medium-3-instruct | $0/MTok | $0/MTok | 131.1K | |
| mistral-small-4-119b-2603 | mistralai/mistral-small-4-119b-2603 | $0/MTok | $0/MTok | 128K | Tools Open |
| Devstral-2-123B-Instruct-2512 devstral | mistralai/devstral-2-123b-instruct-2512 | $0/MTok | $0/MTok | 262.1K | Reasoning Tools Open |
| Qwen2.5 Coder 32b Instruct | qwen/qwen2.5-coder-32b-instruct | $0/MTok | $0/MTok | 128K | Tools Open |
| Qwen3.5 122B-A10B qwen | qwen/qwen3.5-122b-a10b | $0/MTok | $0/MTok | 262.1K | Reasoning Tools Open |
| Qwen3-Next-80B-A3B-Instruct qwen | qwen/qwen3-next-80b-a3b-instruct | $0/MTok | $0/MTok | 262.1K | Tools |
| Qwen Image Edit qwen | qwen/qwen-image-edit | $0/MTok | $0/MTok | — | |
| Qwen3 Coder 480B A35B Instruct qwen | qwen/qwen3-coder-480b-a35b-instruct | $0/MTok | $0/MTok | 262.1K | Tools |
| Qwen3-Next-80B-A3B-Thinking qwen | qwen/qwen3-next-80b-a3b-thinking | $0/MTok | $0/MTok | 262.1K | Reasoning Tools Open |
| Qwen Image qwen | qwen/qwen-image | $0/MTok | $0/MTok | — | |
| Qwen3.5-397B-A17B qwen | qwen/qwen3.5-397b-a17b | $0/MTok | $0/MTok | 262.1K | Reasoning Tools Open |
| FLUX.1-schnell | black-forest-labs/flux_1-schnell | $0/MTok | $0/MTok | 77 | Open |
| FLUX.2 Klein 4B flux | black-forest-labs/flux_2-klein-4b | $0/MTok | $0/MTok | 41.0K | Open |
| FLUX.1-Kontext-dev | black-forest-labs/flux_1-kontext-dev | $0/MTok | $0/MTok | 41.0K | Open |
| FLUX.1-dev flux | black-forest-labs/flux.1-dev | $0/MTok | $0/MTok | 4.1K | |
| Kimi K2 Thinking kimi-thinking | moonshotai/kimi-k2-thinking | $0/MTok | $0/MTok | 262.1K | Reasoning Tools Open |
| Kimi K2 Instruct kimi | moonshotai/kimi-k2-instruct | $0/MTok | $0/MTok | 128K | Reasoning Tools |
| Kimi K2.6 kimi-k2.6 | moonshotai/kimi-k2.6 | $0/MTok | $0/MTok | 262.1K | Reasoning Tools Open |
| Kimi K2 0905 kimi | moonshotai/kimi-k2-instruct-0905 | $0/MTok | $0/MTok | 262.1K | Tools Open |
| dracarys-llama-3.1-70b-instruct | abacusai/dracarys-llama-3_1-70b-instruct | $0/MTok | $0/MTok | 128K | Tools Open |
| DeepSeek V3.2 deepseek | deepseek-ai/deepseek-v3.2 | $0/MTok | $0/MTok | 163.8K | Reasoning Tools |
| DeepSeek V3.1 Terminus deepseek | deepseek-ai/deepseek-v3.1-terminus | $0/MTok | $0/MTok | 128K | Reasoning Tools |
| cosmos-predict1-5b | nvidia/cosmos-predict1-5b | $0/MTok | $0/MTok | — | Open |
| magpie-tts-zeroshot | nvidia/magpie-tts-zeroshot | $0/MTok | $0/MTok | — | Open |
| sparsedrive | nvidia/sparsedrive | $0/MTok | $0/MTok | 128K | Open |
| streampetr | nvidia/streampetr | $0/MTok | $0/MTok | 128K | Open |
| Nemotron 3 Nano Omni nemotron | nvidia/nemotron-3-nano-omni-30b-a3b-reasoning | $0/MTok | $0/MTok | 256K | Reasoning Tools Open |
| nemotron-3-nano-30b-a3b nemotron | nvidia/nemotron-3-nano-30b-a3b | $0/MTok | $0/MTok | 131.1K | Reasoning Tools Open |
| nv-embed-v1 | nvidia/nv-embed-v1 | $0/MTok | $0/MTok | 32.8K | Open |
| llama-nemotron-rerank-vl-1b-v2 | nvidia/llama-nemotron-rerank-vl-1b-v2 | $0/MTok | $0/MTok | 128K | Open |
| studiovoice | nvidia/studiovoice | $0/MTok | $0/MTok | 128K | Open |
| cosmos-transfer2.5-2b | nvidia/cosmos-transfer2_5-2b | $0/MTok | $0/MTok | — | Open |
| nemotron-3-content-safety | nvidia/nemotron-3-content-safety | $0/MTok | $0/MTok | 128K | Open |
| usdvalidate | nvidia/usdvalidate | $0/MTok | $0/MTok | — | Open |
| llama-3_2-nemoretriever-300m-embed-v1 | nvidia/llama-3_2-nemoretriever-300m-embed-v1 | $0/MTok | $0/MTok | 32.8K | Open |
| llama-nemotron-embed-vl-1b-v2 | nvidia/llama-nemotron-embed-vl-1b-v2 | $0/MTok | $0/MTok | 32.8K | Open |
| usdcode | nvidia/usdcode | $0/MTok | $0/MTok | 128K | |
| llama-3.1-nemotron-safety-guard-8b-v3 | nvidia/llama-3_1-nemotron-safety-guard-8b-v3 | $0/MTok | $0/MTok | 128K | Open |
| rerank-qa-mistral-4b | nvidia/rerank-qa-mistral-4b | $0/MTok | $0/MTok | 128K | Open |
| nvidia-nemotron-nano-9b-v2 nemotron | nvidia/nvidia-nemotron-nano-9b-v2 | $0/MTok | $0/MTok | 131.1K | Reasoning Tools Open |
| synthetic-video-detector | nvidia/synthetic-video-detector | $0/MTok | $0/MTok | — | Open |
| Llama 3.3 Nemotron Super 49B v1.5 nemotron | nvidia/llama-3_3-nemotron-super-49b-v1_5 | $0/MTok | $0/MTok | 131.1K | Reasoning Tools Open |
| nv-embedcode-7b-v1 | nvidia/nv-embedcode-7b-v1 | $0/MTok | $0/MTok | 32.8K | Open |
| cosmos-transfer1-7b | nvidia/cosmos-transfer1-7b | $0/MTok | $0/MTok | — | Open |
| nemotron-mini-4b-instruct | nvidia/nemotron-mini-4b-instruct | $0/MTok | $0/MTok | 128K | Tools Open |
| nemotron-voicechat | nvidia/nemotron-voicechat | $0/MTok | $0/MTok | 128K | Tools Open |
| riva-translate-4b-instruct-v1_1 | nvidia/riva-translate-4b-instruct-v1_1 | $0/MTok | $0/MTok | 128K | Open |
| Llama 3.3 Nemotron Super 49B v1 nemotron | nvidia/llama-3_3-nemotron-super-49b-v1 | $0/MTok | $0/MTok | 131.1K | Reasoning Tools Open |
| nemotron-content-safety-reasoning-4b | nvidia/nemotron-content-safety-reasoning-4b | $0/MTok | $0/MTok | 128K | Reasoning Open |
| gliner-pii | nvidia/gliner-pii | $0/MTok | $0/MTok | 128K | Open |
| bevformer | nvidia/bevformer | $0/MTok | $0/MTok | 128K | Open |
| Active Speaker Detection | nvidia/active-speaker-detection | $0/MTok | $0/MTok | — | Open |
| MiniMax-M2.7 minimax | minimaxai/minimax-m2.7 | $0/MTok | $0/MTok | 204.8K | Reasoning Tools Open |
| MiniMax-M2.5 minimax | minimaxai/minimax-m2.5 | $0/MTok | $0/MTok | 204.8K | Reasoning Tools Open |
| Phi 4 Multimodal | microsoft/phi-4-multimodal-instruct | $0/MTok | $0/MTok | 128K | |
| Phi-4-Mini phi | microsoft/phi-4-mini-instruct | $0/MTok | $0/MTok | 131.1K | Reasoning Tools |
| Step 3.7 Flash | stepfun-ai/step-3.7-flash | $0/MTok | $0/MTok | 256K | Reasoning Tools Open |
| Step 3.5 Flash | stepfun-ai/step-3.5-flash | $0/MTok | $0/MTok | 256K | Reasoning Tools Open |
| Llama 3.1 70b Instruct | meta/llama-3.1-70b-instruct | $0/MTok | $0/MTok | 128K | Tools Open |
| Llama Guard 4 12B llama | meta/llama-guard-4-12b | $0/MTok | $0/MTok | 128K | Open |
| Llama 4 Maverick 17b 128e Instruct | meta/llama-4-maverick-17b-128e-instruct | $0/MTok | $0/MTok | 128K | Tools Open |
| esm2-650m | meta/esm2-650m | $0/MTok | $0/MTok | 128K | Open |
| Llama 3.2 3B Instruct llama | meta/llama-3.2-3b-instruct | $0/MTok | $0/MTok | 32.8K | Open |
| Llama 3.2 11b Vision Instruct | meta/llama-3.2-11b-vision-instruct | $0/MTok | $0/MTok | 128K | Tools Open |
| Llama 3.2 1b Instruct | meta/llama-3.2-1b-instruct | $0/MTok | $0/MTok | 128K | Tools Open |
| esmfold | meta/esmfold | $0/MTok | $0/MTok | 128K | Open |
| Llama 3.1 8B Instruct llama | meta/llama-3.1-8b-instruct | $0/MTok | $0/MTok | 16K | Tools Open |
| Llama 3.3 70b Instruct | meta/llama-3.3-70b-instruct | $0/MTok | $0/MTok | 128K | Tools Open |
| Llama-3.2-90B-Vision-Instruct llama | meta/llama-3.2-90b-vision-instruct | $0/MTok | $0/MTok | 128K | Tools Open |
| BGE M3 bge | baai/bge-m3 | $0/MTok | $0/MTok | 8.2K | Open |
| Gemma-4-31B-IT gemma | google/gemma-4-31b-it | $0/MTok | $0/MTok | 256K | Reasoning Tools Open |
| Gemma 2 2b It | google/gemma-2-2b-it | $0/MTok | $0/MTok | 128K | Tools Open |
| Gemma 3n E4b It | google/gemma-3n-e4b-it | $0/MTok | $0/MTok | 128K | Tools Open |
| paligemma | google/google-paligemma | $0/MTok | $0/MTok | 128K | Open |
| Gemma 3n E2b It | google/gemma-3n-e2b-it | $0/MTok | $0/MTok | 128K | Tools Open |
| Gemma-3-27B-IT gemma | google/gemma-3-27b-it | $0/MTok | $0/MTok | 131.1K | Reasoning Tools |
| GPT-OSS-120B gpt-oss | openai/gpt-oss-120b | $0/MTok | $0/MTok | 128K | Reasoning |
| GPT OSS 20B gpt-oss | openai/gpt-oss-20b | $0/MTok | $0/MTok | 131.1K | Reasoning Tools Open |
| Whisper Large v3 whisper | openai/whisper-large-v3 | $0/MTok | $0/MTok | — | Open |
| ByteDance-Seed/Seed-OSS-36B-Instruct seed | bytedance/seed-oss-36b-instruct | $0/MTok | $0/MTok | 262K | Tools |
| DeepSeek V4 Flash deepseek-flash | deepseek-ai/deepseek-v4-flash | $0.14/MTok | $0.28/MTok | 1.0M | Reasoning Tools Open |
| Nemotron 3 Super nemotron | nvidia/nemotron-3-super-120b-a12b | $0.20/MTok | $0.80/MTok | 262.1K | Reasoning Tools Open |
| DeepSeek V4 Pro deepseek-thinking | deepseek-ai/deepseek-v4-pro | $0.43/MTok | $0.87/MTok | 1.0M | Reasoning Tools Open |