All Models
nvidia-nemotron-nano-9b-v2
NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and tasks by first generating a reasoning trace and then concluding with a final response. The model's reasoning capabilities can be controlled via a system prompt. If the user prefers the model to provide its final answer without intermediate reasoning traces, it can be configured to do so.
Available Providers (6)
| Provider | Model ID | Input Cost | Output Cost | Context | Max Output | Docs |
|---|---|---|---|---|---|---|
| | nvidia/nvidia-nemotron-nano-9b-v2 | $0.00/MTok | $0.00/MTok | 131.1K | 131.1K | |
| | nvidia/nemotron-nano-9b-v2 | $0.04/MTok | $0.16/MTok | 131.1K | 131.1K | |
| | nvidia/nemotron-nano-9b-v2 | $0.04/MTok | $0.16/MTok | 131.1K | 131.1K | |
| | nvidia/nemotron-nano-9b-v2 | $0.04/MTok | $0.16/MTok | 131.1K | 26.2K | |
| | nvidia.nemotron-nano-9b-v2 | $0.06/MTok | $0.23/MTok | 128K | 4.1K | |
| | nvidia/nvidia-nemotron-nano-9b-v2 | $0.17/MTok | $0.68/MTok | 128K | 16.4K |
Capabilities
Reasoning
Tool Calling
Attachments
Open Weights
Structured Output