All Models

nvidia-nemotron-nano-9b-v2

nemotron Reasoning Tool Calling Open Weights

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and tasks by first generating a reasoning trace and then concluding with a final response. The model's reasoning capabilities can be controlled via a system prompt. If the user prefers the model to provide its final answer without intermediate reasoning traces, it can be configured to do so.

Providers 6
Released Dec 1, 2024
Input Modalities text
Output Modalities text
Tarsk Use coding

Available Providers (6)

Provider Model ID Input Cost Output Cost Context Max Output Docs
Nvidia nvidia/nvidia-nemotron-nano-9b-v2 $0.00/MTok $0.00/MTok 131.1K 131.1K
OpenRouter nvidia/nemotron-nano-9b-v2 $0.04/MTok $0.16/MTok 131.1K 131.1K
Vercel AI Gateway nvidia/nemotron-nano-9b-v2 $0.04/MTok $0.16/MTok 131.1K 131.1K
Kilo Gateway nvidia/nemotron-nano-9b-v2 $0.04/MTok $0.16/MTok 131.1K 26.2K
Amazon Bedrock nvidia.nemotron-nano-9b-v2 $0.06/MTok $0.23/MTok 128K 4.1K
NanoGPT nvidia/nvidia-nemotron-nano-9b-v2 $0.17/MTok $0.68/MTok 128K 16.4K

Capabilities

Reasoning
Tool Calling
Attachments
Open Weights
Structured Output