All Models

Nemotron 3 Ultra

nemotron Reasoning Tool Calling Open Weights

NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...

Providers 3
Released Jun 4, 2026
Input Modalities text
Output Modalities text
Tarsk Use coding

Available Providers (3)

Provider Model ID Input Cost Output Cost Context Max Output Docs
Vercel AI Gateway nvidia/nemotron-3-ultra-550b-a55b $0.60/MTok $2.40/MTok 1M 65K
DigitalOcean nemotron-3-ultra-550b /MTok /MTok 131.1K 8.2K
Ollama Cloud nemotron-3-ultra /MTok /MTok 262.1K 128K

Capabilities

Reasoning
Tool Calling
Attachments
Open Weights
Structured Output