All Models
Nemotron 3 Ultra
NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...
Benchmarks
Available Providers (3)
| Provider | Model ID | Input Cost | Output Cost | Context | Max Output | Docs |
|---|---|---|---|---|---|---|
| | nvidia/nemotron-3-ultra-550b-a55b | $0.60/MTok | $2.40/MTok | 1M | 65K | |
| | nemotron-3-ultra-550b | —/MTok | —/MTok | 131.1K | 8.2K | |
| | nemotron-3-ultra | —/MTok | —/MTok | 262.1K | 128K |
Capabilities
Reasoning
Tool Calling
Attachments
Open Weights
Structured Output