All Models
Nemotron 3 Ultra 550B A55B
NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...
Benchmarks
Available Providers (4)
| Provider | Model ID | Input Cost | Output Cost | Context | Max Output | Docs |
|---|---|---|---|---|---|---|
| | nemotron-3-ultra-550b | $0.50/MTok | $2.50/MTok | 1M | 128K | |
| | nvidia/nemotron-3-ultra-550b-a55b | $0.50/MTok | $2.50/MTok | 1M | 65.5K | |
| | nvidia/nemotron-3-ultra-550b-a55b | $0.50/MTok | $2.20/MTok | 262.1K | 16.4K | |
| | nvidia/nemotron-3-ultra-550b-a55b | $0.60/MTok | $3.60/MTok | 512.3K | 512.3K |
Capabilities
Reasoning
Tool Calling
Attachments
Open Weights
Structured Output