All Models
Mercury Coder
Mercury Coder is the first diffusion large language model (dLLM). Applying a breakthrough discrete diffusion approach, the model runs 5-10x faster than even speed optimized models like Claude 3.5 Haiku and GPT-4o Mini while matching their performance. Mercury Coder's speed means that developers can stay in the flow while coding, enjoying rapid chat-based iteration and responsive code completion suggestions. On Copilot Arena, Mercury Coder ranks 1st in speed and ties for 2nd in quality. Read more in the [blog post here](https://www.inceptionlabs.ai/blog/introducing-mercury).
Available Providers (2)
| Provider | Model ID | Input Cost | Output Cost | Context | Max Output | Docs |
|---|---|---|---|---|---|---|
| | mercury-coder | $0.25/MTok | $1.00/MTok | 128K | 16.4K | |
| | inception/mercury-coder | $0.25/MTok | $0.75/MTok | 128K | 32K |
Capabilities
Reasoning
Tool Calling
Attachments
Open Weights
Structured Output