All Models

Mercury

mercury Tool Calling Structured Output

Mercury is the first diffusion large language model (dLLM). Applying a breakthrough discrete diffusion approach, the model runs 5-10x faster than even speed optimized models like GPT-4.1 Nano and Claude 3.5 Haiku while matching their performance. Mercury's speed enables developers to provide responsive user experiences, including with voice agents, search interfaces, and chatbots. Read more in the [blog post] (https://www.inceptionlabs.ai/blog/introducing-mercury) here.

Providers 2
Released Jun 26, 2025
Input Modalities text
Output Modalities text
Tarsk Use coding

Available Providers (2)

Provider Model ID Input Cost Output Cost Context Max Output Docs
Inception mercury $0.25/MTok $1.00/MTok 128K 16.4K
OpenRouter inception/mercury $0.25/MTok $0.75/MTok 128K 32K

Capabilities

Reasoning
Tool Calling
Attachments
Open Weights
Structured Output