All Models

Qwen/Qwen3-8B

qwen Reasoning Tool Calling Open Weights Structured Output

Qwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, designed for both reasoning-heavy tasks and efficient dialogue. It supports seamless switching between "thinking" mode for math, coding, and logical inference, and "non-thinking" mode for general conversation. The model is fine-tuned for instruction-following, agent integration, creative writing, and multilingual use across 100+ languages and dialects. It natively supports a 32K token context window and can extend to 131K tokens with YaRN scaling.

Providers 3
Released Apr 1, 2025
Input Modalities text
Output Modalities text
Tarsk Use coding

Available Providers (3)

Provider Model ID Input Cost Output Cost Context Max Output Docs
Kilo Gateway qwen/qwen3-8b $0.05/MTok $0.40/MTok 41.0K 8.2K
SiliconFlow Qwen/Qwen3-8B $0.06/MTok $0.06/MTok 131K 131K
SiliconFlow (China) Qwen/Qwen3-8B $0.06/MTok $0.06/MTok 131K 131K

Capabilities

Reasoning
Tool Calling
Attachments
Open Weights
Structured Output