Qwen3-32B

qwen Reasoning Tool Calling Open Weights Structured Output

Qwen3-32B is a dense 32.8B-parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dialogue. It can switch between a "thinking" mode for tasks such as math, coding, and logical inference, and a "non-thinking" mode for faster, general-purpose conversation. The model performs strongly in instruction following, agent tool use, creative writing, and multilingual tasks across 100+ languages and dialects. It natively handles 32K-token contexts and can extend to 131K tokens using YaRN-based scaling.
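
The context-extension arithmetic can be sketched as follows. This is a minimal illustration, assuming "32K" and "131K" mean 32,768 and 131,072 tokens; the helper names are hypothetical, and the dictionary keys follow the rope-scaling convention commonly seen in Hugging Face-style model configs, which is an assumption rather than something this page specifies.

```python
NATIVE_CONTEXT = 32_768     # assumed exact value of the native "32K" window
EXTENDED_CONTEXT = 131_072  # assumed exact value of the extended "131K" window


def yarn_factor(target_tokens: int, native_tokens: int = NATIVE_CONTEXT) -> float:
    """Return the scaling factor needed to stretch the native window to target_tokens.

    Targets that already fit in the native window need no scaling (factor 1.0).
    """
    if target_tokens <= 0:
        raise ValueError("target_tokens must be positive")
    return max(1.0, target_tokens / native_tokens)


def rope_scaling_config(target_tokens: int) -> dict:
    """Build a hypothetical rope-scaling fragment for the requested window.

    The key names are an assumption modeled on common Hugging Face-style configs.
    """
    return {
        "rope_type": "yarn",
        "factor": yarn_factor(target_tokens),
        "original_max_position_embeddings": NATIVE_CONTEXT,
    }


if __name__ == "__main__":
    print(rope_scaling_config(EXTENDED_CONTEXT))
```

Under these assumptions, reaching the full extended window corresponds to a scaling factor of 4, while any prompt that fits in the native window needs no scaling at all.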

Providers: 13
Released: Dec 1, 2024
Input Modalities: text
Output Modalities: text
Tarsk Use: coding

Available Providers (13)

Provider               Model ID            Input Cost   Output Cost   Context   Max Output
iFlow                  qwen3-32b           $0.00/MTok   $0.00/MTok    128K      32K
Chutes                 Qwen/Qwen3-32B      $0.08/MTok   $0.24/MTok    41.0K     41.0K
Abacus                 Qwen/Qwen3-32B      $0.09/MTok   $0.29/MTok    128K      8.2K
OVHcloud AI Endpoints  qwen3-32b           $0.09/MTok   $0.25/MTok    32.8K     32.8K
Cortecs                qwen3-32b           $0.10/MTok   $0.33/MTok    16.4K     16.4K
Nebius Token Factory   Qwen/Qwen3-32B      $0.10/MTok   $0.30/MTok    128K      8.2K
NovitaAI               qwen/qwen3-32b-fp8  $0.10/MTok   $0.45/MTok    41.0K     20K
Jiekou.AI              qwen/qwen3-32b-fp8  $0.10/MTok   $0.45/MTok    41.0K     20K
Alibaba (China)        qwen3-32b           $0.29/MTok   $1.15/MTok    131.1K    16.4K
Groq                   qwen/qwen3-32b      $0.29/MTok   $0.59/MTok    131.1K    16.4K
Helicone               qwen3-32b           $0.29/MTok   $0.59/MTok    131.1K    41.0K
Alibaba                qwen3-32b           $0.70/MTok   $2.80/MTok    131.1K    16.4K
Qiniu                  qwen3-32b           not listed   not listed    40K       4.1K
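
Because prices are quoted per million tokens (MTok), the cost of a single request is the token count times the rate, divided by one million. The sketch below illustrates that arithmetic for three providers copied from the listing above; the function and variable names are hypothetical, and the token counts in the example are made up for illustration.

```python
# USD per million tokens as (input, output), copied from the provider listing.
PRICES_PER_MTOK = {
    "Chutes": (0.08, 0.24),
    "Groq": (0.29, 0.59),
    "Alibaba": (0.70, 2.80),
}

TOKENS_PER_MTOK = 1_000_000


def request_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request from per-MTok input and output prices."""
    input_price, output_price = PRICES_PER_MTOK[provider]
    total = input_tokens * input_price + output_tokens * output_price
    return total / TOKENS_PER_MTOK


def cheapest_provider(input_tokens: int, output_tokens: int) -> str:
    """Return the provider in PRICES_PER_MTOK with the lowest estimated cost."""
    return min(
        PRICES_PER_MTOK,
        key=lambda name: request_cost(name, input_tokens, output_tokens),
    )


if __name__ == "__main__":
    # Illustrative request: 10,000 input tokens and 2,000 output tokens.
    for name in PRICES_PER_MTOK:
        print(f"{name}: ${request_cost(name, 10_000, 2_000):.5f}")
    print("cheapest:", cheapest_provider(10_000, 2_000))
```

The estimate covers price only; context window and maximum output limits differ by provider and would need to be checked separately before routing a long request.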

Capabilities

Reasoning
Tool Calling
Attachments
Open Weights
Structured Output