All Models

Qwen/Qwen3-235B-A22B-Instruct-2507

qwen Reasoning Tool Calling Open Weights Structured Output

Qwen3-235B-A22B-Instruct-2507 is a multilingual, instruction-tuned mixture-of-experts language model based on the Qwen3-235B architecture, with 22B active parameters per forward pass. It is optimized for general-purpose text generation, including instruction following, logical reasoning, math, code, and tool usage. The model supports a native 262K context length and does not implement "thinking mode" (<think> blocks). Compared to its base variant, this version delivers significant gains in knowledge coverage, long-context reasoning, coding benchmarks, and alignment with open-ended tasks. It is particularly strong on multilingual understanding, math reasoning (e.g., AIME, HMMT), and alignment evaluations like Arena-Hard and WritingBench.

Providers 3
Released Apr 1, 2025
Input Modalities text
Output Modalities text
Tarsk Use coding

Available Providers (3)

Provider Model ID Input Cost Output Cost Context Max Output Docs
Kilo Gateway qwen/qwen3-235b-a22b-2507 $0.07/MTok $0.10/MTok 262.1K 52.4K
SiliconFlow Qwen/Qwen3-235B-A22B-Instruct-2507 $0.09/MTok $0.60/MTok 262K 262K
SiliconFlow (China) Qwen/Qwen3-235B-A22B-Instruct-2507 $0.09/MTok $0.60/MTok 262K 262K

Capabilities

Reasoning
Tool Calling
Attachments
Open Weights
Structured Output