Qwen3 VL 8B Instruct

qwen Tool Calling Attachments Open Weights Structured Output

Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built for high-fidelity understanding and reasoning across text, images, and video. It features improved multimodal fusion with Interleaved-MRoPE for long-horizon...

Providers 3

Released Aug 19, 2025

Input Modalities text, image

Output Modalities text

Tarsk Use coding

Benchmarks

Available Providers (3)

Provider	Model ID	Input Cost	Output Cost	Context	Max Output
LLM Gateway	`qwen3-vl-8b-instruct`	$0.08/MTok	$0.50/MTok	131.1K	8.2K
Kilo Gateway	`qwen/qwen3-vl-8b-instruct`	$0.08/MTok	$0.50/MTok	131.1K	32.8K
OpenRouter	`qwen/qwen3-vl-8b-instruct`	$0.08/MTok	$0.50/MTok	131.1K	32.8K

Capabilities

Reasoning

Tool Calling

Attachments

Open Weights

Structured Output