All Models
Qwen3-VL 30B-A3B Instruct
Qwen3-VL-30B-A3B-Instruct is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Instruct variant optimizes instruction-following for general multimodal tasks. It excels in perception...
Benchmarks
Available Providers (4)
| Provider | Model ID | Input Cost | Output Cost | Context | Max Output | Docs |
|---|---|---|---|---|---|---|
| | qwen/qwen3-vl-30b-a3b-instruct | $0.13/MTok | $0.52/MTok | 131.1K | 32.8K | |
| | qwen/qwen3-vl-30b-a3b-instruct | $0.13/MTok | $0.52/MTok | 131.1K | 32.8K | |
| | Qwen/Qwen3-VL-30B-A3B-Instruct | $0.15/MTok | $0.55/MTok | 256K | 32.8K | |
| | qwen3-vl-30b-a3b-instruct | $0.20/MTok | $0.70/MTok | 131.1K | 8.2K |
Capabilities
Reasoning
Tool Calling
Attachments
Open Weights
Structured Output