Qwen: Qwen VL Plus

Qwen's enhanced large vision-language model. It is significantly upgraded for detailed image recognition and text recognition, and it accepts image input at resolutions of up to millions of pixels and at extreme aspect ratios. It delivers strong performance across a broad range of visual tasks.

Providers: 1
Released: Jan 25, 2024
Input Modalities: image, text
Output Modalities: text

Available Providers (1)

Provider      Model ID           Input Cost  Output Cost  Context  Max Output  Docs
Kilo Gateway  qwen/qwen-vl-plus  $0.14/MTok  $0.41/MTok   131.1K   8.2K
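
The listed prices are per million tokens (MTok), so the cost of a single request can be estimated from its token counts. The following is a minimal sketch, not provider code: the function names are hypothetical, the limits are the rounded figures from the listing, and it assumes that output tokens count toward the context window. How image inputs are converted into tokens is not stated here, so the input count must come from the provider's own usage report.

```python
# Prices copied from the listing (USD per million tokens).
INPUT_USD_PER_MTOK = 0.14
OUTPUT_USD_PER_MTOK = 0.41

# Limits as listed; the page rounds them, so treat these as approximate.
CONTEXT_TOKENS = 131_100
MAX_OUTPUT_TOKENS = 8_200


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request at the listed rates."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / 1_000_000


def fits_limits(input_tokens: int, output_tokens: int) -> bool:
    """Check a request against the listed limits.

    Assumption: output tokens share the context window with the input.
    """
    within_output = output_tokens <= MAX_OUTPUT_TOKENS
    within_context = input_tokens + output_tokens <= CONTEXT_TOKENS
    return within_output and within_context


if __name__ == "__main__":
    # Hypothetical request: 12,000 input tokens and 800 output tokens.
    print(f"${estimate_cost_usd(12_000, 800):.6f}")
    print(fits_limits(12_000, 800))
```

Because both rates are well under a dollar per million tokens, per-request costs are usually fractions of a cent, which is why the sketch prints six decimal places.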

Capabilities

Reasoning
Tool Calling
Attachments
Open Weights
Structured Output