All Models

Qwen2.5-VL 72B Instruct

qwen Tool Calling Attachments Open Weights Structured Output

Qwen vision-language model for visual reasoning, documents, and agent tasks

Providers 8
Released Sep 1, 2024
Input Modalities text, image, video
Output Modalities text
Tarsk Use coding

Available Providers (8)

Provider Model ID Input Cost Output Cost Context Max Output Docs
LLM Gateway qwen2-5-vl-72b-instruct $0.13/MTok $0.40/MTok 32.8K 8.2K
Nebius Token Factory Qwen/Qwen2.5-VL-72B-Instruct $0.25/MTok $0.75/MTok 128K 8.2K
NovitaAI qwen/qwen2.5-vl-72b-instruct $0.80/MTok $0.80/MTok 32.8K 32.8K
Kilo Gateway qwen/qwen2.5-vl-72b-instruct $0.80/MTok $0.80/MTok 32.8K 32.8K
OpenRouter qwen/qwen2.5-vl-72b-instruct $0.80/MTok $1/MTok 128K 128K
OVHcloud AI Endpoints qwen2.5-vl-72b-instruct $1.01/MTok $1.01/MTok 32.8K 32.8K
Alibaba (China) qwen2-5-vl-72b-instruct $2.29/MTok $6.88/MTok 131.1K 8.2K
Alibaba qwen2-5-vl-72b-instruct $2.80/MTok $8.40/MTok 131.1K 8.2K

Capabilities

Reasoning
Tool Calling
Attachments
Open Weights
Structured Output