All Models
Provider
Cloudflare Workers AI
42 Models
20 Families
Reasoning
Tool Calling
Open Weights
| Model | Model ID | Input Cost | Output Cost | Context | Max Output | Capabilities |
|---|---|---|---|---|---|---|
| BART Large CNN bart | @cf/facebook/bart-large-cnn | $0.00/MTok | $0.00/MTok | 128K | 16.4K | |
| MyShell MeloTTS melotts | @cf/myshell-ai/melotts | $0.00/MTok | $0.00/MTok | 128K | 16.4K | |
| Pipecat Smart Turn v2 smart-turn | @cf/pipecat-ai/smart-turn-v2 | $0.00/MTok | $0.00/MTok | 128K | 16.4K | |
| Deepgram Aura 2 (ES) aura | @cf/deepgram/aura-2-es | $0.00/MTok | $0.00/MTok | 128K | 16.4K | |
| Deepgram Nova 3 nova | @cf/deepgram/nova-3 | $0.00/MTok | $0.00/MTok | 128K | 16.4K | |
| Deepgram Aura 2 (EN) aura | @cf/deepgram/aura-2-en | $0.00/MTok | $0.00/MTok | 128K | 16.4K | |
| BGE Reranker Base bge | @cf/baai/bge-reranker-base | $0.0031/MTok | $0.00/MTok | 128K | 16.4K | |
| BGE M3 bge | @cf/baai/bge-m3 | $0.01/MTok | $0.00/MTok | 128K | 16.4K | |
| Qwen3 Embedding 0.6B qwen | @cf/qwen/qwen3-embedding-0.6b | $0.01/MTok | $0.00/MTok | 128K | 16.4K | |
| IBM Granite 4.0 H Micro granite | @cf/ibm-granite/granite-4.0-h-micro | $0.02/MTok | $0.11/MTok | 128K | 16.4K | |
| PLaMo Embedding 1B plamo | @cf/pfnet/plamo-embedding-1b | $0.02/MTok | $0.00/MTok | 128K | 16.4K | |
| BGE Small EN v1.5 bge | @cf/baai/bge-small-en-v1.5 | $0.02/MTok | $0.00/MTok | 128K | 16.4K | |
| DistilBERT SST-2 INT8 distilbert | @cf/huggingface/distilbert-sst-2-int8 | $0.03/MTok | $0.00/MTok | 128K | 16.4K | |
| Llama 3.2 1B Instruct llama | @cf/meta/llama-3.2-1b-instruct | $0.03/MTok | $0.20/MTok | 128K | 16.4K | |
| Llama 3.2 11B Vision Instruct llama | @cf/meta/llama-3.2-11b-vision-instruct | $0.05/MTok | $0.68/MTok | 128K | 16.4K | |
| Qwen3 30B A3B FP8 qwen | @cf/qwen/qwen3-30b-a3b-fp8 | $0.05/MTok | $0.34/MTok | 128K | 16.4K | |
| Llama 3.2 3B Instruct llama | @cf/meta/llama-3.2-3b-instruct | $0.05/MTok | $0.34/MTok | 128K | 16.4K | |
| GLM-4.7-Flash glm-flash | @cf/zai-org/glm-4.7-flash | $0.06/MTok | $0.40/MTok | 131.1K | 131.1K | Reasoning Tools Open |
| BGE Base EN v1.5 bge | @cf/baai/bge-base-en-v1.5 | $0.07/MTok | $0.00/MTok | 128K | 16.4K | |
| Mistral 7B Instruct v0.1 mistral | @cf/mistral/mistral-7b-instruct-v0.1 | $0.11/MTok | $0.19/MTok | 128K | 16.4K | |
| Llama 3 8B Instruct AWQ llama | @cf/meta/llama-3-8b-instruct-awq | $0.12/MTok | $0.27/MTok | 128K | 16.4K | |
| Llama 3.1 8B Instruct AWQ llama | @cf/meta/llama-3.1-8b-instruct-awq | $0.12/MTok | $0.27/MTok | 128K | 16.4K | |
| Llama 3.1 8B Instruct FP8 llama | @cf/meta/llama-3.1-8b-instruct-fp8 | $0.15/MTok | $0.29/MTok | 128K | 16.4K | |
| BGE Large EN v1.5 bge | @cf/baai/bge-large-en-v1.5 | $0.20/MTok | $0.00/MTok | 128K | 16.4K | |
| GPT OSS 20B | @cf/openai/gpt-oss-20b | $0.20/MTok | $0.30/MTok | 128K | 16.4K | |
| Llama 4 Scout 17B 16E Instruct llama | @cf/meta/llama-4-scout-17b-16e-instruct | $0.27/MTok | $0.85/MTok | 128K | 16.4K | |
| Llama 3.1 8B Instruct llama | @cf/meta/llama-3.1-8b-instruct | $0.28/MTok | $0.83/MTok | 128K | 16.4K | |
| Llama 3 8B Instruct llama | @cf/meta/llama-3-8b-instruct | $0.28/MTok | $0.83/MTok | 128K | 16.4K | |
| Llama 3.3 70B Instruct FP8 Fast llama | @cf/meta/llama-3.3-70b-instruct-fp8-fast | $0.29/MTok | $2.25/MTok | 128K | 16.4K | |
| M2M100 1.2B m2m | @cf/meta/m2m100-1.2b | $0.34/MTok | $0.34/MTok | 128K | 16.4K | |
| IndicTrans2 EN-Indic 1B indictrans | @cf/ai4bharat/indictrans2-en-indic-1B | $0.34/MTok | $0.34/MTok | 128K | 16.4K | |
| Gemma 3 12B IT gemma | @cf/google/gemma-3-12b-it | $0.35/MTok | $0.56/MTok | 128K | 16.4K | |
| Mistral Small 3.1 24B Instruct mistral-small | @cf/mistralai/mistral-small-3.1-24b-instruct | $0.35/MTok | $0.56/MTok | 128K | 16.4K | |
| GPT OSS 120B | @cf/openai/gpt-oss-120b | $0.35/MTok | $0.75/MTok | 128K | 16.4K | |
| Gemma SEA-LION v4 27B IT gemma | @cf/aisingapore/gemma-sea-lion-v4-27b-it | $0.35/MTok | $0.56/MTok | 128K | 16.4K | |
| Llama Guard 3 8B llama | @cf/meta/llama-guard-3-8b | $0.48/MTok | $0.03/MTok | 128K | 16.4K | |
| Nemotron 3 Super 120B nemotron | @cf/nvidia/nemotron-3-120b-a12b | $0.50/MTok | $1.50/MTok | 256K | 256K | Reasoning Tools Open |
| DeepSeek R1 Distill Qwen 32B deepseek-thinking | @cf/deepseek-ai/deepseek-r1-distill-qwen-32b | $0.50/MTok | $4.88/MTok | 128K | 16.4K | |
| Llama 2 7B Chat FP16 llama | @cf/meta/llama-2-7b-chat-fp16 | $0.56/MTok | $6.67/MTok | 128K | 16.4K | |
| Kimi K2.5 kimi | @cf/moonshotai/kimi-k2.5 | $0.60/MTok | $3.00/MTok | 256K | 256K | Reasoning Tools Open |
| QwQ 32B qwen | @cf/qwen/qwq-32b | $0.66/MTok | $1.00/MTok | 128K | 16.4K | |
| Qwen 2.5 Coder 32B Instruct qwen | @cf/qwen/qwen2.5-coder-32b-instruct | $0.66/MTok | $1.00/MTok | 128K | 16.4K |