All Models

OpenAI: GPT-4o Audio

Tool Calling

The gpt-4o-audio-preview model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences. Audio outputs are currently not supported. Audio tokens are priced at $40 per million input and $80 per million output audio tokens.

Providers 1
Released Aug 15, 2025
Input Modalities audio, text
Output Modalities audio, text
Tarsk Use coding

Available Providers (1)

Provider Model ID Input Cost Output Cost Context Max Output Docs
Kilo Gateway openai/gpt-4o-audio-preview $2.50/MTok $10.00/MTok 128K 16.4K

Capabilities

Reasoning
Tool Calling
Attachments
Open Weights
Structured Output