OpenAI: GPT-4o Audio

Tool Calling

The gpt-4o-audio-preview model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences. Audio outputs...

Providers 1

Released Aug 15, 2025

Input Modalities audio, text

Output Modalities audio, text

Tarsk Use coding

Available Providers (1)

Provider	Model ID	Input Cost	Output Cost	Context	Max Output	Docs
Kilo Gateway	`openai/gpt-4o-audio-preview`	$2.50/MTok	$10.00/MTok	128K	16.4K

Capabilities

Reasoning

Tool Calling

Attachments

Open Weights

Structured Output