Grok STT

grok

Speech transcription model for accurate audio-to-text and captioning workflows

Providers 1

Released Mar 16, 2026

Input Modalities audio

Output Modalities text

Available Providers (1)

Provider	Model ID	Input Cost	Output Cost	Context	Max Output	Docs
Vercel AI Gateway	`xai/grok-stt`	—/MTok	—/MTok	—	—

Reasoning

Tool Calling

Attachments

Open Weights

Structured Output