All Models

Grok STT

grok

Speech transcription model for accurate audio-to-text and captioning workflows

Providers 1
Released Mar 16, 2026
Input Modalities audio
Output Modalities text

Available Providers (1)

Provider Model ID Input Cost Output Cost Context Max Output Docs
Vercel AI Gateway xai/grok-stt /MTok /MTok

Capabilities

Reasoning
Tool Calling
Attachments
Open Weights
Structured Output