OpenAI

GPT Audio

The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder that produces more natural-sounding voices and keeps a voice more consistent. Audio tokens are priced at $32 per million input tokens and $64 per million output tokens.

Input / 1M tokens: $2.50
Output / 1M tokens: $10.00
Context window: 128K tokens
Provider: OpenAI
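
To make the pricing concrete, here is a minimal Python sketch that estimates the cost of a request from its token counts. The audio rates come from the description above. Treating the $2.50 and $10.00 figures as rates for text tokens is an assumption, as is the helper name `estimate_cost`; neither is part of any OpenAI API.

```python
# Rates in USD per one million tokens, copied from the listing above.
AUDIO_INPUT_PER_M = 32.00
AUDIO_OUTPUT_PER_M = 64.00
# Assumption: the listed $2.50 / $10.00 rates apply to text tokens.
TEXT_INPUT_PER_M = 2.50
TEXT_OUTPUT_PER_M = 10.00


def estimate_cost(text_in=0, text_out=0, audio_in=0, audio_out=0):
    """Return the estimated cost in USD for the given token counts.

    Hypothetical helper for illustration only.
    """
    total_per_million = (
        text_in * TEXT_INPUT_PER_M
        + text_out * TEXT_OUTPUT_PER_M
        + audio_in * AUDIO_INPUT_PER_M
        + audio_out * AUDIO_OUTPUT_PER_M
    )
    # Rates are quoted per million tokens, so scale down at the end.
    return total_per_million / 1_000_000


if __name__ == "__main__":
    cost = estimate_cost(text_in=1_000, text_out=500, audio_in=2_000, audio_out=4_000)
    print(f"${cost:.4f}")  # prints $0.3275
```

In this example the audio tokens account for $0.32 of the $0.3275 total, so under these rates audio usage will typically dominate the bill.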

Performance

Median streaming throughput and first-token latency measured by Artificial Analysis.

Charts: output tokens per second; time to first token.