Mistral logo

Mistral

Voxtral Small 24B 2507

Voxtral Small is an enhancement of Mistral Small 3, incorporating state-of-the-art audio input capabilities while retaining best-in-class text performance. It excels at speech transcription, translation and audio understanding. Input audio is priced at $100 per million seconds.

Input / 1M tokens
$0.100
Output / 1M tokens
$0.300
Context window
32K tokens
Provider
Mistral
Cached input / 1M
$0.010

Performance

Median streaming throughput and first-token latency measured by Artificial Analysis.

Output tokens / sec
Time to first token