OpenAI

GPT-4o (2024-08-06)

The 2024-08-06 version of GPT-4o offers improved performance in structured outputs, with the ability to supply a JSON schema in the respone_format. Read more [here](https://openai.com/index/introducing-structured-outputs-in-the-api/). GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of [GPT-4 Turbo](/models/openai/gpt-4-turbo) while being twice as fast and 50% more cost-effective. GPT-4o also offers improved performance in processing non-English languages and enhanced visual capabilities. For benchmarking against other models, it was briefly called ["im-also-a-good-gpt2-chatbot"](https://twitter.com/LiamFedus/status/1790064963966370209)

Input / 1M tokens: $2.50
Output / 1M tokens: $10.00
Context window: 128K tokens
Provider: OpenAI
Cached input / 1M: $1.25
Knowledge cutoff: 2023-10-31

Performance

Median streaming throughput and first-token latency measured by Artificial Analysis.

Output tokens / sec: 109 t/s
Time to first token: 0.56s

Benchmarks

Intelligence, coding, and math indexes plus the underlying evaluation scores.

Intelligence Index: 19
Coding Index: 17
Math Index: —
MMLU-Pro: —
GPQA: 52.1%
HLE: 2.9%
LiveCodeBench: 31.7%
SciCode: 33.1%
MATH-500: 79.5%
AIME: 11.7%

Benchmarks via Artificial Analysis