Z AI
GLM 5V Turbo
GLM-5V-Turbo is Z.ai’s first native multimodal agent foundation model, built for vision-based coding and agent-driven tasks. It natively handles image, video, and text inputs, excels at long-horizon planning, complex coding, and task execution, and is designed to work with agents to complete the full “perceive → plan → execute” loop.
- Input / 1M tokens: $1.20
- Output / 1M tokens: $4.00
- Context window: 203K tokens
- Provider: Z AI
- Cached input / 1M tokens: $0.24
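As a rough illustration of how the per-token prices above combine, the following sketch estimates the cost of a single request. The function name `estimate_cost` and its parameters are hypothetical, and it assumes that cached input tokens are a subset of the input tokens and are billed at the cached rate instead of the regular input rate; the provider's actual billing rules may differ.

```python
# Prices in USD per 1M tokens, copied from the pricing list above.
INPUT_PER_M = 1.20
CACHED_INPUT_PER_M = 0.24
OUTPUT_PER_M = 4.00


def estimate_cost(input_tokens: int, output_tokens: int, cached_input_tokens: int = 0) -> float:
    """Estimate the USD cost of one request.

    Assumption (not confirmed by the source): cached tokens are part of
    input_tokens and are charged at the cached rate rather than the
    regular input rate.
    """
    if min(input_tokens, output_tokens, cached_input_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_input_tokens > input_tokens:
        raise ValueError("cached_input_tokens cannot exceed input_tokens")

    uncached_tokens = input_tokens - cached_input_tokens
    total = (
        uncached_tokens * INPUT_PER_M
        + cached_input_tokens * CACHED_INPUT_PER_M
        + output_tokens * OUTPUT_PER_M
    )
    return total / 1_000_000


# Example: 100K input tokens (20K of them cached) and 5K output tokens.
cost = estimate_cost(100_000, 5_000, cached_input_tokens=20_000)
print(f"${cost:.4f}")  # prints "$0.1208"
```

In this example, output tokens make up only about 5% of the token count but roughly a sixth of the cost, since the output rate is more than three times the input rate.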
Performance
Median streaming throughput and first-token latency measured by Artificial Analysis.
- Output tokens / sec: 0 t/s
- Time to first token: 0.00s
Benchmarks
Intelligence, coding, and math indexes plus the underlying evaluation scores.
- Intelligence Index: 43
- Coding Index: 36
- Math Index: —
- MMLU-Pro: —
- GPQA: 80.9%
- HLE: 15.8%
- LiveCodeBench: —
- SciCode: 43.5%
- MATH-500: —
- AIME: —
Benchmarks via Artificial Analysis