Z AI logo

Z AI

GLM 5V Turbo

GLM-5V-Turbo is Z.ai’s first native multimodal agent foundation model, built for vision-based coding and agent-driven tasks. It natively handles image, video, and text inputs, excels at long-horizon planning, complex coding, and task execution, and works seamlessly with agents to complete the full loop of “perceive → plan → execute“.

Input / 1M tokens
$1.20
Output / 1M tokens
$4.00
Context window
203K tokens
Provider
Z AI
Cached input / 1M
$0.240

Performance

Median streaming throughput and first-token latency measured by Artificial Analysis.

Output tokens / sec
0 t/s
Time to first token
0.00s

Benchmarks

Intelligence, coding, and math indexes plus the underlying evaluation scores.

Intelligence Index
43
Coding Index
36
Math Index
MMLU-Pro
GPQA
80.9%
HLE
15.8%
LiveCodeBench
SciCode
43.5%
MATH-500
AIME

Benchmarks via Artificial Analysis