Z-ai · released 2026-06-16 · text model

GLM 5.2

Name: GLM 5.2
Brand: Z-ai
Price: 0.76006 USD
Availability: InStock

Goes to OpenRouter · referral link · See disclosure →

Input price$0.76/1M tokens

Output price$2.39/1M tokens

Context1Mtokens

P50 latency—sfirst tok

Throughput—tok/s

Price history

Output price per 1M tokens · last 12 months

all prices in USD per 1M tokens · All hidden costs surfaced

Pricing tier	Input	Output	Notes
Standard Pay-as-you-go API rate card	$0.76	$2.39	Default if no headers set
With prompt caching Cached read · cache write	$0.08−90%	$2.39	Cache writes priced at standard input ($0.76); cached reads 90% off. Requires cache_control header.
Batch API (50% off) 24h SLA, async	$0.38−50%	$1.19−50%	Send up to 50K requests in one batch; results within 24h
Image input (vision) Per image, low/high detail differ	$0.76 + image fee	$2.39	$0.50 per 1024×1024 image at high detail; low detail ~$0.10
Reasoning tokens Hidden chain-of-thought tokens	$0.76	$2.39 × ~3-8	When using reasoning.effort: high, output tokens include hidden reasoning. Real cost can be 3-8× headline output.
Structured output JSON mode + schema	$0.76	$2.39	No extra cost; first call may be slower (~2x latency) due to schema compilation