DeepSeek V3.5
DeepSeek · released 2025-12-01
Quality (AA Index)81
Input price$0.14
Output price$0.28
Context128K
Throughput95 tok/s
P50 latency1.5s
Side-by-side pricing, capabilities, real-world cost across common scenarios, and our editorial pick.
DeepSeek · released 2025-12-01
Anthropic · released 2026-04-08
Links open in a new tab via our OpenRouter referral. Affiliate disclosure
| Metric | DeepSeek V3.5 | Claude Haiku 4.5 | Verdict |
|---|---|---|---|
Input price /1M tokens | $0.14 | $1.00 | DeepSeek V3.5 −86% |
Output price /1M tokens | $0.28 | $5.00 | DeepSeek V3.5 −94% |
Context window max input length | 128K | 200K | Claude Haiku 4.5 +1.6× |
AA Quality AA Intelligence Index (0–100) | 81 | 79 | DeepSeek V3.5 +2pt |
Arena Elo LMArena human-pref Elo (800–2000) | — | — | Tied |
Throughput tokens per second | 95 | 250 | Claude Haiku 4.5 +163% |
P50 latency first token | 1.5s | 0.4s | Claude Haiku 4.5 −73% |
Vision multimodal | — | ||
Function calling tool use | Tied | ||
Reasoning mode chain-of-thought | — |
| Scenario | DeepSeek V3.5 | Claude Haiku 4.5 |
|---|---|---|
customer support 1000K req · 600/180 tok | $134 | $1.5K |
chat with docs 300K req · 4000/300 tok | $193 | $1.6K |
code generation 500K req · 2000/500 tok | $210 | $2.3K |
voice assistant 600K req · 800/200 tok | $101 | $1.1K |
Roughly GPT-4o-class quality at roughly 90% off. Tradeoff is region availability (Chinese provider) and slower output throughput than Western frontier tiers.
Set DeepSeek V3.5 as primary, Claude Haiku 4.5 as fallback. One key, one bill, automatic failover when DeepSeek V3.5 errors.
$ curl https://api.aipricly.com/v1/chat/completions \
-H "Authorization: Bearer $AIPC_KEY" \
-d '{
"routing": {
"primary": "deepseek/deepseek-v3-5",
"fallback": ["anthropic/claude-haiku-4-5"]
},
"messages": [{"role": "user", "content": "..."}]
}'