跳转到主内容
AIpricly

Google · 发布于 2025-09-01 · text model

Gemini 2.5 Pro

AA INDEX87editorial 2026-05编辑估算

跳转到 OpenRouter · 推荐链接 · 查看披露 →

输入价格$1.25/百万 tokens
输出价格$10.00/百万 tokens
上下文1Mtokens
P50 延迟0.8s首字延迟厂商提供
吞吐量140tok/s厂商提供

The largest context window in the top tier (1M tokens), which makes it the natural pick for whole-codebase code review, full-document legal analysis, and long video transcript processing. Multimodal handling is best-in-class on image + text. Slower first-token than GPT-5, pricier than DeepSeek V3.5 — choose when context size is the constraint.

本模型被以下文章深度评测:GPT-5 dominated the AI conversation this week

完整定价详情

全部价格按美元 / 百万 token 计 · 所有隐性成本已揭示
Pricing tierInputOutputNotes
Standard
Pay-as-you-go API rate card
$1.25$10.00Default if no headers set
With prompt caching
Cached read · cache write
$0.13−90%$10.00Cache writes priced at standard input ($1.25); cached reads 90% off. Requires cache_control header.
Batch API (50% off)
24h SLA, async
$0.63−50%$5.00−50%Send up to 50K requests in one batch; results within 24h
Image input (vision)
Per image, low/high detail differ
$1.25 + image fee$10.00$0.50 per 1024×1024 image at high detail; low detail ~$0.10
Reasoning tokens
Hidden chain-of-thought tokens
$1.25$10.00 × ~3-8When using reasoning.effort: high, output tokens include hidden reasoning. Real cost can be 3-8× headline output.
Structured output
JSON mode + schema
$1.25$10.00No extra cost; first call may be slower (~2x latency) due to schema compilation

能力

Vision (image understanding)Function callingStructured output (JSON schema)Reasoning modeMultilingual (32+ langs)Batch APIAudio I/OFine-tuning

常见场景月度费用

默认用量假设

考虑这些替代方案

出现在这些场景中

同厂商更多模型:Google