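Input price: $1.25 / million tokens
Output price: $10.00 / million tokens
Context: 400K tokens
P50 latency: 0.7 s time to first token (vendor-reported)
Throughput: 120 tok/s (vendor-reported)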
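
The latency and throughput figures above can be combined into a rough end-to-end estimate for a streamed response. The sketch below assumes the simple model "time to first token plus output length divided by throughput"; the function name `estimate_response_seconds` is hypothetical, and real latency will vary with load, prompt size, and reasoning effort.

```python
# Vendor-reported figures from the spec card above.
TTFT_SECONDS = 0.7            # P50 time to first token
THROUGHPUT_TOK_PER_S = 120.0  # sustained output throughput


def estimate_response_seconds(output_tokens: int) -> float:
    """Rough wall-clock time to receive `output_tokens` tokens.

    Assumption: generation proceeds at the constant vendor-reported
    throughput once the first token has arrived.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return TTFT_SECONDS + output_tokens / THROUGHPUT_TOK_PER_S


if __name__ == "__main__":
    for n in (100, 600, 2000):
        print(f"{n:>5} tokens: ~{estimate_response_seconds(n):.1f} s")
```

Under this model a 600-token answer takes about 0.7 + 600 / 120 = 5.7 seconds, so for long outputs throughput dominates and the time to first token matters little.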
OpenAI's flagship as of 2026, GPT-5 leads on broad benchmark composites and is the default frontier choice for production reasoning workloads. It posts the highest AA Intelligence Index in this table, the most stable function calling under load, and tool support that most agent frameworks treat as the reference. The tradeoff is cost: input and output rates run several times those of Claude Haiku or DeepSeek V3.5. Pick GPT-5 when reasoning depth or downstream tool reliability outweighs token spend, for example in autonomous agents, complex code generation, or multi-step financial analysis.
Full pricing details
All prices are in USD per million tokens · all hidden costs are disclosed

| Pricing tier | Input | Output | Notes |
|---|---|---|---|
| Standard (pay-as-you-go API rate card) | $1.25 | $10.00 | Default if no headers set |
| With prompt caching (cached read · cache write) | $0.13 (−90%) | $10.00 | Cache writes priced at standard input ($1.25); cached reads 90% off. Requires `cache_control` header. |
| Batch API (50% off; 24h SLA, async) | $0.63 (−50%) | $5.00 (−50%) | Send up to 50K requests in one batch; results within 24h |
| Image input (vision; per image, low/high detail differ) | $1.25 + image fee | $10.00 | $0.50 per 1024×1024 image at high detail; low detail ~$0.10 |
| Reasoning tokens (hidden chain-of-thought tokens) | $1.25 | $10.00 × ~3-8 | With `reasoning.effort: high`, output tokens include hidden reasoning. Real cost can be 3-8× the headline output price. |
| Structured output (JSON mode + schema) | $1.25 | $10.00 | No extra cost; first call may be slower (~2× latency) due to schema compilation |
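
To see how the tiers in the table interact, the sketch below estimates the cost of a single request from the listed rates. It is an illustration under stated assumptions, not a billing implementation: the function `estimate_cost` and its parameters are hypothetical, the reasoning overhead is modeled as a plain multiplier on output tokens, and the batch discount is assumed not to stack with caching because the table does not say whether it does. Image fees are left out.

```python
# Rates in USD per million tokens, taken from the pricing table above.
INPUT_RATE = 1.25
OUTPUT_RATE = 10.00
CACHED_READ_RATE = 0.13  # cached reads, roughly 90% off standard input
BATCH_DISCOUNT = 0.5     # Batch API: 50% off input and output


def estimate_cost(
    input_tokens: int,
    output_tokens: int,
    cached_tokens: int = 0,
    batch: bool = False,
    reasoning_multiplier: float = 1.0,
) -> float:
    """Estimated USD cost of one request.

    cached_tokens: portion of input_tokens served as cached reads.
    reasoning_multiplier: assumed factor by which hidden reasoning tokens
        inflate billed output (the table suggests ~3-8 at high effort).
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    if batch and cached_tokens:
        # Assumption: the table does not state that the discounts combine.
        raise ValueError("this sketch does not combine batch and caching")

    fresh_tokens = input_tokens - cached_tokens
    input_cost = fresh_tokens * INPUT_RATE + cached_tokens * CACHED_READ_RATE
    output_cost = output_tokens * reasoning_multiplier * OUTPUT_RATE
    total = (input_cost + output_cost) / 1_000_000
    return total * BATCH_DISCOUNT if batch else total


if __name__ == "__main__":
    print(f"standard:  ${estimate_cost(10_000, 1_000):.5f}")
    print(f"cached:    ${estimate_cost(10_000, 1_000, cached_tokens=8_000):.5f}")
    print(f"batch:     ${estimate_cost(10_000, 1_000, batch=True):.5f}")
    print(f"reasoning: ${estimate_cost(10_000, 1_000, reasoning_multiplier=5):.5f}")
```

For a request with 10,000 input and 1,000 output tokens, the standard rate works out to (10,000 × 1.25 + 1,000 × 10.00) / 1,000,000 = $0.0225. With a reasoning multiplier of 5 the same request costs $0.0625, which shows why hidden reasoning tokens, not the input rate, tend to dominate spend at high effort.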
Capabilities
Vision (image understanding) · Function calling · Structured output (JSON schema) · Reasoning mode · Multilingual (32+ langs) · Batch API · Audio I/O · Fine-tuning