Skip to main content
AIpricly

Glossary

Definitions for the metrics and terms used across AIpricly.

Editorial intelligence estimate
Capability estimate (0–100) authored by the AIpricly editorial team, calibrated against public benchmarks (MMLU, HumanEval, GPQA, MATH) and Artificial Analysis where comparable. Will be replaced with AA partner data once the API key arrives.Editorial estimate · all rows
Arena Elo
Human-preference Elo score (800–2000) from LMArena — blind A/B votes where real users pick the better response without knowing the model. Measures perceived quality, not just benchmarks.LMArena leaderboard
P50 Latency (TTFT)
Median first-token latency in seconds across our benchmark runs. Lower is faster.
Throughput
Sustained output speed in tokens per second after the first token arrives. Higher is faster.
Context Window
Maximum combined input + output token length. Larger windows allow longer documents, conversation history, and code files.