GPT-4.1
Flagship general-purpose model tuned for production-grade reasoning and tool use.
Best for: SmartestCoverage: fullRelease: GASynced within 7d
Pricing: Official LiveLimits: Curated OfficialBenchmarks: Third-Party
Current API snapshot
Official pricing, context, and latency normalized into a single current view.
Input price$2
Output price$8
Context window1M
Latency1400 ms
Benchmark snapshot
Curated quality and performance signals from the latest accepted benchmark source.
Intelligence95
Coding94
Throughput82 tok/s
Benchmark sourceArtificial Analysis
Rank persona scores
Same ranking formulas used across the public tables, exposed directly for this model.
Cheapest46.2
Fastest8.3
Smartest100.0
Best value72.0
Formula reference
Persona definitions are fixed and public. The score meanings are strict, not inferred.
CheapestLowest total token cost only.Score = normalized total token cost only, where total cost = input price + output price.
FastestLowest observed latency only.Score = normalized latency only.
SmartestHighest intelligence benchmark score only.Score = normalized intelligence benchmark only.
Best ValueThe most balanced tradeoff between cost, quality, and responsiveness.Score = 45% intelligence + 35% cost + 10% latency + 10% context.
CodingHighest coding benchmark score only.Score = normalized coding benchmark only.
Sources
Every displayed field is tied back to a collected source and timestamp.
pricingofficial · collected Mar 10, 2026https://developers.openai.com/api/docs/pricing
limitsofficial · collected Mar 9, 2026https://platform.openai.com/docs/models
benchmarksbenchmark · collected Mar 9, 2026https://artificialanalysis.ai/
Cost Calculator
Estimate daily and monthly spend for GPT-4.1.
Operational notes
Pricing notes: Live sync from OpenAI official pricing. Standard-tier rates are used for comparison.
Data coverage: full
Max output tokens: 32.8K
Rate limit notes: Rate limits vary by usage tier.
Updated: Mar 10, 2026