OpenAI

GPT-4.1 mini

Cost-efficient OpenAI model for high-throughput agent and app workloads.

Best for: CheapestCoverage: fullRelease: GASynced within 7d
Pricing: Official LiveLimits: Curated OfficialBenchmarks: Third-Party
Current API snapshot
Official pricing, context, and latency normalized into a single current view.
Input price$0.40
Output price$1.6
Context window1M
Latency780 ms
Benchmark snapshot
Curated quality and performance signals from the latest accepted benchmark source.
Intelligence82
Coding79
Throughput128 tok/s
Benchmark sourceArtificial Analysis
Rank persona scores
Same ranking formulas used across the public tables, exposed directly for this model.
Cheapest92.5
Fastest59.5
Smartest27.8
Best value60.8
Formula reference
Persona definitions are fixed and public. The score meanings are strict, not inferred.
CheapestLowest total token cost only.Score = normalized total token cost only, where total cost = input price + output price.
FastestLowest observed latency only.Score = normalized latency only.
SmartestHighest intelligence benchmark score only.Score = normalized intelligence benchmark only.
Best ValueThe most balanced tradeoff between cost, quality, and responsiveness.Score = 45% intelligence + 35% cost + 10% latency + 10% context.
CodingHighest coding benchmark score only.Score = normalized coding benchmark only.
Sources
Every displayed field is tied back to a collected source and timestamp.
pricingofficial · collected Mar 10, 2026https://developers.openai.com/api/docs/pricing
limitsofficial · collected Mar 9, 2026https://platform.openai.com/docs/models
benchmarksbenchmark · collected Mar 9, 2026https://artificialanalysis.ai/
Cost Calculator
Estimate daily and monthly spend for GPT-4.1 mini.
Operational notes

Pricing notes: Live sync from OpenAI official pricing. Standard-tier rates are used for comparison.

Data coverage: full

Max output tokens: 16.4K

Rate limit notes: Rate limits vary by usage tier.

Updated: Mar 10, 2026