OpenAI

GPT-4.1 mini

Cost-efficient OpenAI model for high-throughput agent and app workloads.

Best for: CheapestCoverage: fullRelease: GASynced within 7d

Pricing: Official LiveLimits: Curated OfficialBenchmarks: Third-Party

Current API snapshot

Official pricing, context, and latency normalized into a single current view.

Input price$0.40

Output price$1.6

Context window1M

Latency780 ms

Benchmark snapshot

Curated quality and performance signals from the latest accepted benchmark source.

Intelligence82

Coding79

Throughput128 tok/s

Benchmark sourceArtificial Analysis

Rank persona scores

Same ranking formulas used across the public tables, exposed directly for this model.

Cheapest92.5

Fastest59.5

Smartest27.8

Best value60.8

Formula reference

Persona definitions are fixed and public. The score meanings are strict, not inferred.

CheapestLowest total token cost only.Score = normalized total token cost only, where total cost = input price + output price.

FastestLowest observed latency only.Score = normalized latency only.

SmartestHighest intelligence benchmark score only.Score = normalized intelligence benchmark only.

Best ValueThe most balanced tradeoff between cost, quality, and responsiveness.Score = 45% intelligence + 35% cost + 10% latency + 10% context.

CodingHighest coding benchmark score only.Score = normalized coding benchmark only.

Sources

Every displayed field is tied back to a collected source and timestamp.

limitsofficial · collected Mar 9, 2026https://platform.openai.com/docs/models

benchmarksbenchmark · collected Mar 9, 2026https://artificialanalysis.ai/

Cost Calculator

Estimate daily and monthly spend for GPT-4.1 mini.

Prompt tokens per requestCompletion tokens per requestRequests per dayRequests per month (optional override)

Operational notes

Pricing notes: Live sync from OpenAI official pricing. Standard-tier rates are used for comparison.

Data coverage: full

Max output tokens: 16.4K

Rate limit notes: Rate limits vary by usage tier.

Updated: Mar 10, 2026