Command A
Cohere’s enterprise-oriented generation model tuned for agentic workflows and RAG.
Best for: Fastest · Coverage: full · Release: GA · Synced within 7d
Pricing: Official Live · Limits: Curated Official · Benchmarks: Third-Party
Current API snapshot
Official pricing, context, and latency normalized into a single current view.
Input price: $2.5
Output price: $10
Context window: 128K
Latency: 1010 ms
Benchmark snapshot
Curated quality and performance signals from the latest accepted benchmark source.
Intelligence: 84
Coding: 78
Throughput: 88 tok/s
Benchmark source: Artificial Analysis
Rank persona scores
Same ranking formulas used across the public tables, exposed directly for this model.
Cheapest: 31.8
Fastest: 40.5
Smartest: 38.9
Best value: 32.7
Formula reference
Persona definitions are fixed and public. The score meanings are strict, not inferred.
Cheapest: Lowest total token cost only. Score = normalized total token cost only, where total cost = input price + output price.
Fastest: Lowest observed latency only. Score = normalized latency only.
Smartest: Highest intelligence benchmark score only. Score = normalized intelligence benchmark only.
Best Value: The most balanced tradeoff between cost, quality, and responsiveness. Score = 45% intelligence + 35% cost + 10% latency + 10% context.
Coding: Highest coding benchmark score only. Score = normalized coding benchmark only.
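As a rough illustration of the Best Value weighting above, the sketch below combines four component scores using the stated 45/35/10/10 weights, and computes total cost as defined for the Cheapest persona. It assumes each component has already been normalized to a 0-100 scale where higher is better; the page does not say how normalization is done, so the function names `best_value_score` and `total_token_cost` and the sample inputs are hypothetical.

```python
# Weights taken from the Best Value formula stated above.
WEIGHTS = {"intelligence": 0.45, "cost": 0.35, "latency": 0.10, "context": 0.10}


def best_value_score(normalized: dict[str, float]) -> float:
    """Weighted sum of component scores.

    Assumption: every value in `normalized` is already on a 0-100 scale
    with higher meaning better. The site's normalization method is not
    documented on this page.
    """
    missing = WEIGHTS.keys() - normalized.keys()
    if missing:
        raise KeyError(f"missing components: {sorted(missing)}")
    return sum(WEIGHTS[name] * normalized[name] for name in WEIGHTS)


def total_token_cost(input_price: float, output_price: float) -> float:
    """Total cost as defined for the Cheapest persona: input price + output price."""
    return input_price + output_price


# Hypothetical normalized inputs, for illustration only.
example = {"intelligence": 80.0, "cost": 40.0, "latency": 50.0, "context": 60.0}
print(round(best_value_score(example), 2))

# Using the prices listed for Command A on this page.
print(total_token_cost(2.5, 10.0))  # 12.5
```

Because the weights sum to 1, the result stays on the same scale as the inputs, which makes it directly comparable with the single-metric persona scores.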
Sources
Every displayed field is tied back to a collected source and timestamp.
pricing: official · collected Mar 10, 2026 · https://docs.cohere.com/docs/command-a
limits: official · collected Mar 9, 2026 · https://docs.cohere.com/
benchmarks: benchmark · collected Mar 9, 2026 · https://artificialanalysis.ai/
Cost Calculator
Estimate daily and monthly spend for Command A.
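A minimal sketch of the kind of estimate such a calculator would produce, using the input and output prices listed on this page. It assumes those prices are quoted per one million tokens and that a month is 30 days; neither is stated here, so both are exposed as parameters. The function name `estimate_spend` and the traffic figures are hypothetical.

```python
from dataclasses import dataclass

INPUT_PRICE_USD = 2.5    # from this page; assumed to be per 1M input tokens
OUTPUT_PRICE_USD = 10.0  # from this page; assumed to be per 1M output tokens


@dataclass(frozen=True)
class SpendEstimate:
    daily_usd: float
    monthly_usd: float


def estimate_spend(
    input_tokens_per_day: int,
    output_tokens_per_day: int,
    *,
    tokens_per_price_unit: int = 1_000_000,  # assumption: prices are per 1M tokens
    days_per_month: int = 30,                # assumption: flat 30-day month
) -> SpendEstimate:
    """Estimate daily and monthly spend from average daily token volume."""
    daily = (
        input_tokens_per_day * INPUT_PRICE_USD
        + output_tokens_per_day * OUTPUT_PRICE_USD
    ) / tokens_per_price_unit
    return SpendEstimate(daily_usd=daily, monthly_usd=daily * days_per_month)


# Hypothetical workload: 4M input tokens and 1M output tokens per day.
est = estimate_spend(4_000_000, 1_000_000)
print(f"daily ${est.daily_usd:.2f}, monthly ${est.monthly_usd:.2f}")
# prints "daily $20.00, monthly $600.00"
```

Output tokens cost four times as much as input tokens at the listed prices, so the output volume tends to dominate the estimate for generation-heavy workloads.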
Operational notes
Pricing notes: Live sync from Cohere official model docs. Comparison uses the Command A pricing block from the official model page.
Data coverage: full
Max output tokens: 4.1K
Rate limit notes: Provider-specific limits vary by account.
Updated: Mar 10, 2026