Groq

Llama 3.3 70B Versatile

Groq-hosted fast serving tier for Meta’s large open model family.

Best for: Fastest · Coverage: full · Release: GA · Synced within 7d
Pricing: Official Live · Limits: Curated Official · Benchmarks: Third-Party
Current API snapshot
Official pricing, context, and latency normalized into a single current view.
Input price: $0.59
Output price: $0.79
Context window: 131.1K
Latency: 290 ms
Benchmark snapshot
Curated quality and performance signals from the latest accepted benchmark source.
Intelligence: 79
Coding: 82
Throughput: 240 tok/s
Benchmark source: Artificial Analysis
Rank persona scores
Same ranking formulas used across the public tables, exposed directly for this model.
Cheapest: 96.1
Fastest: 100.0
Smartest: 11.1
Best value: 48.7
Formula reference
Persona definitions are fixed and public. The score meanings are strict, not inferred.
Cheapest: Lowest total token cost only. Score = normalized total token cost only, where total cost = input price + output price.
Fastest: Lowest observed latency only. Score = normalized latency only.
Smartest: Highest intelligence benchmark score only. Score = normalized intelligence benchmark only.
Best Value: The most balanced tradeoff between cost, quality, and responsiveness. Score = 45% intelligence + 35% cost + 10% latency + 10% context.
Coding: Highest coding benchmark score only. Score = normalized coding benchmark only.
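The Best Value weighting and the Cheapest cost input can be sketched in a few lines of Python. This is an illustration only: the page does not say how each component is normalized, so the sketch assumes every component has already been normalized to a 0-100 scale where higher is better. The names `WEIGHTS`, `total_token_cost`, and `best_value_score` are hypothetical, and the example component values are assumptions, not figures published as Best Value inputs.

```python
# Weights taken from the Best Value formula above.
WEIGHTS = {
    "intelligence": 0.45,
    "cost": 0.35,
    "latency": 0.10,
    "context": 0.10,
}


def total_token_cost(input_price: float, output_price: float) -> float:
    """Total cost as defined for the Cheapest persona: input price + output price."""
    return input_price + output_price


def best_value_score(components: dict) -> float:
    """Weighted sum of already-normalized component scores.

    ASSUMPTION: each component is on a 0-100 scale, higher is better.
    The normalization method itself is not given in the source.
    """
    missing = set(WEIGHTS) - set(components)
    if missing:
        raise ValueError(f"missing components: {sorted(missing)}")
    return sum(WEIGHTS[name] * components[name] for name in WEIGHTS)


# Input and output prices from the current API snapshot.
print(round(total_token_cost(0.59, 0.79), 2))

# ASSUMPTION: the persona scores are reused as normalized components;
# the context value is a placeholder because the page does not list one.
example = {"intelligence": 11.1, "cost": 96.1, "latency": 100.0, "context": 0.0}
print(round(best_value_score(example), 2))
```

Keeping the weights in a single mapping makes it easy to confirm that they sum to 100% and to swap in a different normalization once it is known.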
Sources
Every displayed field is tied back to a collected source and timestamp.
pricing: official · collected Mar 10, 2026 · https://console.groq.com/docs/models
limits: official · collected Mar 9, 2026 · https://console.groq.com/docs/models
benchmarks: benchmark · collected Mar 9, 2026 · https://artificialanalysis.ai/
Cost Calculator
Estimate daily and monthly spend for Llama 3.3 70B Versatile.
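A rough sketch of the spend arithmetic, using the input and output prices listed for this model. It assumes the prices are USD per 1 million tokens (the page does not state the unit) and a 30-day month. The function name `estimate_spend` and the example traffic figures are hypothetical.

```python
# Prices from the current API snapshot for Llama 3.3 70B Versatile.
# ASSUMPTION: USD per 1,000,000 tokens; the unit is not stated on the page.
INPUT_PRICE = 0.59
OUTPUT_PRICE = 0.79
TOKENS_PER_PRICE_UNIT = 1_000_000
DAYS_PER_MONTH = 30  # ASSUMPTION: flat 30-day month


def estimate_spend(requests_per_day: int,
                   input_tokens_per_request: int,
                   output_tokens_per_request: int) -> dict:
    """Return estimated daily and monthly spend in USD."""
    daily_input_tokens = requests_per_day * input_tokens_per_request
    daily_output_tokens = requests_per_day * output_tokens_per_request
    daily = (
        daily_input_tokens / TOKENS_PER_PRICE_UNIT * INPUT_PRICE
        + daily_output_tokens / TOKENS_PER_PRICE_UNIT * OUTPUT_PRICE
    )
    return {"daily_usd": daily, "monthly_usd": daily * DAYS_PER_MONTH}


# Hypothetical workload: 10,000 requests per day,
# 1,000 input tokens and 500 output tokens per request.
spend = estimate_spend(10_000, 1_000, 500)
print(f"daily: ${spend['daily_usd']:.2f}, monthly: ${spend['monthly_usd']:.2f}")
```

Input and output tokens are priced separately because the two rates differ, so the split between prompt and completion length affects the total.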
Operational notes

Pricing notes: Live sync from Groq official model docs. Comparison uses the input and output token rates from the official llama-3.3-70b-versatile model row.

Data coverage: full

Max output tokens: 8.2K

Rate limit notes: Throughput limits vary by project and tier.

Updated: Mar 10, 2026