API Provider5 active models3 organizationsgroq.com

Groq: API pricing, performance & models

Groq hosts 5 active AI models, with input pricing from $0.10 per 1M tokens, averaging 750 tok/s output throughput, with up to 100.0M context window. Compare Groq's API pricing, latency, and feature support against other LLM providers.

5Active
Pricing
$0.100/MFrom
$0.125/MAvg
Performance
750tok/sThroughput
0.44sLatency
100.0MMax

Catalog

OpenAI4PlayAI1
Type
Price
5 models

FAQ

Common questions about Groq.

What is Groq?

Groq is an API provider that hosts large language models. Active models: 5; From (input): $0.10 / 1M tok; Avg throughput: 750 tok/s; Avg latency: 0.44 s; Max context: 100.0M.

How many models does Groq offer?

Groq currently serves 5 active models out of 11 historical offerings on LLM Stats.

What is Groq's API pricing?

Groq input pricing starts from $0.10 per 1M tokens, with the most expensive offering at $0.15 per 1M tokens. See the Pricing tab above for the full per-model breakdown.

How fast is Groq?

Groq averages 750 output tokens per second across its catalog, with average latency of 0.44s. Per-model performance is shown in the Performance tab.

Does Groq support multimodal models?

Yes. Groq's catalog includes 1 audio models. See the Models and Capabilities tabs for the full per-model breakdown.

Whose models does Groq host?

Groq hosts models from OpenAI, PlayAI, and Meta. See the Models tab for the full catalog grouped by creator.

How do I start using Groq?

Sign up at https://groq.com/ to get an API key, then call Groq's API directly from your application. Use the Pricing and Performance tabs above to pick the right model for your latency, cost, and context-window requirements.