GroqCloud currently supports the following models:


Production Models

Note: Production models are intended for use in your production environments. They meet or exceed our high standards for speed, quality, and reliability. Read more here.

MODEL IDDEVELOPERCONTEXT WINDOW (TOKENS)MAX COMPLETION TOKENSMAX FILE SIZEDETAILS
gemma2-9b-it
Google
8,192
-
-
llama-3.3-70b-versatile
Meta
128K
32,768
-
llama-3.1-8b-instant
Meta
128K
8,192
-
llama-guard-3-8b
Meta
8,192
-
-
llama3-70b-8192
Meta
8,192
-
-
llama3-8b-8192
Meta
8,192
-
-
whisper-large-v3
OpenAI
-
-
25 MB
whisper-large-v3-turbo
OpenAI
-
-
25 MB
distil-whisper-large-v3-en
HuggingFace
-
-
25 MB

Preview Models

Note: Preview models are intended for evaluation purposes only and should not be used in production environments as they may be discontinued at short notice. Read more about deprecations here.

MODEL IDDEVELOPERCONTEXT WINDOW (TOKENS)MAX COMPLETION TOKENSMAX FILE SIZEDETAILS
allam-2-7b
Saudi Data and AI Authority (SDAIA)
4,096
-
-
deepseek-r1-distill-llama-70b
DeepSeek
128K
-
-
meta-llama/llama-4-maverick-17b-128e-instruct
Meta
131,072
8192
-
meta-llama/llama-4-scout-17b-16e-instruct
Meta
131,072
8192
-
mistral-saba-24b
Mistral
32K
-
-
playai-tts
Playht, Inc
10K
-
playai-tts-arabic
Playht, Inc
10K
-
-
qwen-qwq-32b
Alibaba Cloud
128K
-
-

Preview Systems

Systems are a collection of models and tools that work together to answer a user query.

Note: Preview systems are intended for evaluation purposes only and should not be used in production environments as they may be discontinued at short notice. Read more about deprecations here.

MODEL IDDEVELOPERCONTEXT WINDOW (TOKENS)MAX COMPLETION TOKENSMAX FILE SIZEDETAILS
compound-beta
Groq
128K
8192
-
compound-beta-mini
Groq
128K
8192
-

Learn More About Agentic Tooling
Discover how to build powerful applications with real-time web search and code execution

Deprecated models are models that are no longer supported or will no longer be supported in the future. See our deprecation guidelines and deprecated models here.


Hosted models are directly accessible through the GroqCloud Models API endpoint using the model IDs mentioned above. You can use the https://api.groq.com/openai/v1/models endpoint to return a JSON list of all active models:

1import requests
2import os
3
4api_key = os.environ.get("GROQ_API_KEY")
5url = "https://api.groq.com/openai/v1/models"
6
7headers = {
8    "Authorization": f"Bearer {api_key}",
9    "Content-Type": "application/json"
10}
11
12response = requests.get(url, headers=headers)
13
14print(response.json())