Supported Models

GroqCloud currently supports the following models:


Production Models

Note: Production models are intended for use in your production environments. They meet or exceed our high standards for speed, quality, and reliability. Read more here.

MODEL IDDEVELOPERCONTEXT WINDOW (TOKENS)MAX COMPLETION TOKENSMAX FILE SIZEDETAILS
distil-whisper-large-v3-en
Hugging Face
-
-
100 MB
gemma2-9b-it
Google
8,192
8,192
-
llama-3.1-8b-instant
Meta
131,072
131,072
-
llama-3.3-70b-versatile
Meta
131,072
32,768
-
meta-llama/llama-guard-4-12b
Meta
131,072
1,024
20 MB
whisper-large-v3
OpenAI
-
-
100 MB
whisper-large-v3-turbo
OpenAI
-
-
100 MB

Preview Models

Note: Preview models are intended for evaluation purposes only and should not be used in production environments as they may be discontinued at short notice. Read more about deprecations here.

MODEL IDDEVELOPERCONTEXT WINDOW (TOKENS)MAX COMPLETION TOKENSMAX FILE SIZEDETAILS
deepseek-r1-distill-llama-70b
DeepSeek / Meta
131,072
131,072
-
meta-llama/llama-4-maverick-17b-128e-instruct
Meta
131,072
8,192
20 MB
meta-llama/llama-4-scout-17b-16e-instruct
Meta
131,072
8,192
20 MB
meta-llama/llama-prompt-guard-2-22m
Meta
512
512
-
meta-llama/llama-prompt-guard-2-86m
Meta
512
512
-
mistral-saba-24b
Mistral AI
32,768
32,768
-
playai-tts
PlayAI
8,192
8,192
-
playai-tts-arabic
PlayAI
8,192
8,192
-
qwen-qwq-32b
Alibaba Cloud
131,072
131,072
-
qwen/qwen3-32b
Alibaba Cloud
131,072
40,960
-

Preview Systems

Systems are a collection of models and tools that work together to answer a user query.


Note: Preview systems are intended for evaluation purposes only and should not be used in production environments as they may be discontinued at short notice. Read more about deprecations here.

MODEL IDDEVELOPERCONTEXT WINDOW (TOKENS)MAX COMPLETION TOKENSMAX FILE SIZEDETAILS
compound-beta
Groq
131,072
8,192
-
compound-beta-mini
Groq
131,072
8,192
-

Learn More About Agentic Tooling
Discover how to build powerful applications with real-time web search and code execution

Deprecated models are models that are no longer supported or will no longer be supported in the future. See our deprecation guidelines and deprecated models here.


Hosted models are directly accessible through the GroqCloud Models API endpoint using the model IDs mentioned above. You can use the https://api.groq.com/openai/v1/models endpoint to return a JSON list of all active models:

Python
import requests
import os

api_key = os.environ.get("GROQ_API_KEY")
url = "https://api.groq.com/openai/v1/models"

headers = {
    "Authorization": f"Bearer {api_key}",
    "Content-Type": "application/json"
}

response = requests.get(url, headers=headers)

print(response.json())

Was this page helpful?