Supported Models

Explore all available models on GroqCloud.

Production Models

Note: Production models are intended for use in your production environments. They meet or exceed our high standards for speed, quality, and reliability. Read more here.

MODEL IDDEVELOPERCONTEXT WINDOW (TOKENS)MAX COMPLETION TOKENSMAX FILE SIZEDETAILS
llama-3.1-8b-instant
Meta
131,072
131,072
-
llama-3.3-70b-versatile
Meta
131,072
32,768
-
meta-llama/llama-guard-4-12b
Meta
131,072
1,024
20 MB
openai/gpt-oss-120b
OpenAI
131,072
65,536
-
openai/gpt-oss-20b
OpenAI
131,072
65,536
-
whisper-large-v3
OpenAI
-
-
100 MB
whisper-large-v3-turbo
OpenAI
-
-
100 MB

Preview Models

Note: Preview models are intended for evaluation purposes only and should not be used in production environments as they may be discontinued at short notice. Read more about deprecations here.

MODEL IDDEVELOPERCONTEXT WINDOW (TOKENS)MAX COMPLETION TOKENSMAX FILE SIZEDETAILS
deepseek-r1-distill-llama-70b
DeepSeek / Meta
131,072
131,072
-
meta-llama/llama-4-maverick-17b-128e-instruct
Meta
131,072
8,192
20 MB
meta-llama/llama-4-scout-17b-16e-instruct
Meta
131,072
8,192
20 MB
meta-llama/llama-prompt-guard-2-22m
Meta
512
512
-
meta-llama/llama-prompt-guard-2-86m
Meta
512
512
-
moonshotai/kimi-k2-instruct
Moonshot AI
131,072
16,384
-
playai-tts
PlayAI
8,192
8,192
-
playai-tts-arabic
PlayAI
8,192
8,192
-
qwen/qwen3-32b
Alibaba Cloud
131,072
40,960
-

Preview Systems

Systems are a collection of models and tools that work together to answer a user query.


Note: Preview systems are intended for evaluation purposes only and should not be used in production environments as they may be discontinued at short notice. Read more about deprecations here.

MODEL IDDEVELOPERCONTEXT WINDOW (TOKENS)MAX COMPLETION TOKENSMAX FILE SIZEDETAILS
compound-beta
Groq
131,072
8,192
-
compound-beta-mini
Groq
131,072
8,192
-

Learn More About Agentic Tooling
Discover how to build powerful applications with real-time web search and code execution

Deprecated models are models that are no longer supported or will no longer be supported in the future. See our deprecation guidelines and deprecated models here.


Hosted models are directly accessible through the GroqCloud Models API endpoint using the model IDs mentioned above. You can use the https://api.groq.com/openai/v1/models endpoint to return a JSON list of all active models:

Python
1import requests
2import os
3
4api_key = os.environ.get("GROQ_API_KEY")
5url = "https://api.groq.com/openai/v1/models"
6
7headers = {
8    "Authorization": f"Bearer {api_key}",
9    "Content-Type": "application/json"
10}
11
12response = requests.get(url, headers=headers)
13
14print(response.json())

Was this page helpful?