
| FEATURE | VALUE |
|---|---|
| Context Window (Tokens) | 128K |
| Max Output Tokens | 16,384 |
| Max File Size | - |
| Token Generation Speed | ~140 tps |
| Input Token Price | $0.69/1M tokens |
| Output Token Price | $0.69/1M tokens |
| Tool Use | |
| JSON Mode | |
| Image Support |
Experience the reasoning capabilities of deepseek-r1-distill-qwen-32b with Groq speed now:
pip install groqfrom groq import Groq
client = Groq()
completion = client.chat.completions.create(
model="deepseek-r1-distill-qwen-32b",
messages=[
{
"role": "user",
"content": "Explain why fast inference is critical for reasoning models"
}
]
)
print(completion.choices[0].message.content)