Fast LLM inference, OpenAI-compatible. Simple to integrate, easy to scale. Start building in minutes.
curl https://api.groq.com/openai/v1/chat/completions -s \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $GROQ_API_KEY" \
  -d '{
    "model": "llama-3.3-70b-versatile",
    "messages": [{
      "role": "user",
      "content": "Explain the importance of fast language models"
    }]
  }'
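Because the endpoint is OpenAI-compatible, existing OpenAI client code can typically be reused by changing only the base URL and API key. The sketch below is illustrative, not an official snippet: the base URL and model name are taken from the curl example above, and it assumes the openai Python package (v1+) is installed and GROQ_API_KEY is set in your environment.

# Minimal sketch: reuse the OpenAI Python SDK against Groq's
# OpenAI-compatible endpoint (assumptions noted above).
import os

from openai import OpenAI

client = OpenAI(
    base_url="https://api.groq.com/openai/v1",  # endpoint from the curl example
    api_key=os.environ["GROQ_API_KEY"],
)

completion = client.chat.completions.create(
    model="llama-3.3-70b-versatile",
    messages=[
        {"role": "user", "content": "Explain the importance of fast language models"}
    ],
)

print(completion.choices[0].message.content)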
Essential resources to accelerate your development and maximize productivity
We’re adding new models all the time and will let you know when a new one comes online. See full details on our Models page.