Groq's Responses API is fully compatible with OpenAI's Responses API, making it easy to integrate advanced conversational AI capabilities into your applications. The Responses API supports both text and image inputs while producing text outputs, stateful conversations, and function calling to connect with external systems.
The Responses API is currently in beta. Please let us know your feedback in our Community.
To use the Responses API with OpenAI's client libraries, configure your client with your Groq API key and set the base URL to https://api.groq.com/openai/v1
:
import OpenAI from "openai";
const client = new OpenAI({
apiKey: process.env.GROQ_API_KEY,
baseURL: "https://api.groq.com/openai/v1",
});
const response = await client.responses.create({
model: "openai/gpt-oss-20b",
input: "Tell me a fun fact about the moon in one sentence.",
});
console.log(response.output_text);
You can find your API key here.
Although Groq's Responses API is mostly compatible with OpenAI's Responses API, there are a few features we don't support just yet:
previous_response_id
store
truncation
include
safety_identifier
prompt_cache_key
In addition to a model's regular tool use capabilities, the Responses API supports various built-in tools to extend your model's capabilities.
While all models support the Responses API, these built-in tools are only supported for the following models:
Model ID | Browser Search | Code Execution |
---|---|---|
openai/gpt-oss-20b | ✅ | ✅ |
openai/gpt-oss-120b | ✅ | ✅ |
Here are examples using code execution and browser search:
Enable your models to write and execute Python code for calculations, data analysis, and problem-solving - see our code execution documentation for more details.
import OpenAI from "openai";
const client = new OpenAI({
apiKey: process.env.GROQ_API_KEY,
baseURL: "https://api.groq.com/openai/v1",
});
const response = await client.responses.create({
model: "openai/gpt-oss-20b",
input: "What is 1312 X 3333? Output only the final answer.",
tool_choice: "required",
tools: [
{
type: "code_interpreter",
container: {
"type": "auto"
}
}
]
});
console.log(response.output_text);
Give your models access to real-time web content and up-to-date information - see our browser search documentation for more details.
import OpenAI from "openai";
const client = new OpenAI({
apiKey: process.env.GROQ_API_KEY,
baseURL: "https://api.groq.com/openai/v1",
});
const response = await client.responses.create({
model: "openai/gpt-oss-20b",
input: "Analyze the current weather in San Francisco and provide a detailed forecast.",
tool_choice: "required",
tools: [
{
type: "browser_search"
}
]
});
console.log(response.output_text);
Use structured outputs to ensure the model's response follows a specific JSON schema. This is useful for extracting structured data from text, ensuring consistent response formats, or integrating with downstream systems that expect specific data structures.
For a complete list of models that support structured outputs, see our structured outputs documentation.
import OpenAI from "openai";
const openai = new OpenAI({
apiKey: process.env.GROQ_API_KEY,
baseURL: "https://api.groq.com/openai/v1",
});
const response = await openai.responses.create({
model: "moonshotai/kimi-k2-instruct-0905",
instructions: "Extract product review information from the text.",
input: "I bought the UltraSound Headphones last week and I'm really impressed! The noise cancellation is amazing and the battery lasts all day. Sound quality is crisp and clear. I'd give it 4.5 out of 5 stars.",
text: {
format: {
type: "json_schema",
name: "product_review",
schema: {
type: "object",
properties: {
product_name: { type: "string" },
rating: { type: "number" },
sentiment: {
type: "string",
enum: ["positive", "negative", "neutral"]
},
key_features: {
type: "array",
items: { type: "string" }
}
},
required: ["product_name", "rating", "sentiment", "key_features"],
additionalProperties: false
}
}
}
});
console.log(response.output_text);
{
"product_name": "UltraSound Headphones",
"rating": 4.5,
"sentiment": "positive",
"key_features": [
"noise cancellation",
"long battery life",
"crisp and clear sound quality"
]
}
Use reasoning to let the model produce an internal chain of thought before generating a response. This is useful for complex problem solving, multi-step agentic workflow planning, and scientific analysis.
For a complete list of models that support reasoning, see our reasoning documentation.
import OpenAI from "openai";
const client = new OpenAI({
apiKey: process.env.GROQ_API_KEY,
baseURL: "https://api.groq.com/openai/v1",
});
const response = await client.responses.create({
model: "openai/gpt-oss-20b",
input: "How are AI models trained? Be brief.",
reasoning: {
effort: "low"
}
});
console.log(response.output_text);
{
"id": "resp_01k3hgcytaf7vawfkph3pef9qk",
"object": "response",
"status": "completed",
"created_at": 1756155509,
"output": [
{
"type": "reasoning",
"id": "resp_01k3hgcytaf7vsyqqdk1932swk",
"status": "completed",
"content": [
{
"type": "reasoning_text",
"text": "Need brief explanation."
}
],
"summary": []
},
{
"type": "message",
"id": "msg_01k3hgcytaf7w9wzkh0w18ww1q",
"status": "completed",
"role": "assistant",
"content": [
{
"type": "output_text",
"text": "AI models are trained by showing them many examples and adjusting their internal parameters so they make better predictions.1. **Define a task** (e.g., classify images, translate text, predict next word). 2. **Gather data**—a large set of input‑output pairs. 3. **Choose a model architecture** (e.g., neural network layers). 4. **Initialize weights** randomly or from a pre‑trained checkpoint. 5. **Feed data** through the network, compute an error (loss) between the model’s output and the true answer. 6. **Back‑propagate the error** to update the weights using an optimizer (e.g., SGD, Adam). 7. **Repeat** over many epochs until the loss stops improving. 8. **Validate** on a separate dataset to check generalization. The process uses gradient descent and large‑scale computation (GPUs/TPUs) to handle the massive parameter count.",
"annotations": [],
"logprobs": null
}
]
}
],
"previous_response_id": null,
"model": "openai/gpt-oss-20b",
"reasoning": {
"effort": "low"
},
"max_output_tokens": null,
"text": {
"format": {
"type": "text"
}
},
"tools": [],
"tool_choice": "auto",
"truncation": "disabled",
"metadata": {},
"temperature": 1,
"top_p": 1,
"user": null,
"service_tier": "default",
"background": false,
"error": null,
"incomplete_details": null,
"usage": {
"input_tokens": 80,
"input_tokens_details": {
"cached_tokens": 0,
"reasoning_tokens": 0
},
"output_tokens": 213,
"output_tokens_details": {
"cached_tokens": 0,
"reasoning_tokens": 0
},
"total_tokens": 293
},
"parallel_tool_calls": true,
"store": false,
"top_logprobs": 0,
"max_tool_calls": null
}
The reasoning traces can be found in the result.output
array as type "reasoning":
{
"type": "reasoning",
"id": "resp_01k3hgcytaf7vsyqqdk1932swk",
"status": "completed",
"content": [
{
"type": "reasoning_text",
"text": "Need brief explanation."
}
],
"summary": []
},
Explore more advanced use cases in our built-in browser search and code execution documentation.