Prometheus is an open-source monitoring system that collects and stores metrics as time series data. Its stable API is compatible with a range of systems and tools like Grafana.
This feature is only available to our Enterprise tier customers. To get started, please reach out to our Enterprise team.
Groq exposes Prometheus metrics about your organization's usage through VictoriaMetrics. It supports most Prometheus querying API paths:
/api/v1/query
/api/v1/query_range
/api/v1/series
/api/v1/labels
/api/v1/label/<label_name>/values
/api/v1/status/tsdb
Prometheus queries against Groq endpoints use MetricsQL, a query language that extends Prometheus's native PromQL query language.
Queries can be sent to the following endpoint:
https://api.groq.com/v1/metrics/prometheus
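As a rough sketch of how a range query URL might be assembled against this endpoint: the helper below appends the /api/v1/query_range path to the base URL and encodes the standard Prometheus parameters (query, start, end, step). The function name range_query_url and the timestamps are illustrative, and appending the path directly to the base endpoint is an assumption based on how Prometheus-compatible clients usually behave.

```python
import urllib.parse

BASE_URL = "https://api.groq.com/v1/metrics/prometheus"


def range_query_url(query: str, start: int, end: int, step: str) -> str:
    """Build a URL for a Prometheus-style range query (hypothetical helper).

    Assumption: the API path is appended to the base endpoint unchanged.
    start and end are Unix timestamps in seconds; step is a duration such as "60s".
    """
    params = urllib.parse.urlencode(
        {"query": query, "start": start, "end": end, "step": step}
    )
    return f"{BASE_URL}/api/v1/query_range?{params}"


# Example: one hour of the per-minute request rate at 60-second resolution.
url = range_query_url("requests:rate1m", 1700000000, 1700003600, "60s")
```

The resulting URL still needs the Authorization header described in this section before it can be requested.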
To authenticate, provide your Groq API key in the Authorization header, in the format Authorization: Bearer <your-api-key>.
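A minimal sketch of an authenticated instant query, using only the Python standard library. The helper names build_query_request and run_query are hypothetical, and the sketch assumes the /api/v1/query path is appended to the endpoint above and that the response is the usual Prometheus JSON body.

```python
import json
import urllib.parse
import urllib.request

BASE_URL = "https://api.groq.com/v1/metrics/prometheus"


def build_query_request(api_key: str, query: str) -> urllib.request.Request:
    """Build (but do not send) a GET request for an instant query."""
    # Assumption: the Prometheus API path is appended to the base endpoint.
    params = urllib.parse.urlencode({"query": query})
    url = f"{BASE_URL}/api/v1/query?{params}"
    # The API key travels in the Authorization header as a Bearer token.
    return urllib.request.Request(url, headers={"Authorization": f"Bearer {api_key}"})


def run_query(api_key: str, query: str) -> dict:
    """Send the request and decode the JSON response body."""
    request = build_query_request(api_key, query)
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)
```

Calling run_query("<your-api-key>", "requests:rate1m") would send the request; keeping the request construction separate makes it possible to inspect the URL and headers without touching the network.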
If you run Grafana, you can add Groq metrics as a Prometheus datasource:
https://api.groq.com/v1/metrics/prometheus
Set the Authorization header to your Groq API key:
Authorization: Bearer <your-api-key>
Groq provides the following metrics:
requests:increase1m
requests:rate1m
Broken out by model and status_code.
e2e_latency_seconds:{percentile}:rate5m
ttft_latency_seconds:{percentile}:rate5m
queue_latency_seconds:{percentile}:rate5m
Broken out by model.
tokens_in:{percentile}:rate5m
tokens_out:{percentile}:rate5m
Broken out by model.
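To illustrate how the metric names and labels above might be combined into query strings, here is a small sketch. The helpers metric_selector and percentile_metric are hypothetical, "example-model" is a placeholder model name, and the exact form of the {percentile} token is not specified here, so the caller supplies it as-is.

```python
def metric_selector(metric: str, **labels: str) -> str:
    """Return a selector such as requests:rate1m{model="m"} (hypothetical helper)."""
    if not labels:
        return metric
    parts = []
    for name, value in sorted(labels.items()):
        # Escape backslashes and double quotes inside label values.
        escaped = value.replace("\\", "\\\\").replace('"', '\\"')
        parts.append(f'{name}="{escaped}"')
    return metric + "{" + ", ".join(parts) + "}"


def percentile_metric(base: str, percentile: str) -> str:
    """Fill the {percentile} slot of a metric name such as e2e_latency_seconds.

    Assumption: the percentile token is passed through unchanged, because its
    exact format is not stated in this document.
    """
    return f"{base}:{percentile}:rate5m"


# Request rate for one model and status code ("example-model" is a placeholder).
selector = metric_selector("requests:rate1m", model="example-model", status_code="200")
```

Labels are sorted by name so the same filters always produce the same query string, which helps when queries are cached or compared.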
In addition to using the APIs directly, you can view a handful of curated charts in our console under Metrics.