Groq API

api.groq.com

Groq provides low-latency LLM inference through an OpenAI-compatible API. Use it to generate chat completions, transcribe audio, and translate audio into English with models such as Llama 3.3 and Whisper. Inference runs on Groq's custom LPU hardware, which is built for high token generation speeds.

Auth

api_key

Pricing

free

Spec version

1.0

Base URL

https://api.groq.com

Crawl failures

Last verified

2026-03-01T11:07:56.000Z

Health

This is a community-maintained manifest. Health monitoring is not available because this service doesn't host its own /.well-known/agent endpoint yet.

Capabilities

create_chat_completion

communication

Generate a chat completion using Groq's fast inference. Supports models like llama-3.3-70b-versatile, mixtral-8x7b-32768, and gemma2-9b-it. Accepts a messages array with role/content pairs.

Detail: https://api.groq.com/capabilities/create_chat_completion
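
A minimal sketch of how a create_chat_completion call might be assembled, using only the Python standard library. The route /openai/v1/chat/completions, the Bearer authorization header, and the choices[0].message.content response shape are assumptions based on OpenAI compatibility; this manifest does not state them. The helper names build_chat_request and extract_reply are hypothetical.

```python
import json
import urllib.request

BASE_URL = "https://api.groq.com"
# Assumption: OpenAI-style route; not stated in this manifest.
CHAT_PATH = "/openai/v1/chat/completions"


def build_chat_request(api_key, messages, model="llama-3.3-70b-versatile"):
    """Build (but do not send) a POST request for a chat completion.

    `messages` is a list of {"role": ..., "content": ...} dicts.
    """
    payload = {"model": model, "messages": messages}
    return urllib.request.Request(
        BASE_URL + CHAT_PATH,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            # Assumption: the api_key auth is sent as a Bearer token.
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def extract_reply(response_body):
    """Return the assistant text from an OpenAI-style response dict (assumed shape)."""
    return response_body["choices"][0]["message"]["content"]
```

To send the request, pass the result to urllib.request.urlopen and decode the JSON body before calling extract_reply. Keeping request construction separate from sending makes the payload easy to inspect without a network call.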

create_transcription

communication

Transcribe audio to text using Whisper Large V3 on Groq hardware. Accepts audio files in mp3, mp4, wav, webm, m4a, and other formats. Returns transcribed text with optional timestamps.

Detail: https://api.groq.com/capabilities/create_transcription
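
Audio uploads in OpenAI-compatible APIs are typically sent as multipart/form-data. The sketch below shows one way to encode such a request with the standard library. The route, the model ID whisper-large-v3, the field names (file, model, response_format), and the verbose_json value for timestamps are all assumptions carried over from the OpenAI convention, not facts from this manifest. The helper names are hypothetical.

```python
import urllib.request
import uuid

BASE_URL = "https://api.groq.com"
# Assumption: OpenAI-style route; not stated in this manifest.
TRANSCRIBE_PATH = "/openai/v1/audio/transcriptions"


def encode_multipart(fields, filename, file_bytes, boundary=None):
    """Encode text fields plus one file part as multipart/form-data.

    Returns (content_type_header, body_bytes).
    """
    boundary = boundary or uuid.uuid4().hex
    chunks = []
    for name, value in fields.items():
        head = f'--{boundary}\r\nContent-Disposition: form-data; name="{name}"\r\n\r\n'
        chunks.append(head.encode("utf-8") + str(value).encode("utf-8") + b"\r\n")
    file_head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    )
    chunks.append(file_head.encode("utf-8") + file_bytes + b"\r\n")
    # Closing delimiter marks the end of the multipart body.
    chunks.append(f"--{boundary}--\r\n".encode("utf-8"))
    return f"multipart/form-data; boundary={boundary}", b"".join(chunks)


def build_transcription_request(api_key, filename, file_bytes,
                                model="whisper-large-v3",
                                response_format="verbose_json"):
    """Build (but do not send) a transcription upload request."""
    content_type, body = encode_multipart(
        {"model": model, "response_format": response_format},
        filename,
        file_bytes,
    )
    return urllib.request.Request(
        BASE_URL + TRANSCRIBE_PATH,
        data=body,
        headers={"Authorization": f"Bearer {api_key}", "Content-Type": content_type},
        method="POST",
    )
```

A caller would read the audio file in binary mode, pass its bytes to build_transcription_request, and send the result with urllib.request.urlopen.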

create_translation

communication

Translate audio from any supported language into English text using Whisper Large V3. Accepts the same audio formats as transcription.

Detail: https://api.groq.com/capabilities/create_translation
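
Because translation accepts the same audio formats as transcription, a client can plausibly share one upload path and vary only the route. The sketch below illustrates that idea. Both route strings are assumptions based on the OpenAI convention, and NAMED_FORMATS covers only the extensions this manifest names, so a False result means "not named here", not "unsupported". The function names are hypothetical.

```python
import os

BASE_URL = "https://api.groq.com"
# Assumption: OpenAI-style audio routes; this manifest does not list paths.
AUDIO_PATHS = {
    "transcribe": "/openai/v1/audio/transcriptions",
    "translate": "/openai/v1/audio/translations",
}
# Only the formats the manifest names; it also mentions unnamed "other formats".
NAMED_FORMATS = {"mp3", "mp4", "wav", "webm", "m4a"}


def audio_endpoint(task):
    """Return the full URL for an audio task ("transcribe" or "translate")."""
    if task not in AUDIO_PATHS:
        raise ValueError(f"unknown audio task: {task!r}")
    return BASE_URL + AUDIO_PATHS[task]


def is_named_format(filename):
    """True if the file extension is one of the formats named in the manifest."""
    ext = os.path.splitext(filename)[1].lstrip(".").lower()
    return ext in NAMED_FORMATS
```

For example, audio_endpoint("translate") selects the translation route, and is_named_format("talk.MP3") checks the extension case-insensitively before an upload is attempted.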

list_models

communication

List all models currently available on Groq's inference platform, including their IDs, ownership, and creation dates.

Detail: https://api.groq.com/capabilities/list_models
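
A hedged sketch of listing models and summarizing the fields described above (ID, ownership, creation date). The route /openai/v1/models and the response shape, a data array whose items carry id, owned_by, and a Unix-timestamp created field, are assumptions drawn from the OpenAI convention rather than from this manifest. The helper names are hypothetical.

```python
import urllib.request
from datetime import datetime, timezone

BASE_URL = "https://api.groq.com"
# Assumption: OpenAI-style route; not stated in this manifest.
MODELS_PATH = "/openai/v1/models"


def build_models_request(api_key):
    """Build (but do not send) a GET request for the model list."""
    return urllib.request.Request(
        BASE_URL + MODELS_PATH,
        headers={"Authorization": f"Bearer {api_key}"},
        method="GET",
    )


def summarize_models(response_body):
    """Return sorted (id, owned_by, created_date) tuples from an assumed response shape.

    `created` is treated as seconds since the Unix epoch and rendered as a UTC date.
    """
    rows = []
    for model in response_body.get("data", []):
        created = datetime.fromtimestamp(model["created"], tz=timezone.utc)
        rows.append((model["id"], model["owned_by"], created.date().isoformat()))
    return sorted(rows)
```

Checking this list before hard-coding a model ID helps a client avoid requesting a model that is no longer available.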

Agent Preview

This is what an AI agent sees when it discovers this service via the Gateway:

Service: Groq API
Description: Groq provides ultra-fast LLM inference with an OpenAI-compatible API. Use Groq to generate chat completions, transcribe audio, and translate audio to English using models like Llama 3.3 and Whisper. Groq's custom LPU hardware delivers the fastest token generation speeds available.
Auth: api_key
Capabilities:
  - create_chat_completion: Generate a chat completion using Groq's fast inference. Supports models like llama-3.3-70b-versatile, mixtral-8x7b-32768, and gemma2-9b-it. Accepts a messages array with role/content pairs.
  - create_transcription: Transcribe audio to text using Whisper Large V3 on Groq hardware. Accepts audio files in mp3, mp4, wav, webm, m4a, and other formats. Returns transcribed text with optional timestamps.
  - create_translation: Translate audio from any supported language into English text using Whisper Large V3. Accepts the same audio formats as transcription.
  - list_models: List all models currently available on Groq's inference platform, including their IDs, ownership, and creation dates.