Model catalog
21 models available across 9 providers. One API, consistent format.
Qwen 3 235B
Alibaba
Large MoE (22B active) for complex multilingual tasks.
qwen3-235b-a22b
Qwen 3 32B
Alibaba
Dual-mode thinking/non-thinking. 662 TPS on Groq hardware.
qwen3-32b
Claude Haiku 4.5
Anthropic
Ultra-fast responses at low cost. Ideal for high-throughput.
claude-haiku-4-5
Claude Opus 4.6
Anthropic
Most powerful Claude. Extended thinking, 1M beta context window.
claude-opus-4-6
Claude Sonnet 4.6
Anthropic
Balanced performance and speed for enterprise workloads.
claude-sonnet-4-6
DeepSeek V3.1
DeepSeek
671B MoE (37B active). Extreme efficiency with sparse attention.
deepseek-v3.1
Gemini 2.5 Flash Lite
Fastest Gemini variant at near-zero cost.
gemini-2.5-flash-lite
Gemini 2.5 Pro
2M context with native Google Search grounding.
gemini-2.5-pro
Gemini 3.1 Pro
Latest Gemini with advanced vibe coding and multimodality.
gemini-3.1-pro-preview
Llama 4 Maverick
Meta
400B MoE (17B active). Native multimodal, 562 TPS on Groq.
llama-4-maverick-17b-128e
Llama 4 Scout
Meta
109B MoE (17B active). Lean multimodal, near 600 TPS.
llama-4-scout-17b-16e
Mistral Large 3
Mistral AI
Enterprise-grade with 256K context. EU data sovereignty.
mistral-large-latest
Mistral Small 3
Mistral AI
Efficient model for routine tasks and high volume.
mistral-small-3
Kimi K2
Moonshot AI
1T MoE (32B active). Excels at frontend dev and tool calling.
kimi-k2-instruct
GPT-4.1
OpenAI
Reliable general-purpose model with function calling and vision.
gpt-4.1
GPT-5 Mini
OpenAI
Cost-efficient model for high-volume production tasks.
gpt-5-mini
GPT-5 Nano
OpenAI
Ultra-low cost for triage, extraction, and metadata tasks.
gpt-5-nano
GPT-5.2
OpenAI
Latest OpenAI flagship. 400K context with prompt caching support.
gpt-5.2
GPT-OSS 120B
OpenAI
Open-weight MoE (5.1B active). Optimized for agentic workflows.
gpt-oss-120b
GPT-OSS 20B
OpenAI
Compact 20B model. Over 1,000 TPS on Groq LPU hardware.
gpt-oss-20b
Grok 4
xAI
2M context with fast reasoning and competitive output pricing.
grok-4