Quick Start
The gateway is available athttps://inference.hud.ai. You can use it with any OpenAI-compatible client.
Using Python (OpenAI SDK)
Using curl
Supported Models
HUD Gateway supports models from major providers. For an up-to-date list, visit hud.ai/models.Anthropic
| Model | Routes |
|---|---|
claude-sonnet-4-5 | chat, messages |
claude-haiku-4-5 | chat, messages |
claude-opus-4-5 | chat, messages |
claude-opus-4-1 | chat, messages |
OpenAI
| Model | Routes |
|---|---|
gpt-5.1 | chat, responses |
gpt-5-mini | chat, responses |
gpt-4o | chat, responses |
gpt-4o-mini | chat, responses |
operator | responses |
Google Gemini
| Model | Routes |
|---|---|
gemini-3-pro-preview | chat |
gemini-2.5-pro | chat |
gemini-2.5-computer-use-preview | gemini |
xAI
| Model | Routes |
|---|---|
grok-4-1-fast | chat |
Z-AI (via OpenRouter)
| Model | Routes |
|---|---|
z-ai/glm-4.5v | chat |
Routes
Different models support different API routes:- chat - OpenAI Chat Completions API (
/chat/completions) - messages - Anthropic Messages API (
/messages) - responses - OpenAI Responses API (
/responses) - gemini - Google Gemini native API
Features
Unified Billing
When using HUD Gateway with your HUD API key, usage is automatically deducted from your HUD credits. This simplifies billing by consolidating multiple provider invoices into one.Rate Limits
HUD Gateway automatically handles key rotation and rate limiting across our pool of enterprise keys.Using with HUD Agents
You can use HUD Gateway withOpenAIChatAgent for any model that supports the chat route:
Building Custom Agents with Tracing
For a complete example of building a custom agent that uses HUD Gateway with full tracing support, see the custom agent example. This example demonstrates:- Using the
@instrumentdecorator to capture inference traces - Building a custom
MCPAgentwith HUD Gateway - Automatic token usage and latency tracking