Deploying Environments
Start withhud init (see Environments) to scaffold locally. When ready:
- Go to hud.ai → New → Environment
- Connect your GitHub repo and name your environment
- Push changes and it rebuilds automatically, like Vercel
Running at Scale
Once deployed, create evals on hud.ai from your scripts. Each eval is a frozen configuration—same prompt, same scoring, every time. Your scenario might take arguments:| Eval Name | Arguments |
|---|---|
checkout-laptop | product_name="Laptop", apply_coupon=False |
checkout-phone-coupon | product_name="Phone", apply_coupon=True |
checkout-headphones | product_name="Headphones", apply_coupon=False |
What’s Next?
With your environment deployed:- Scale: Launch thousands of rollouts. Every run generates traces—prompts, tool calls, rewards.
- Analyze: See which evals agents struggle with. Compare models across your entire benchmark.
- Train: Use runs as training data. Fine-tune on successful completions. Run reinforcement learning to optimize for your specific environment.