Variants
Pass the configurations you want to test:

Groups
Run each variant multiple times to get a distribution:

The hud.eval manager will parallelize your evals automatically and show the distribution across all your runs on hud.ai.
Find out which model actually performs best for your use case.
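To make the two ideas concrete, here is a minimal, library-independent sketch of what variants and groups amount to: every combination of configuration values is expanded into its own run, each combination is repeated a fixed number of times, and the runs execute concurrently. It deliberately does not call the hud SDK. The names `expand_variants`, `run_once`, and `run_grid` are hypothetical, the model names are placeholders, and the scores are random stand-ins for real eval results.

```python
import asyncio
import itertools
import random
import statistics


def expand_variants(variants):
    """Expand {"model": [...], "temperature": [...]} into one dict per combination."""
    keys = list(variants)
    combos = itertools.product(*(variants[k] for k in keys))
    return [dict(zip(keys, combo)) for combo in combos]


async def run_once(config, repeat):
    """Hypothetical stand-in for one eval run; returns a placeholder score in [0, 1)."""
    rng = random.Random(f"{sorted(config.items())}-{repeat}")
    await asyncio.sleep(0)  # yield control, as a real network-bound eval would
    return rng.random()


async def run_grid(variants, group):
    """Run every variant combination `group` times, all concurrently."""
    jobs = [(cfg, i) for cfg in expand_variants(variants) for i in range(group)]
    scores = await asyncio.gather(*(run_once(cfg, i) for cfg, i in jobs))
    results = {}
    for (cfg, _), score in zip(jobs, scores):
        # Use a sorted tuple of items as a hashable key for each configuration.
        results.setdefault(tuple(sorted(cfg.items())), []).append(score)
    return results


if __name__ == "__main__":
    variants = {"model": ["model-a", "model-b"], "temperature": [0.0, 0.7]}
    results = asyncio.run(run_grid(variants, group=3))
    for cfg, scores in results.items():
        mean = statistics.mean(scores)
        spread = statistics.pstdev(scores)
        print(dict(cfg), f"mean={mean:.2f} stdev={spread:.2f}")
```

Two models and two temperatures give four combinations, so a group size of 3 produces twelve runs in total. Comparing the per-combination mean and spread, rather than a single score, is what lets you judge whether one configuration is reliably better or just got lucky on one run.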