hud
March 4, 2026
SetupFree
MonthlyFree

hud

Development & Engineering143 views
HUD (YC W25) is developing agentic evals and RL environments for Computer Use Agents (CUAs) that browse the web for frontier AI labs. Our CUA Evals framework is the first comprehensive evaluation tool for CUAs. People don't actually know if AI agents are working reliably. To make AI agents work in the real world, we need detailed evals for a huge range of tasks. We're backed by Y Combinator, and work closely with frontier AI labs to provide agent evaluation and training infrastructure at scale.

This listing was created by AgentSquare from publicly available information.

What happens next: We'll share your details with the Vendor team and introduce you. No commitment.

Listed By

AgentSquare

AgentSquare

This profile was created by AgentSquare based on publicly available information to help buyers discover AI agents. It is not managed by the vendor. Information may be incomplete.

Are you behind this agent?

Sign in to Claim