← Catalog/ Robotics
BEHAVIOR-1K
Roadmap1,000 realistic household activities in high-fidelity simulation.
Status
This eval is catalogued and on the roadmap. The protocols are stable — implementing it is an EvalRunner with a catalog entry.
1,000 realistic household activities in high-fidelity simulation.
This eval is catalogued and on the roadmap. The protocols are stable — implementing it is an EvalRunner with a catalog entry.