Catalog
5 evals — Robotics
The same catalog/evals.yaml the CLI reads. Live means it runs end-to-end today; building and roadmap entries show exactly what is coming and welcome contributions.
| Eval | Category | Paper | License | Status |
|---|---|---|---|---|
| ManiSkill 2 | Robotics | Roadmap | ||
| CALVIN | Robotics | Roadmap | ||
| VIMA-Bench | Robotics | Roadmap | ||
| BEHAVIOR-1K | Robotics | Roadmap | ||
| RoboBench | Robotics | Roadmap |