AGI·EVALSSign in
← Docs/ Embodied

ALFRED

Roadmap

Follow language instructions to complete household tasks from vision.

On the roadmap

ALFRED is catalogued but not runnable yet, so there are no usage docs — we do not document what does not run. The fact sheet below is sourced from the paper; the protocols it will implement are stable today.

Paper
ALFRED: A Benchmark for Interpreting Grounded Instructions for Everyday Tasks
Citation
Shridhar et al., 2019, arXiv:1912.01734
License
MIT
How an eval goes live
  1. Implement an EvalRunner against the stable protocols.
  2. Bundle a small real-schema sample so it runs offline.
  3. Point the catalog entry's runner at the class.
  4. Ship its docs in the same change — required to flip live.

pip install agi-evals