CALVIN

Roadmap

Long-horizon language-conditioned manipulation from undirected play data.

On the roadmap

CALVIN is catalogued but not yet runnable, so it has no usage docs: we do not document what does not run. The fact sheet below is sourced from the paper, and the protocols its runner will implement are already stable.

Paper: CALVIN: A Benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks
Citation: Mees et al., 2021, arXiv:2112.03227
License: MIT
Homepage: http://calvin.cs.uni-freiburg.de

How an eval goes live

Implement an EvalRunner against the stable protocols.
Bundle a small real-schema sample so it runs offline.
Point the catalog entry's runner at the class.
Ship its docs in the same change; the entry cannot flip to live without them.
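
The steps above can be sketched roughly as follows. This is a minimal illustration only, not the real agi-eval interface: it imports nothing from the package, and the `EvalRunner` protocol shape, the `Sample` schema, the `CATALOG` dictionary layout, and the `go_live` and `run` helpers are all assumptions made up for this example. The two bundled samples are placeholders, not data taken from the CALVIN dataset.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, Protocol


@dataclass(frozen=True)
class Sample:
    """One bundled sample. Hypothetical schema, not the real CALVIN one."""
    instruction: str    # language instruction given to the policy
    expected_task: str  # task label the policy should end up completing


class EvalRunner(Protocol):
    """Stand-in for the runner protocol (assumed shape, for illustration)."""
    def samples(self) -> Iterable[Sample]: ...
    def score(self, sample: Sample, prediction: str) -> float: ...


class CalvinRunner:
    """Step 1: a runner written against the protocol.
    Step 2: a tiny sample bundled in code, so it runs offline."""
    _BUNDLED = (
        Sample("open the drawer", "open_drawer"),                # placeholder
        Sample("turn on the lightbulb", "turn_on_lightbulb"),    # placeholder
    )

    def samples(self) -> Iterable[Sample]:
        return self._BUNDLED

    def score(self, sample: Sample, prediction: str) -> float:
        # Exact-match success: 1.0 if the predicted task is the expected one.
        return 1.0 if prediction == sample.expected_task else 0.0


# Hypothetical catalog entry; the real catalog format may differ.
CATALOG = {"calvin": {"status": "roadmap", "runner": None, "docs": None}}


def go_live(catalog: dict, name: str, runner_cls: type, docs_path: str) -> dict:
    """Steps 3 and 4: point the entry at the class and require docs to flip live."""
    if not docs_path:
        raise ValueError("docs must ship in the same change")
    entry = catalog[name]
    entry["runner"] = runner_cls
    entry["docs"] = docs_path
    entry["status"] = "live"
    return entry


def run(runner: EvalRunner, policy: Callable[[str], str]) -> float:
    """Mean score of a policy over the runner's bundled samples."""
    scores = [runner.score(s, policy(s.instruction)) for s in runner.samples()]
    return sum(scores) / len(scores)
```

In this sketch, calling `go_live(CATALOG, "calvin", CalvinRunner, "docs/calvin.md")` flips the entry's status to live, while passing an empty docs path raises an error, mirroring the rule that docs ship in the same change. The docs path here is likewise a made-up example.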

pip install agi-eval