← Catalog/ Code
RepoBench
RoadmapRepository-level completion requiring cross-file retrieval and context.
Status
This eval is catalogued and on the roadmap. The protocols are stable — implementing it is an EvalRunner with a catalog entry.
Repository-level completion requiring cross-file retrieval and context.
This eval is catalogued and on the roadmap. The protocols are stable — implementing it is an EvalRunner with a catalog entry.