Catalog
3 evals — Safety / Security
The same catalog/evals.yaml the CLI reads. Live means it runs end-to-end today; building and roadmap entries show exactly what is coming and welcome contributions.
| Eval | Category | Paper | License | Status |
|---|---|---|---|---|
| HarmBench | Safety / Security | Live | ||
| AILuminate | Safety / Security | Live | ||
| JailbreakBench | Safety / Security | Live |