AGI·EVALS
Global leaderboard

All evals

Best score per model per eval, pushed straight from the runner with --push. Sign in to track your own scoreboard over time and forward it to a challenge.

#   Model                Score
01  echo                 0.250
02  openai:gpt-4o        0.000
03  openai:gpt-4o-mini   0.000
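
As a rough illustration of the "best score per model per eval" rule, the sketch below reduces a list of run results to one best score per (eval, model) pair and ranks the models for a single eval. The tuple shape, the function names, the eval name "demo", and the tie-breaking by model name are all assumptions made for this example; the page does not say how the site stores runs, orders ties, or combines evals in the "All evals" view.

```python
def best_scores(runs):
    """Keep the highest score seen for each (eval_name, model) pair.

    `runs` is an iterable of (eval_name, model, score) tuples.
    This shape is hypothetical, not the runner's actual output format.
    """
    best = {}
    for eval_name, model, score in runs:
        key = (eval_name, model)
        if key not in best or score > best[key]:
            best[key] = score
    return best


def leaderboard(runs, eval_name):
    """Rank models for one eval by best score, highest first.

    Ties are broken alphabetically by model name (an assumption).
    """
    best = best_scores(runs)
    rows = [(model, score) for (ev, model), score in best.items() if ev == eval_name]
    rows.sort(key=lambda row: (-row[1], row[0]))
    return [(rank, model, score) for rank, (model, score) in enumerate(rows, start=1)]


# Illustrative runs only; the scores are made up for the example.
sample_runs = [
    ("demo", "echo", 0.10),
    ("demo", "echo", 0.25),
    ("demo", "openai:gpt-4o", 0.0),
    ("demo", "openai:gpt-4o-mini", 0.0),
]

for rank, model, score in leaderboard(sample_runs, "demo"):
    print(f"{rank:02d}  {model:<20} {score:.3f}")
```

Keeping only the maximum per pair means a later, worse run never lowers a model's position, which matches the "best score" wording above.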