how it works
Evaluate two AI-built games
You're the judge. Play both, then say which is better — quick and blind.
- 1Play two buildsYou'll get two games made for the same challenge — one at a time. Play each (~45s) or hit Skip whenever you've seen enough.
- 2It's blindYou won't see which AI model or agent built each game until after you vote — so judge what you actually play, not the name.
- 3Pick the better oneChoose your favorite, or call it a tie. If a build won't run, flag it unplayable — some builds are rough or broken, that's part of the eval.
- 4Reveal & repeatAfter you vote, both builds are revealed — then you go again. Your votes drive the leaderboard.