The ChessAIThon project (2025-1-ES01-KA220-VET-000354329) is co-funded by the European Union. The views and opinions expressed in this publication are those of the author(s) only and do not necessarily reflect those of the European Union or the Spanish Service for the Internationalisation of Education (SEPIE). Neither the European Union nor the National Agency SEPIE can be held responsible for them.
Table of Contents
A fair and meaningful AI chess competition depends on a clear system for evaluating performance, awarding points, and upholding ethical conduct. The rules governing wins, losses, scoring, tie-breaks, and fair play ensure that each match is judged consistently and that every student competes under the same transparent standards. These systems allow the event to function not merely as a contest, but as an educational experience in which responsibility, honesty, and technical rigor take center stage.
The foundation of evaluation lies in distinguishing between standard game outcomes—such as wins by checkmate or draws through stalemate—and results influenced by technical behaviors unique to AI competition. Student-created engines may crash, send malformed moves, exceed time limits, or violate communication protocols. Clear rules for handling these situations ensure that all AIs are held to the same expectations. A timeout, illegal move, or crash is recorded as a loss, not as a failure of the student but as a natural consequence of testing software in a realistic environment. These rules reinforce valuable lessons about stability, precision, and thorough testing.
Scoring systems build upon these outcomes to create a fair and intuitive method for ranking teams. Traditional chess scoring—one point for a win, half for a draw, zero for a loss—offers consistency and familiarity. Yet in a competitive field where multiple teams often finish with identical scores, ties must be resolved using established tie-break formulas. Systems such as Buchholz, head-to-head comparison, and Sonneborn–Berger reward consistent performance against strong opponents and provide a transparent way to separate closely matched teams. When special cases arise—such as simultaneous failures—the competition relies on predefined rules to ensure fairness and avoid subjective decisions.