A Discrete Evolutionary Model for Chess Players' Ratings

The Elo system for rating chess players, also used in other games and sports, was adopted by the World Chess Federation over four decades ago. Although not without controversy, it is accepted as generally reliable and provides a method for assessing players' strengths and ranking them in official tournaments. It is generally accepted that the distribution of players' rating data is approximately normal but, to date, no stochastic model of how the distribution might have arisen has been proposed. We propose such an evolutionary stochastic model, which models the arrival of players into the rating pool, the games they play against each other, and how the results of these games affect their ratings, in a similar manner to the Elo system. Using a continuous approximation to the discrete model, we derive the distribution for players' ratings at time t as a normal distribution, where the variance increases in time as a logarithmic function of t. We validate the model using published rating data from 2007-2010, showing that the parameters obtained from the data can be recovered through simulations of the stochastic model. The distribution of players' ratings is only approximately normal and has been shown to have a small negative skew. We show how to modify our evolutionary stochastic model to take this skewness into account, and we validate the modified model using the published official rating data.

[1]  Neil Charness,et al.  Participation Rates and Maximal Performance: A Log-Linear Explanation for Group Differences, Such as Russian and Male Dominance in Chess , 1996 .

[2]  R. A. Bradley,et al.  RANK ANALYSIS OF INCOMPLETE BLOCK DESIGNS , 1952 .

[3]  R. Henery,et al.  An extension to the Thurstone-Mosteller model for chess , 1992 .

[4]  K. Dill,et al.  Molecular driving forces , 2002 .

[5]  Tom Minka,et al.  TrueSkillTM: A Bayesian Skill Rating System , 2006, NIPS.

[6]  M. Opper,et al.  Advanced mean field methods: theory and practice , 2001 .

[7]  Mark Levene,et al.  A stochastic model for the evolution of the Web , 2002, Comput. Networks.

[8]  Thomas Hofmann,et al.  TrueSkill™: A Bayesian Skill Rating System , 2007 .

[9]  Rating systems based on paired comparison models , 1991 .

[10]  P. McLeod,et al.  Why are (the best) women so good at chess? Participation rates and gender differences in intellectual domains , 2009, Proceedings of the Royal Society B: Biological Sciences.

[11]  K. Dill,et al.  Molecular driving forces : statistical thermodynamics in chemistry and biology , 2002 .

[12]  Hal S. Stern,et al.  Designing a College Football Playoff System , 1999 .

[13]  R. A. Bradley,et al.  RANK ANALYSIS OF INCOMPLETE BLOCK DESIGNS THE METHOD OF PAIRED COMPARISONS , 1952 .

[14]  George H. Weiss,et al.  Book Review: Elements of the Random Walk: An Introduction for Advanced Students and Researchers , 2005 .

[15]  Joseph Rudnick,et al.  Elements of the random walk , 2004 .

[16]  H. A. David,et al.  The Method of Paired Comparisons (2nd ed.). , 1989 .

[17]  R. E. Wheeler Statistical distributions , 1983, APLQ.

[18]  M. Glickman Parameter Estimation in Large Dynamic Paired Comparison Experiments , 1999 .

[19]  P. Moran On the method of paired comparisons. , 1947, Biometrika.

[20]  K. Fernow New York , 1896, American Potato Journal.