Bayesian models for prediction of the set-difference in volleyball

The aim of this paper is to study and develop Bayesian models for the analysis of volleyball match outcomes as recorded by the set-difference. Due to the peculiarity of the outcome variable (set-difference) which takes discrete values from $-3$ to $3$, we cannot consider standard models based on the usual Poisson or binomial assumptions used for other sports such as football/soccer. Hence, the first and foremost challenge was to build models appropriate for the set-differences of each volleyball match. Here we consider two major approaches: a) an ordered multinomial logistic regression model and b) a model based on a truncated version of the Skellam distribution. For the first model, we consider the set-difference as an ordinal response variable within the framework of multinomial logistic regression models. Concerning the second model, we adjust the Skellam distribution in order to account for the volleyball rules. We fit and compare both models with the same covariate structure as in Karlis & Ntzoufras (2003). Both models are fitted, illustrated and compared within Bayesian framework using data from both the regular season and the play-offs of the season 2016/17 of the Greek national men's volleyball league A1.

[1]  Andrea Gabrio,et al.  Bayesian hierarchical models for the prediction of volleyball results , 2019, Journal of applied statistics.

[2]  Aki Vehtari,et al.  Practical Bayesian model evaluation using leave-one-out cross-validation and WAIC , 2015, Statistics and Computing.

[3]  D. Karlis,et al.  Bayesian analysis of the differences of count data , 2006, Statistics in medicine.

[4]  Alan Agresti,et al.  Categorical Data Analysis , 2003 .

[5]  WatanabeSumio Asymptotic Equivalence of Bayes Cross Validation and Widely Applicable Information Criterion in Singular Learning Theory , 2010 .

[6]  Christian P. Robert,et al.  Convergence Monitoring and Adaptation for MCMC Algorithms , 2010 .

[7]  Sumio Watanabe,et al.  Asymptotic Equivalence of Bayes Cross Validation and Widely Applicable Information Criterion in Singular Learning Theory , 2010, J. Mach. Learn. Res..

[8]  Franz J. Király,et al.  Modeling outcomes of soccer matches , 2018, Machine Learning.

[9]  Adrian E. Raftery,et al.  Bayesian Model Averaging: A Tutorial , 2016 .

[10]  Ioannis Ntzoufras,et al.  Bayesian Analysis of Skills Importance in World Champions Men’s Volleyball across Ages , 2019, Int. J. Comput. Sci. Sport.

[11]  Thorsten Gerber,et al.  Handbook Of Mathematical Functions , 2016 .

[12]  J. O. Irwin,et al.  The Frequency Distribution of the Difference between Two Independent Variates Following the Same Poisson Distribution , 1937 .

[13]  Dimitris Karlis,et al.  Bayesian modelling of football outcomes: using the Skellam's distribution for the goal difference , 2008 .

[15]  D. Karlis,et al.  Analysis of sports data by using bivariate Poisson models , 2003 .

[16]  Cengiz Akarçeşme Is it Possible to Estimate Match Result in Volleyball: A new Prediction Model , 2017 .

[17]  Bradley P. Carlin,et al.  Bayesian measures of model complexity and fit , 2002 .

[18]  Radford M. Neal MCMC Using Hamiltonian Dynamics , 2011, 1206.1901.

[19]  Andrew Gelman,et al.  The No-U-turn sampler: adaptively setting path lengths in Hamiltonian Monte Carlo , 2011, J. Mach. Learn. Res..

[20]  G W Fellingham,et al.  Developing an optimal scoring system with a special emphasis on volleyball. , 1994, Research quarterly for exercise and sport.

[21]  Abdullah Erdal Tümer,et al.  Prediction of team league’s rankings in volleyball by artificial neural network method , 2017 .

[22]  Adrian E. Raftery,et al.  [Practical Markov Chain Monte Carlo]: Comment: One Long Run with Diagnostics: Implementation Strategies for Markov Chain Monte Carlo , 1992 .

[23]  Gilbert W. Fellingham,et al.  Skill Evaluation in Women's Volleyball , 2008 .

[24]  G. Fellingham,et al.  Skill Importance in Women's Volleyball , 2010 .

[25]  I. Mesquita,et al.  Tactical determinants of setting zone in elite men's volleyball. , 2012, Journal of sports science & medicine.

[26]  J. G. Skellam The frequency distribution of the difference between two Poisson variates belonging to different populations. , 1946, Journal of the Royal Statistical Society. Series A.

[27]  Jiqiang Guo,et al.  Stan: A Probabilistic Programming Language. , 2017, Journal of statistical software.

[28]  F. Pukelsheim The Three Sigma Rule , 1994 .

[29]  Alan J. Lee Modeling Scores in the Premier League: Is Manchester United Really the Best? , 1997 .

[30]  Ioannis Ntzoufras,et al.  A Bayesian Quest for Finding a Unified Model for Predicting Volleyball Games. , 2019 .

[31]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[32]  Matthew Heiner,et al.  Skill importance in women’s soccer , 2014 .