Testing and ranking on round-robin design for data sport analytics with application to basketball

By modelling results of sport matches as a set of paired fixed effect linear models, the goal of the present article is showing that traditional scoring outputs can be used to do inference on parameters related to the net relative strength or weakness of teams within a league. As hypothesis testing method, we propose either a normal-based and a non-parametric permutation-based approach. As an extension to round-robin of the ranking methodology recently proposed by Arboretti Giancristofaro et al. (2014) and Corain et al. (2016), results of pairwise testing are then exploited to provide a ranking of teams within a league. Through an extensive Monte Carlo simulation study, we investigated the properties of the proposed testing and ranking methodology where we proved its validity under different random distributions. In its simplest univariate version, the proposed methodology allows us to infer on the teams average net scoring within a league, while in its more intriguing multivariate layout it is suitable for looking for any team-related global dominance using a wide set of performance indicators. Finally, by using traditional basketball box scores, we present an application to the Italian Basket League.

[1]  Satoshi Usami,et al.  Bayesian longitudinal paired comparison model and its application to sports data using weighted likelihood bootstrap , 2017, Commun. Stat. Simul. Comput..

[2]  Luigi Salmaso,et al.  Multivariate And Multistrata Nonparametric Tests: The NonParametric Combination Method , 2004 .

[3]  Luigi Salmaso,et al.  A Non-parametric Method for Defining a Global Preference Ranking of Industrial Products , 2007 .

[4]  Luigi Salmaso,et al.  Permutation Tests for Complex Data , 2010 .

[5]  David A. Harville,et al.  The Home-Court Advantage: How Large is it, and Does it Vary from Team to Team? , 1994 .

[6]  Kirk Goldsberry,et al.  A Multiresolution Stochastic Process Model for Predicting Basketball Possession Outcomes , 2014, 1408.0777.

[7]  L. Bornn,et al.  Characterizing the spatial structure of defensive skill in professional basketball , 2014, 1405.0231.

[8]  C. Varin,et al.  Dynamic Bradley–Terry modelling of sports tournaments , 2013 .

[9]  Rosa Arboretti Giancristofaro,et al.  A Comparison among Combination-Based Permutation Statistics for Randomized Complete Block Design , 2012, Commun. Stat. Simul. Comput..

[10]  John Goddard,et al.  Forecasting football results and the efficiency of fixed‐odds betting , 2004 .

[11]  R. A. Bradley,et al.  RANK ANALYSIS OF INCOMPLETE BLOCK DESIGNS THE METHOD OF PAIRED COMPARISONS , 1952 .

[12]  Luigi Salmaso,et al.  Improving power of multivariate combination-based permutation tests , 2015, Stat. Comput..

[13]  Michael A. Trick,et al.  Round robin scheduling - a survey , 2008, Eur. J. Oper. Res..

[14]  Hal S. Stern,et al.  Who's Number 1 in College Football? … And How Might We Decide? , 1995 .

[15]  Luigi Salmaso,et al.  A Critical Review and a Comparative Study on Conditional Permutation Tests for Two-Way ANOVA , 2007, Commun. Stat. Simul. Comput..

[16]  Luigi Salmaso,et al.  A permutation approach for ranking of multivariate populations , 2014, J. Multivar. Anal..

[17]  Manuela Cattelan,et al.  Models for Paired Comparison Data: A Review with Emphasis on Dependent Data , 2012, 1210.1016.

[18]  Todd C. Headrick,et al.  Simulating multivariate g-and-h distributions. , 2010, The British journal of mathematical and statistical psychology.

[19]  Tao Chen,et al.  A functional data approach to model score difference process in professional basketball games , 2018 .

[20]  D. Karlis,et al.  Analysis of sports data by using bivariate Poisson models , 2003 .

[21]  Steffen Nestler,et al.  Restricted Maximum Likelihood Estimation for Parameters of the Social Relations Model , 2015, Psychometrika.

[22]  Luigi Salmaso,et al.  Nonparametric permutation and combination-based multivariate control charts with applications in microelectronics , 2013 .

[23]  D. Harville,et al.  The Selection or Seeding of College Basketball or Football Teams for Postseason Competition , 2003 .

[24]  S. N. Roy On a Heuristic Method of Test Construction and its use in Multivariate Analysis , 1953 .

[25]  Michael Schomaker,et al.  Journal of Quantitative Analysis in Sports Model Averaging in Factor Analysis : An Analysis of Olympic Decathlon Data , 2011 .

[26]  Stefano Bonnini,et al.  Advantages of the Closed Testing Method in Multiple Comparisons Procedures , 2012, Commun. Stat. Simul. Comput..

[27]  R. A. Giancristofaro,et al.  The Multivariate Randomized Complete Block Design: A Novel Permutation Solution in Case of Ordered Categorical Variables , 2012 .

[28]  Erik Strumbelj,et al.  Modeling basketball play-by-play data , 2016, Expert Syst. Appl..

[29]  David L. Gibbs,et al.  Combining Dependent P-values with an Empirical Adaptation of Brown’s Method , 2015, bioRxiv.