A data-driven prediction approach for sports team performance and its application to National Basketball Association

Abstract Performance prediction is an issue of vital importance in many real managerial applications. This paper will propose a prediction approach for sports team performance based on data envelopment analysis (DEA) methodology and data-driven technique. The proposed approach includes two steps: The first one conducts a multivariate logistic regression analysis to examine the relationship between the winning probability and game outcomes at the team-level. The other one addresses a DEA-based player portfolio efficiency analysis to optimally choose players and plan the playing time among players in the court. The second step aims to use players’ and team's historical data to train the future and obtain the most promising outcomes in terms of their average inefficiency status. Finally, we apply the proposed performance prediction approach to National Basketball Association and take Golden State Warriors as an example to illustrate its usefulness and efficacy. We obtain the prediction results for the 2015–16 regular season based on a four-season dataset from the 2011–12 season to the 2014–15 season. Further, we carry out multiple experiments to provide deeper discussion and analysis on according prediction results. It shows that the DEA-based data-driven approach can predict the sports team performance very well and can also provide interesting insights into the performance prediction problem.

[1]  Herman Stekler,et al.  Issues in Sports Forecasting , 2010 .

[2]  Qingyuan Zhu,et al.  Efficiency evaluation of parallel interdependent processes systems: an application to Chinese 985 Project universities , 2019 .

[3]  Andrew Kusiak,et al.  Data-driven smart manufacturing , 2018, Journal of Manufacturing Systems.

[4]  Anthony Constantinou,et al.  pi-football: A Bayesian network model for forecasting Association Football match outcomes , 2012, Knowl. Based Syst..

[5]  Boon L. Lee,et al.  A network DEA quantity and quality-orientated production model: An application to Australian university research services , 2016 .

[6]  Anthony Constantinou,et al.  Profiting from an inefficient association football gambling market: Prediction, risk and uncertainty using Bayesian networks , 2013, Knowl. Based Syst..

[7]  Cl Huang,et al.  The construction of production performance prediction system for semiconductor manufacturing with artificial neural networks , 1999 .

[8]  Li Zhang,et al.  NBA Game Result Prediction Using Feature Analysis and Machine Learning , 2019, Annals of Data Science.

[9]  Léopold Simar,et al.  Efficiency and benchmarking with directional distances: a data-driven approach , 2016, J. Oper. Res. Soc..

[10]  Yongjun Li,et al.  An alternative approach to decompose the potential gains from mergers , 2018, J. Oper. Res. Soc..

[11]  Vadlamani Ravi,et al.  Soft computing system for bank performance prediction , 2008, Appl. Soft Comput..

[12]  Sandro Radovanovic,et al.  Two-phased DEA-MLA approach for predicting efficiency of NBA players , 2014 .

[13]  Peter Omondi-Ochieng,et al.  Gross national income, football workers and national football team performances: A logistic regression analysis , 2015 .

[14]  Erik Strumbelj,et al.  Modeling basketball play-by-play data , 2016, Expert Syst. Appl..

[15]  Pierre Ouellette,et al.  Technological change and efficiency in the presence of quasi-fixed inputs: A DEA application to the hospital sector , 2004, Eur. J. Oper. Res..

[16]  D. Berri,et al.  A Re-Examination of Production Functions and Efficiency Estimates for the National Basketball Association , 2006 .

[17]  Kurt Hornik,et al.  Forecasting sports tournaments by ratings of (prob)abilities: A comparison for the EURO 2008 , 2010 .

[18]  Daniel Lapresa,et al.  Optimization of Game Formats in U-10 Soccer Using Logistic Regression Analysis , 2016, Journal of human kinetics.

[19]  Marcos Matabuena,et al.  A new approach to study the relative age effect with the use of additive logistic regression models: A case of study of FIFA football tournaments (1908-2012) , 2019, PloS one.

[20]  Kaoru Tone,et al.  A slacks-based measure of super-efficiency in data envelopment analysis , 2001, Eur. J. Oper. Res..

[21]  Muhammad Salman Mansoor,et al.  TECHNICAL AND SCALE EFFICIENCY OF ALL-TIME NBA LEADERS , 2018 .

[22]  Baghal Tarek,et al.  Are the "Four Factors" Indicators of One Factor? An Application of Structural Equation Modeling Methodology to NBA Data in Prediction of Winning Percentage , 2012 .

[23]  Lei Zhang,et al.  A data-driven approach for the optimisation of product specifications , 2018, Int. J. Prod. Res..

[24]  Virgílio A. F. Almeida,et al.  Forecasting in the NBA and other team sports: Network effects in action , 2012, TKDD.

[25]  Tao Ding,et al.  Resource sharing and payoff allocation in a three-stage system: Integrating network DEA with the Shapley value method , 2019, Omega.

[26]  M. Melnick,et al.  Relationship between Team Assists and Win-Loss Record in the National Basketball Association , 2001, Perceptual and motor skills.

[27]  Jerad H. Moxley,et al.  Predicting success in the National Basketball Association: Stability & potential , 2015 .

[28]  Sin-Jin Lin,et al.  Integrated news mining technique and AI-based mechanism for corporate performance forecasting , 2018, Inf. Sci..

[29]  Qingyuan Zhu,et al.  A simple MILP to determine closest targets in non-oriented DEA model satisfying strong monotonicity , 2017, Omega.

[30]  Monica Lam,et al.  Neural network techniques for financial performance prediction: integrating fundamental and technical analysis , 2004, Decis. Support Syst..

[31]  Sebastián Lozano,et al.  A network DEA assessment of team efficiency in the NBA , 2014, Ann. Oper. Res..

[32]  Behrouz Arabi,et al.  A new slacks-based measure of Malmquist–Luenberger index in the presence of undesirable outputs , 2015 .

[33]  Hajime Katayama,et al.  A game-level analysis of salary dispersion and team performance in the national basketball association , 2011 .

[34]  Hui Wang,et al.  Energy and CO2 emission performance in electricity generation: A non-radial directional distance function approach , 2012, Eur. J. Oper. Res..

[35]  Quan Zhang,et al.  DEA efficiency prediction based on IG–SVM , 2018, Neural Computing and Applications.

[36]  Zhi Xiao,et al.  The prediction for listed companies' financial distress by using multiple prediction methods with rough set and Dempster-Shafer evidence theory , 2012, Knowl. Based Syst..

[37]  Wellington K. Hsu,et al.  Athletic performance and career longevity following anterior cruciate ligament reconstruction in the National Basketball Association , 2017, Knee Surgery, Sports Traumatology, Arthroscopy.

[38]  Brad R. Humphreys,et al.  Sentiment Bias in National Basketball Association Betting , 2018 .

[39]  Xiaohong Chen,et al.  Profit inefficiency decomposition in a serial-structure system with resource sharing , 2019, J. Oper. Res. Soc..

[40]  Shawn L. Berman,et al.  Tacit Knowledge as a Source of Competitive Advantage in the National Basketball Association , 2002 .

[41]  Nasim Nourafza,et al.  A Review of Data Mining Techniques for Result Prediction in Sports , 2013 .

[42]  Qingyuan Zhu,et al.  Allocating a fixed cost across the decision making units with two-stage network structures , 2018, Omega.

[43]  P. Hopkins,et al.  Time trends for injuries and illness, and their relation to performance in the National Basketball Association. , 2015, Journal of science and medicine in sport.

[44]  Dan M. Frangopol,et al.  Optimal bridge maintenance planning based on probabilistic performance prediction , 2004 .

[45]  Desheng Dash Wu,et al.  Bi‐level Programing Merger Evaluation and Application to Banking Operations , 2016 .

[46]  Thomas Nikodelis,et al.  Classification of Soccer and Basketball Players’ Jumping Performance Characteristics: A Logistic Regression Approach , 2019, Sports.

[47]  Andrew C. Worthington,et al.  A note on the ‘Linsanity’ of measuring the relative efficiency of National Basketball Association guards , 2013 .

[48]  Kaoru Tone,et al.  An out-of-sample evaluation framework for DEA with application in bankruptcy prediction , 2017, Ann. Oper. Res..

[49]  Yongjun Li,et al.  Carbon emission abatement quota allocation in Chinese manufacturing industries: An integrated cooperative game data envelopment analysis approach , 2019, J. Oper. Res. Soc..

[50]  Feng Yang,et al.  Ranking of DMUs with interval cross-efficiencies based on absolute dominance , 2016, Int. J. Inf. Decis. Sci..

[51]  John S. Liu,et al.  A survey of DEA applications , 2013 .

[52]  Hyeonsang Eom,et al.  A compound framework for sports results prediction: A football case study , 2008, Knowl. Based Syst..

[53]  Claver Diallo,et al.  A data-driven approach to multi-product production network planning , 2017, Int. J. Prod. Res..

[54]  Kaoru Tone,et al.  DEA models incorporating uncertain future performance , 2016, Eur. J. Oper. Res..

[55]  Andrea Matta,et al.  A statistical framework of data-driven bottleneck identification in manufacturing systems , 2016 .

[56]  A. Charnes,et al.  Some Models for Estimating Technical and Scale Inefficiencies in Data Envelopment Analysis , 1984 .

[57]  Faisal Asghar,et al.  A NOVEL APPROACH TO RANKING NATIONAL BASKETBALL ASSOCIATION PLAYERS , 2018 .

[58]  William W. Cooper,et al.  Selecting non-zero weights to evaluate effectiveness of basketball players with DEA , 2009, Eur. J. Oper. Res..

[59]  Zhi Chen,et al.  A balanced data envelopment analysis cross-efficiency evaluation approach , 2018, Expert Syst. Appl..

[60]  Norman E. Fenton,et al.  Towards smart-data: Improving predictive accuracy in long-term football team performance , 2017, Knowl. Based Syst..

[61]  Jie Wu,et al.  DEA-based fixed cost allocation in two-stage systems: Leader-follower and satisfaction degree bargaining game approaches , 2020 .

[62]  Qingyuan Zhu,et al.  A new data envelopment analysis based approach for fixed cost allocation , 2019, Ann. Oper. Res..

[63]  Abhijit Chatterjee,et al.  Prediction of analog performance parameters using fast transienttesting , 2002, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..

[64]  Xiang Li,et al.  Evaluating NBA player performance using bounded integer data envelopment analysis , 2017, INFOR Inf. Syst. Oper. Res..

[65]  Aaron T. Scanlan,et al.  Physical Determinants of Division 1 Collegiate Basketball, Women's National Basketball League, and Women's National Basketball Association Athletes: With Reference to Lower-Body Sidedness , 2017, Journal of strength and conditioning research.

[66]  Sebastián Lozano Segura,et al.  Estimation of productivity change of NBA teams from 2006-07 to 2012-13 seasons , 2015 .

[67]  Jian-Bo Yang,et al.  Integrated bank performance assessment and management planning using hybrid minimax reference point - DEA approach , 2010, Eur. J. Oper. Res..

[68]  Toshiyuki Sueyoshi,et al.  Mixed integer programming approach of extended DEA-discriminant analysis , 2004, Eur. J. Oper. Res..

[69]  A. Nevill,et al.  Selected issues in the design and analysis of sport performance research , 2001, Journal of sports sciences.

[70]  Philippe du Jardin,et al.  Failure pattern-based ensembles applied to bankruptcy forecasting , 2018, Decis. Support Syst..

[71]  R. Shepherd Theory of cost and production functions , 1970 .

[72]  Eliane Gonçalves Gomes,et al.  Olympic ranking based on a zero sum gains DEA model , 2003, Eur. J. Oper. Res..

[73]  Min Yang,et al.  A DEA-based two-stage network approach for hotel performance analysis: An internal cooperation perspective , 2020 .

[74]  Mehdi Toloo,et al.  Performance assessment in production systems without explicit inputs: an application to basketball players , 2016 .

[75]  Robert Salomon,et al.  Quality, Subjectivity, and Sustained Superior Performance at the Olympic Games , 2015, Manag. Sci..

[76]  supFilipe M. Clemente,et al.  An Online Tactical Metrics Applied to Football Game , 2013 .

[77]  I. M. Premachandra,et al.  DEA as a tool for predicting corporate failure and success: A case of bankruptcy assessment , 2011 .

[78]  M. Farrell The Measurement of Productive Efficiency , 1957 .

[79]  Feng Li,et al.  Measuring environmental sustainability performance of freight transportation seaports in C hina: A data envelopment analysis approach based on the closest targets , 2020, Expert Syst. J. Knowl. Eng..

[80]  Peter Nyhuis,et al.  Applying Simulation and Analytical Models for Logistic Performance Prediction , 2005 .

[81]  Chih-Hai Yang,et al.  Measuring the efficiency of NBA teams: additive efficiency decomposition in two-stage DEA , 2014, Ann. Oper. Res..

[82]  Brandy Aven,et al.  The effects of individual status and group performance on network ties among teammates in the National Basketball Association , 2018, PloS one.

[83]  Micael S. Couceiro,et al.  The ARCANE Project: How an Ecological Dynamics Framework Can Enhance Performance Assessment and Prediction in Football , 2016, Sports Medicine.

[84]  Lidia Angulo Meza,et al.  Measuring the NBA Teams’ Cross-Efficiency by DEA Game , 2014 .

[85]  Qingyuan Zhu,et al.  Allocating a fixed cost based on a DEA-game cross efficiency approach , 2018, Expert Syst. Appl..

[86]  Lin Li,et al.  Data driven bottleneck detection of manufacturing systems , 2009 .

[87]  William L. Weber,et al.  A directional slacks-based measure of technical inefficiency , 2009 .

[88]  Alexandre Dolgui,et al.  Using common weights and efficiency invariance principles for resource allocation and target setting , 2017, Int. J. Prod. Res..

[89]  Alexandro Andrade,et al.  Sleep Quality, Mood and Performance: A Study of Elite Brazilian Volleyball Athletes. , 2016, Journal of sports science & medicine.

[90]  Sebastián Lozano,et al.  Dynamic Network DEA approach to basketball games efficiency , 2018, J. Oper. Res. Soc..

[91]  Sam Robertson,et al.  Explaining match outcome in elite Australian Rules football using team performance indicators , 2016, Journal of sports sciences.

[92]  James E. Payne,et al.  Measuring efficiency in the National Basketball Association1 , 1997 .

[93]  Qingyuan Zhu,et al.  Analysis of fire protection efficiency in the United States: a two-stage DEA-based approach , 2018, OR Spectr..

[94]  Niall M. Moyna,et al.  Team performance indicators which differentiate between winning and losing in elite Gaelic football , 2019, International Journal of Performance Analysis in Sport.

[95]  A. Leicht,et al.  Key team physical and technical performance indicators indicative of team quality in the soccer Chinese super league , 2018, Research in sports medicine.

[96]  Abraham Charnes,et al.  Measuring the efficiency of decision making units , 1978 .

[97]  Sergio José Ibáñez,et al.  THE USE OF CLASSIFICATION AND REGRESSION TREE WHEN CLASSIFYING WINNING AND LOSING BASKETBALL TEAMS , 2017 .