Supervised Learning for Table Tennis Match Prediction

Machine learning, classification and prediction models have applications across a range of fields. Sports analytics is an increasingly popular application, but most existing work focuses on automated refereeing in mainstream sports and on injury prevention. Research on other sports, such as table tennis, has only recently started to gain traction. This paper proposes the use of machine learning to predict the outcome of table tennis singles matches. We use player and match statistics as features and evaluate their relative importance in an ablation study. We explored a number of popular supervised learning models and found that 5-fold cross-validation and hyperparameter tuning were crucial to improving model performance. To demonstrate the robustness of the models, the ablation study also compared different feature aggregation strategies. The models performed comparably, with accuracies of 61-70% that match state-of-the-art models in comparable sports, such as tennis. The results can serve as a baseline for future table tennis prediction models and can feed back into prediction research on similar ball sports.
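As an illustration of the evaluation protocol named in the abstract, the following is a minimal sketch of 5-fold cross-validation combined with a small hyperparameter grid search. It is not the authors' pipeline: the synthetic data, the two feature names (`rating_diff`, `form_diff`), the choice of logistic regression, and the L2 grid are all assumptions made for the example, and only the Python standard library is used.

```python
import math
import random


def make_synthetic_matches(n, seed=0):
    """Hypothetical data: two feature differences (player A minus player B) and a win label."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        rating_diff = rng.gauss(0, 1)  # assumed feature, not from the paper
        form_diff = rng.gauss(0, 1)    # assumed feature, not from the paper
        # Assumed ground truth: the stronger, in-form player wins more often.
        p_win = 1 / (1 + math.exp(-(1.0 * rating_diff + 0.5 * form_diff)))
        X.append([rating_diff, form_diff])
        y.append(1 if rng.random() < p_win else 0)
    return X, y


def sigmoid(z):
    z = max(-30.0, min(30.0, z))  # clamp to avoid overflow in exp
    return 1 / (1 + math.exp(-z))


def fit_logistic(X, y, l2, epochs=100, lr=0.1):
    """Batch gradient descent for L2-regularised logistic regression."""
    w, b, n = [0.0] * len(X[0]), 0.0, len(X)
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * len(w), 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) - yi
            for j, xj in enumerate(xi):
                grad_w[j] += err * xj
            grad_b += err
        w = [wj - lr * (gj / n + l2 * wj) for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def accuracy(model, X, y):
    w, b = model
    hits = sum(
        (sum(wj * xj for wj, xj in zip(w, xi)) + b > 0) == (yi == 1)
        for xi, yi in zip(X, y)
    )
    return hits / len(y)


def k_fold_indices(n, k, seed=0):
    """Shuffle the indices once and split them into k disjoint folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


def cross_val_accuracy(X, y, l2, k=5):
    """Mean held-out accuracy over k folds for one hyperparameter value."""
    scores = []
    for held_out in k_fold_indices(len(X), k):
        held = set(held_out)
        train = [i for i in range(len(X)) if i not in held]
        model = fit_logistic([X[i] for i in train], [y[i] for i in train], l2)
        scores.append(accuracy(model, [X[i] for i in held_out], [y[i] for i in held_out]))
    return sum(scores) / k


X, y = make_synthetic_matches(300)
grid = [0.0, 0.01, 0.1, 1.0]  # assumed candidate L2 strengths
cv_scores = {l2: cross_val_accuracy(X, y, l2) for l2 in grid}
best_l2 = max(cv_scores, key=cv_scores.get)
print("mean 5-fold accuracy per L2 value:", cv_scores)
print("selected L2 strength:", best_l2)
```

Every candidate value is scored on the same five folds, so differences in the mean accuracies reflect the hyperparameter rather than the split. In practice, a library such as scikit-learn offers equivalent cross-validation and grid-search utilities.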
