Effective injury forecasting in soccer with GPS training data and machine learning

Injuries have a great impact on professional soccer, due to their large influence on team performance and the considerable costs of rehabilitation for players. Existing studies in the literature provide just a preliminary understanding of which factors mostly affect injury risk, while an evaluation of the potential of statistical models in forecasting injuries is still missing. In this paper, we propose a multi-dimensional approach to injury forecasting in professional soccer that is based on GPS measurements and machine learning. By using GPS tracking technology, we collect data describing the training workload of players in a professional soccer club during a season. We then construct an injury forecaster and show that it is both accurate and interpretable by providing a set of case studies of interest to soccer practitioners. Our approach opens a novel perspective on injury prevention, providing a set of simple and practical rules for evaluating and interpreting the complex relations between injury risk and training performance in professional soccer.

[1]  Luca Pappalardo,et al.  The Haka network: Evaluating rugby team performance with dynamic graph analysis , 2016, 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[2]  Dino Pedreschi,et al.  The harsh rule of the goals: Data-driven performance indicators for football teams , 2015, 2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA).

[3]  Roger Font,et al.  Are There Potential Safety Problems Concerning the Use of Electronic Performance-Tracking Systems? The Experience of a Multisport Elite Club. , 2017, International journal of sports physiology and performance.

[4]  Tim J Gabbett,et al.  The training—injury prevention paradox: should athletes be training smarter and harder? , 2016, British Journal of Sports Medicine.

[5]  Erik E. Lehmann,et al.  What Does it Take to be a Star? - The Role of Performance and the Media for German Soccer Players , 2008 .

[6]  François-Xavier Li,et al.  Accumulated workloads and the acute:chronic workload ratio relate to injury risk in elite youth football players , 2016, British Journal of Sports Medicine.

[7]  G Atkinson,et al.  Monitoring Training in Elite Soccer Players: Systematic Bias between Running Speed and Metabolic Power Data , 2013, International Journal of Sports Medicine.

[8]  Mee Hong Ling,et al.  A Survey on Reinforcement Learning Models and Algorithms for Traffic Signal Control , 2017, ACM Comput. Surv..

[9]  Kewei Cheng,et al.  Feature Selection , 2016, ACM Comput. Surv..

[10]  Adam Bloniarz,et al.  Variable Importance Using Decision Trees , 2017, NIPS.

[11]  Tim J Gabbett,et al.  Relationships between training load, injury, and fitness in sub-elite collision sport athletes , 2007, Journal of sports sciences.

[12]  Martin Buchheit,et al.  Monitoring accelerations with GPS in football: time to slow down? , 2014, International journal of sports physiology and performance.

[13]  Tim J Gabbett,et al.  Spikes in acute workload are associated with increased injury risk in elite cricket fast bowlers , 2013, British Journal of Sports Medicine.

[14]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[15]  Daniel A. Keim,et al.  How to Make Sense of Team Sport Data: From Acquisition to Data Modeling and Research Aspects , 2017, Data.

[16]  Tim J Gabbett,et al.  The Development and Application of an Injury Prediction Model for Noncontact, Soft-Tissue Injuries in Elite Collision Sport Athletes , 2010, Journal of strength and conditioning research.

[17]  Haibo He,et al.  ADASYN: Adaptive synthetic sampling approach for imbalanced learning , 2008, 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence).

[18]  K E Webster,et al.  Effect of physiotherapy attendance on outcome after anterior cruciate ligament reconstruction: a pilot study , 2004, British Journal of Sports Medicine.

[19]  Trevor Hastie,et al.  An Introduction to Statistical Learning , 2013, Springer Texts in Statistics.

[20]  Giampietro Alberti,et al.  Characterization of In-season Elite Football Trainings by GPS Features: The Identity Card of a Short-Term Football Training Cycle , 2016, 2016 IEEE 16th International Conference on Data Mining Workshops (ICDMW).

[21]  Luca Pappalardo,et al.  Quantifying the relation between performance and success in soccer , 2017, 1705.00885.

[22]  Chris Visscher,et al.  Monitoring stress and recovery: new insights for the prevention of injuries and illnesses in elite youth soccer players , 2010, British Journal of Sports Medicine.

[23]  Joachim Gudmundsson,et al.  Spatio-Temporal Analysis of Team Sports , 2016, ACM Comput. Surv..

[24]  Charles W. Champ,et al.  A multivariate exponentially weighted moving average control chart , 1992 .

[25]  Martin Hägglund,et al.  Injuries affect team performance negatively in professional football: an 11-year follow-up of the UEFA Champions League injury study , 2013, British Journal of Sports Medicine.

[26]  T J Gabbett,et al.  Reductions in pre-season training loads reduce training injury rates in rugby league players , 2004, British Journal of Sports Medicine.

[27]  Doungkamol Sindhusake,et al.  GPS and Injury Prevention in Professional Soccer , 2016, Journal of strength and conditioning research.

[28]  Jiri Dvorak,et al.  Consensus statement on injury definitions and data collection procedures in studies of football (soccer) injuries , 2006, British Journal of Sports Medicine.

[29]  Tim J Gabbett,et al.  Relationship between training load and injury in professional rugby league players. , 2011, Journal of science and medicine in sport.

[30]  S. Doberstein,et al.  Impact of Training Patterns on Incidence of Illness and Injury During a Women's Collegiate Basketball Season , 2003, Journal of strength and conditioning research.

[31]  C. Foster,et al.  Monitoring training in athletes with reference to overtraining syndrome. , 1998, Medicine and science in sports and exercise.

[32]  Tim J Gabbett,et al.  Training and game loads and injury risk in elite Australian footballers. , 2014, Journal of strength and conditioning research.

[33]  Tim J Gabbett,et al.  Relationship Between Running Loads and Soft-Tissue Injury in Elite Team Sport Athletes , 2012, Journal of strength and conditioning research.

[34]  R Bahr,et al.  Methods for epidemiological study of injuries to professional football players: developing the UEFA model , 2005, British Journal of Sports Medicine.

[35]  Mitch J Duncan,et al.  Applying GPS to enhance understanding of transport-related physical activity. , 2009, Journal of science and medicine in sport.

[36]  Jiri Dvorak,et al.  Effective Injury Prevention in Soccer , 2010, The Physician and sportsmedicine.

[37]  Dino Pedreschi,et al.  "Engine Matters": A First Large Scale Data Driven Study on Cyclists' Performance , 2013, 2013 IEEE 13th International Conference on Data Mining Workshops.

[38]  P. D. di Prampero,et al.  Sprint running: a new energetic approach , 2005, Journal of Experimental Biology.

[39]  James M. Lucas,et al.  Exponentially weighted moving average control schemes: Properties and enhancements , 1990 .

[40]  Tim J Gabbett,et al.  Calculating acute:chronic workload ratios using exponentially weighted moving averages provides a more sensitive indicator of injury likelihood than rolling averages , 2016, British Journal of Sports Medicine.

[41]  Olivia A. Hurley Impact of Player Injuries on Teams' Mental States, and Subsequent Performances, at the Rugby World Cup 2015 , 2016, Front. Psychol..

[42]  Aixia Guo,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2014 .

[43]  Daniel Medina,et al.  From Training to Match Performance: A Predictive and Explanatory Study on Novel Tracking Data , 2016, 2016 IEEE 16th International Conference on Data Mining Workshops (ICDMW).

[44]  Paolo Menaspà,et al.  Are rolling averages a good way to assess training load for injury prevention? , 2016, British Journal of Sports Medicine.

[45]  Massimo Venturelli,et al.  Injury risk factors in young soccer players detected by a multivariate survival model. , 2011, Journal of science and medicine in sport.

[46]  Alexander J. Casson,et al.  Description of a Database Containing Wrist PPG Signals Recorded during Physical Exercise with Both Accelerometer and Gyroscope Measures of Motion , 2017, Data.