Automatic machine-learning based identification of jogging periods from accelerometer measurements of adolescents under field conditions

Background: The assessment of health benefits associated with physical activity depends on the duration, intensity and frequency of the activity, so identifying these correctly is valuable in epidemiological and clinical studies. This study has two aims: to develop an algorithm for automatic identification of intended jogging periods, and to assess whether identification performance improves when two accelerometers, at the hip and ankle, are used instead of one at either position.

Methods: The study used diarized jogging periods and the corresponding accelerometer data from thirty-nine 15-year-old adolescents, collected under field conditions as part of the GINIplus study. The data were obtained from two accelerometers, placed at the hip and ankle. An automated feature engineering technique was used to extract features from the raw accelerometer readings and to select a subset of the most significant features. Four machine learning algorithms were used for classification: Logistic Regression, Support Vector Machines, Random Forest and Extremely Randomized Trees. Classification was performed using data from the hip accelerometer only, from the ankle accelerometer only, and from both accelerometers.

Results: The reported jogging periods were verified by visual inspection and used as the gold standard. After feature selection and tuning of the classification algorithms, all options achieved a classification accuracy of at least 0.99, regardless of whether the segmentation used sliding windows of 60 s or 180 s. The best matching ratio, i.e. the length of correctly identified jogging periods relative to the total time including the missed ones, was up to 0.875. Post-classification rules that considered the duration of breaks and jogging periods improved it further, up to 0.967. There was no obvious benefit from using two accelerometers; almost the same performance was achieved from either accelerometer position.

Conclusions: Machine learning techniques can be used for automatic activity recognition: they recognize activities very accurately, and significantly more accurately than a diary. Jogging periods in adolescents can be identified using only one accelerometer. In terms of performance, there is no significant benefit from using accelerometers at both locations.
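
The abstract does not spell out the post-classification rules or the exact formula behind the matching ratio. The Python sketch below shows one way rules based on the duration of breaks and jogging periods could be applied to per-window predictions. The function names, the thresholds `max_break` and `min_run`, the order of the two rules, and the reading of the matching ratio as correctly identified jogging time divided by reported jogging time are all assumptions made for illustration, not the authors' implementation.

```python
import numpy as np


def runs(labels):
    """Return (start, end, value) for each run of equal labels; end is exclusive."""
    labels = np.asarray(labels, dtype=int)
    if labels.size == 0:
        return []
    change = np.flatnonzero(np.diff(labels)) + 1  # indices where the label changes
    starts = np.concatenate(([0], change))
    ends = np.concatenate((change, [labels.size]))
    return [(int(s), int(e), int(labels[s])) for s, e in zip(starts, ends)]


def smooth(pred, max_break=2, min_run=3):
    """Apply two hypothetical duration rules to binary window labels (1 = jogging).

    Rule 1: a break of at most `max_break` windows between two jogging runs
            is relabelled as jogging.
    Rule 2: a jogging run shorter than `min_run` windows is relabelled as
            not jogging.
    Both thresholds are illustrative values, not taken from the study.
    """
    out = np.asarray(pred, dtype=int).copy()
    for s, e, v in runs(out):
        # Interior runs of 0 are always surrounded by jogging windows.
        if v == 0 and s > 0 and e < out.size and e - s <= max_break:
            out[s:e] = 1
    for s, e, v in runs(out):
        if v == 1 and e - s < min_run:
            out[s:e] = 0
    return out


def matching_ratio(truth, pred):
    """Assumed reading: correctly identified jogging windows / reported jogging windows."""
    truth = np.asarray(truth, dtype=bool)
    pred = np.asarray(pred, dtype=bool)
    reported = truth.sum()
    return float((truth & pred).sum() / reported) if reported else float("nan")


# Toy example: one reported jogging period of six windows.
truth = [0, 0, 1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
pred = [0, 0, 1, 1, 0, 1, 1, 1, 0, 0, 0, 1, 0, 0]  # one missed window, one spurious window

print(matching_ratio(truth, pred))          # ratio before the rules
print(matching_ratio(truth, smooth(pred)))  # ratio after the rules
```

In this toy example the first rule closes the one-window gap inside the jogging period and the second rule removes the isolated one-window detection, which is the kind of correction that could raise a matching ratio without changing the classifier itself.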
