Statistical machine learning of sleep and physical activity phenotypes from sensor data in 96,220 UK Biobank participants

Current public health guidelines on physical activity and sleep duration are limited by a reliance on subjective self-reported evidence. Using data from simple wrist-worn activity monitors, we developed a tailored machine learning model, using balanced random forests with Hidden Markov Models, to reliably detect a number of activity modes. We show that physical activity and sleep behaviours can be classified with 87% accuracy in 159,504 minutes of recorded free-living behaviours from 132 adults. These trained models can be used to infer fine resolution activity patterns at the population scale in 96,220 participants. For example, we find that men spend more time in both low- and high- intensity behaviours, while women spend more time in mixed behaviours. Walking time is highest in spring and sleep time lowest during the summer. This work opens the possibility of future public health guidelines informed by the health consequences associated with specific, objectively measured, physical activity and sleep behaviours.

[1]  Dinesh John,et al.  Performance of Activity Classification Algorithms in Free-Living Older Adults. , 2016, Medicine and science in sports and exercise.

[2]  Alex Bateman,et al.  An introduction to hidden Markov models. , 2007, Current protocols in bioinformatics.

[3]  Munni Begum,et al.  Comparison of linear and non-linear models for predicting energy expenditure from raw accelerometer data , 2017, Physiological measurement.

[4]  Bronwyn K. Clark,et al.  Field evaluation of a random forest activity classifier for wrist-worn accelerometer data. , 2016, Journal of science and medicine in sport.

[5]  S. Marshall,et al.  An ethical framework for automated, wearable cameras in health behavior research. , 2013, American journal of preventive medicine.

[6]  Gert R. G. Lanckriet,et al.  Hip and Wrist Accelerometer Algorithms for Free-Living Behavior Classification. , 2016, Medicine and science in sports and exercise.

[7]  Nicola Orsini,et al.  Systematic review and meta-analysis of reduction in all-cause mortality from walking and cycling and shape of dose response relationship , 2014, International Journal of Behavioral Nutrition and Physical Activity.

[8]  Ulf Ekelund,et al.  Physical activity levels in three Brazilian birth cohorts as assessed with raw triaxial wrist accelerometry , 2014, International journal of epidemiology.

[9]  J. Staudenmayer,et al.  Classification accuracy of the wrist-worn gravity estimator of normal everyday activity accelerometer. , 2013, Medicine and science in sports and exercise.

[10]  Cathie Sudlow,et al.  UK Biobank: opportunities for cardiovascular research , 2017, European heart journal.

[11]  Gert R. G. Lanckriet,et al.  Objective Assessment of Physical Activity: Classifiers for Public Health. , 2016, Medicine and science in sports and exercise.

[12]  K. Khunti,et al.  Sedentary time in adults and the association with diabetes, cardiovascular disease and death: systematic review and meta-analysis , 2012, Diabetologia.

[13]  M. Delgado-Rodríguez,et al.  Systematic review and meta-analysis. , 2017, Medicina intensiva.

[14]  Shahram Izadi,et al.  SenseCam: A Retrospective Memory Aid , 2006, UbiComp.

[15]  David R Bassett,et al.  2011 Compendium of Physical Activities: a second update of codes and MET values. , 2011, Medicine and science in sports and exercise.

[16]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[17]  Jr. G. Forney,et al.  Viterbi Algorithm , 1973, Encyclopedia of Machine Learning.

[18]  J. Leskovec,et al.  Large-scale physical activity data reveal worldwide activity inequality , 2017, Nature.

[19]  John Staudenmayer,et al.  An artificial neural network to estimate physical activity energy expenditure and identify physical activity type from an accelerometer. , 2009, Journal of applied physiology.

[20]  Michael Catt,et al.  A Novel, Open Access Method to Assess Sleep Duration Using a Wrist-Worn Accelerometer , 2015, PloS one.

[21]  Richard P Troiano,et al.  Evolution of accelerometer methods for physical activity research , 2014, British Journal of Sports Medicine.

[22]  Andrea Mannini,et al.  Activity recognition using a single accelerometer placed at the wrist or ankle. , 2013, Medicine and science in sports and exercise.

[23]  T. Vasankari,et al.  A universal, accurate intensity‐based classification of different physical activities using raw data of accelerometer , 2015, Clinical physiology and functional imaging.

[24]  Steve E Hodges,et al.  Wearable cameras in health: the state of the art and future possibilities. , 2013, American journal of preventive medicine.

[25]  Nils Y. Hammerla,et al.  Large Scale Population Assessment of Physical Activity Using Wrist Worn Accelerometers: The UK Biobank Study , 2017, PloS one.

[26]  Michael Catt,et al.  Validation of the GENEA Accelerometer. , 2011, Medicine and science in sports and exercise.

[27]  Aiden R Doherty,et al.  Automatically assisting human memory: A SenseCam browser , 2011, Memory.

[28]  G Plasqui,et al.  Improving assessment of daily energy expenditure by identifying types of physical activity with a single accelerometer. , 2009, Journal of applied physiology.

[29]  Hannah Badland,et al.  Using wearable cameras to categorise type and context of accelerometer-identified episodes of physical activity , 2013, International Journal of Behavioral Nutrition and Physical Activity.

[30]  Jonathan Lester,et al.  New horizons in sensor development. , 2012, Medicine and science in sports and exercise.

[31]  Chao Chen,et al.  Using Random Forest to Learn Imbalanced Data , 2004 .

[32]  Joss Langford,et al.  Autocalibration of accelerometer data for free-living physical activity assessment using local gravity and temperature: an evaluation on four continents , 2014, Journal of applied physiology.

[33]  V. Preedy,et al.  Prospective Cohort Study , 2010 .

[34]  Paul Welsh,et al.  Association between active commuting and incident cardiovascular disease, cancer, and mortality: prospective cohort study , 2017, British Medical Journal.

[35]  Paul Kelly,et al.  Developing a Method to Test the Validity of 24 Hour Time Use Diaries Using Wearable Cameras: A Feasibility Pilot , 2015, PloS one.

[36]  Senén Barro,et al.  Do we need hundreds of classifiers to solve real world classification problems? , 2014, J. Mach. Learn. Res..

[37]  S. Blair,et al.  Effect of physical inactivity on major non-communicable diseases worldwide: an analysis of burden of disease and life expectancy , 2012, BDJ.

[38]  Gert R. G. Lanckriet,et al.  A random forest classifier for the prediction of energy expenditure and type of physical activity from wrist and hip accelerometers , 2014, Physiological measurement.

[39]  N. Wareham,et al.  Estimation of Physical Activity Energy Expenditure during Free-Living from Wrist Accelerometry in UK Adults , 2016, PloS one.

[40]  Ulf Ekelund,et al.  Guide to the assessment of physical activity: Clinical and research applications: a scientific statement from the American Heart Association. , 2013, Circulation.

[41]  G. M. Allan,et al.  Kappa statistic , 2005, Canadian Medical Association Journal.

[42]  A. Viera,et al.  Understanding interobserver agreement: the kappa statistic. , 2005, Family medicine.

[43]  Philipp Scholl,et al.  Towards Benchmarked Sleep Detection with Wrist-Worn Sensing Units , 2014, 2014 IEEE International Conference on Healthcare Informatics.

[44]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[45]  Archana Singh-Manoux,et al.  Accelerometer assessed moderate-to-vigorous physical activity and successful ageing: results from the Whitehall II study , 2017, Scientific Reports.

[46]  Charles E Matthews,et al.  Comparative validity of physical activity measures in older adults. , 2011, Medicine and science in sports and exercise.

[47]  Gert R. G. Lanckriet,et al.  Multi-sensor physical activity recognition in free-living , 2014, UbiComp Adjunct.

[48]  Archana Singh-Manoux,et al.  Association Between Questionnaire- and Accelerometer-Assessed Physical Activity: The Role of Sociodemographic Factors , 2014, American journal of epidemiology.

[49]  M. Jokela,et al.  RESULTS FROM THE WHITEHALL II STUDY , 2009 .

[50]  S. Marshall,et al.  Using the SenseCam to improve classifications of sedentary behavior in free-living settings. , 2013, American journal of preventive medicine.

[51]  Tina L Hurst,et al.  Physical activity classification using the GENEA wrist-worn accelerometer. , 2012, Medicine and science in sports and exercise.

[52]  Aiden R. Doherty,et al.  High group level validity but high random error of a self-report travel diary, as assessed by wearable cameras , 2014 .

[53]  Søren Brage,et al.  Impact of study design on development and evaluation of an activity-type classifier. , 2013, Journal of applied physiology.

[54]  P. Elliott,et al.  UK Biobank: An Open Access Resource for Identifying the Causes of a Wide Range of Complex Diseases of Middle and Old Age , 2015, PLoS medicine.

[55]  F. Hu,et al.  Sleep Duration and Risk of Type 2 Diabetes: A Meta-analysis of Prospective Studies , 2015, Diabetes Care.