Scalable Passive Sleep Monitoring Using Mobile Phones: Opportunities and Obstacles

Background Sleep is a critical aspect of people’s well-being and as such assessing sleep is an important indicator of a person’s health. Traditional methods of sleep assessment are either time- and resource-intensive or suffer from self-reporting biases. Recently, researchers have started to use mobile phones to passively assess sleep in individuals’ daily lives. However, this work remains in its early stages, having only examined relatively small and homogeneous populations in carefully controlled contexts. Thus, it remains an open question as to how well mobile device-based sleep monitoring generalizes to larger populations in typical use cases. Objective The aim of this study was to assess the ability of machine learning algorithms to detect the sleep start and end times for the main sleep period in a 24-h cycle using mobile devices in a diverse sample. Methods We collected mobile phone sensor data as well as daily self-reported sleep start and end times from 208 individuals (171 females; 37 males), diverse in age (18−66 years; mean 39.3), education, and employment status, across the United States over 6 weeks. Sensor data consisted of geographic location, motion, light, sound, and in-phone activities. No specific instructions were given to the participants regarding phone placement. We used random forest classifiers to develop both personalized and global predictors of sleep state from the phone sensor data. Results Using all available sensor features, the average accuracy of classifying whether a 10-min segment was reported as sleep was 88.8%. This is somewhat better than using the time of day alone, which gives an average accuracy of 86.9%. The accuracy of the model considerably varied across the participants, ranging from 65.1% to 97.3%. We found that low accuracy in some participants was due to two main factors: missing sensor data and misreports. After correcting for these, the average accuracy increased to 91.8%, corresponding to an average median absolute deviation (MAD) of 38 min for sleep start time detection and 36 min for sleep end time. These numbers are close to the range reported by previous research in more controlled situations. Conclusions We find that mobile phones provide adequate sleep monitoring in typical use cases, and that our methods generalize well to a broader population than has previously been studied. However, we also observe several types of data artifacts when collecting data in uncontrolled settings. Some of these can be resolved through corrections, but others likely impose a ceiling on the accuracy of sleep prediction for certain subjects. Future research will need to focus more on the understanding of people’s behavior in their natural settings in order to develop sleep monitoring tools that work reliably in all cases for all people.

[1]  Paul J. Rathouz,et al.  Sleep duration: how well do self-reports reflect objective measures? The CARDIA Sleep Study , 2008 .

[2]  A. Krystal,et al.  Sleep and psychiatric disorders: future directions. , 2006, The Psychiatric clinics of North America.

[3]  Anne Germain,et al.  Sleep-specific mechanisms underlying posttraumatic stress disorder: integrative review and neurobiological hypotheses. , 2008, Sleep medicine reviews.

[4]  A. Krystal,et al.  Ambulatory Polysomnography: Technical Aspects and Normative Values , 1992, Journal of clinical neurophysiology : official publication of the American Electroencephalographic Society.

[5]  R. Dahl,et al.  Pathways to adolescent health sleep regulation and behavior. , 2002, The Journal of adolescent health : official publication of the Society for Adolescent Medicine.

[6]  C. Guilleminault,et al.  Meta-analysis of quantitative sleep parameters from childhood to old age in healthy individuals: developing normative sleep values across the human lifespan. , 2004, Sleep.

[7]  Daniel J Buysse,et al.  The consensus sleep diary: standardizing prospective sleep self-monitoring. , 2012, Sleep.

[8]  H A Skinner The drug abuse screening test. , 1982, Addictive behaviors.

[9]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[10]  P Badia,et al.  Sleep fragmentation and daytime sleepiness. , 1984, Sleep.

[11]  Paul J Rathouz,et al.  Self-Reported and Measured Sleep Duration: How Similar Are They? , 2008, Epidemiology.

[12]  W. Flemons,et al.  Quality of life in sleep disorders. , 2003, Sleep medicine reviews.

[13]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[14]  J. Calabrese,et al.  Development and validation of a screening instrument for bipolar spectrum disorder: the Mood Disorder Questionnaire. , 2000, The American journal of psychiatry.

[15]  Sune Lehmann,et al.  SensibleSleep: A Bayesian Model for Learning Sleep Patterns from Smartphone Events , 2016, PloS one.

[16]  神田 信彦 Beck Depression Inventory-IIについての一考察 , 2004 .

[17]  T. Penzel,et al.  Computer based sleep recording and analysis. , 2000, Sleep medicine reviews.

[18]  Simon Heron,et al.  Encryption: Advanced Encryption Standard (AES) , 2009 .

[19]  P. S. Achilles THE PSYCHOLOGICAL CORPORATION. , 1923, Science.

[20]  Fanglin Chen,et al.  Unobtrusive sleep monitoring using smartphones , 2013, 2013 7th International Conference on Pervasive Computing Technologies for Healthcare and Workshops.

[21]  Fanglin Chen,et al.  StudentLife: assessing mental health, academic performance and behavioral trends of college students using smartphones , 2014, UbiComp.

[22]  Paul J Rathouz,et al.  Objectively measured sleep characteristics among early-middle-aged adults: the CARDIA study. , 2006, American journal of epidemiology.

[23]  R. Spitzer,et al.  The PHQ-9: validity of a brief depression severity measure. , 2001, Journal of general internal medicine.

[24]  Jukka-Pekka Onnela,et al.  Inferring mobility measures from GPS traces with missing data. , 2016, Biostatistics.

[25]  R G Priest,et al.  Night terrors, sleepwalking, and confusional arousals in the general population: their frequency and relationship to other sleep and mental disorders. , 1999, The Journal of clinical psychiatry.

[26]  J. Shotton,et al.  Decision Forests for Classification, Regression, Density Estimation, Manifold Learning and Semi-Supervised Learning , 2011 .

[27]  Jennifer Y F Lau,et al.  The direction of longitudinal associations between sleep problems and depression symptoms: a study of twins aged 8 and 10 years. , 2009, Sleep.

[28]  John Trinder,et al.  Sick and tired: does sleep have a vital role in the immune system? , 2004, Nature Reviews Immunology.

[29]  Dirk Fox,et al.  Advanced Encryption Standard (AES) , 1999, Datenschutz und Datensicherheit.

[30]  H. Kranzler,et al.  The Alcohol Use Disorders Identification Test (AUDIT): validation of a screening instrument for use in medical settings. , 1995, Journal of studies on alcohol.

[31]  M. Carskadon,et al.  Sleep fragmentation in the elderly: Relationship to daytime sleep tendency , 1982, Neurobiology of Aging.

[32]  Konrad Paul Kording,et al.  Mobile Phone Sensor Correlates of Depressive Symptom Severity in Daily-Life Behavior: An Exploratory Study , 2015, Journal of medical Internet research.