Accuracy of Wristband Fitbit Models in Assessing Sleep: Systematic Review and Meta-Analysis

Background Wearable sleep monitors are of high interest to consumers and researchers because of their ability to provide estimation of sleep patterns in free-living conditions in a cost-efficient way. Objective We conducted a systematic review of publications reporting on the performance of wristband Fitbit models in assessing sleep parameters and stages. Methods In adherence with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement, we comprehensively searched the Cumulative Index to Nursing and Allied Health Literature (CINAHL), Cochrane, Embase, MEDLINE, PubMed, PsycINFO, and Web of Science databases using the keyword Fitbit to identify relevant publications meeting predefined inclusion and exclusion criteria. Results The search yielded 3085 candidate articles. After eliminating duplicates and in compliance with inclusion and exclusion criteria, 22 articles qualified for systematic review, with 8 providing quantitative data for meta-analysis. In reference to polysomnography (PSG), nonsleep-staging Fitbit models tended to overestimate total sleep time (TST; range from approximately 7 to 67 mins; effect size=-0.51, P<.001; heterogenicity: I2=8.8%, P=.36) and sleep efficiency (SE; range from approximately 2% to 15%; effect size=-0.74, P<.001; heterogenicity: I2=24.0%, P=.25), and underestimate wake after sleep onset (WASO; range from approximately 6 to 44 mins; effect size=0.60, P<.001; heterogenicity: I2=0%, P=.92) and there was no significant difference in sleep onset latency (SOL; P=.37; heterogenicity: I2=0%, P=.92). In reference to PSG, nonsleep-staging Fitbit models correctly identified sleep epochs with accuracy values between 0.81 and 0.91, sensitivity values between 0.87 and 0.99, and specificity values between 0.10 and 0.52. Recent-generation Fitbit models that collectively utilize heart rate variability and body movement to assess sleep stages performed better than early-generation nonsleep-staging ones that utilize only body movement. Sleep-staging Fitbit models, in comparison to PSG, showed no significant difference in measured values of WASO (P=.25; heterogenicity: I2=0%, P=.92), TST (P=.29; heterogenicity: I2=0%, P=.98), and SE (P=.19) but they underestimated SOL (P=.03; heterogenicity: I2=0%, P=.66). Sleep-staging Fitbit models showed higher sensitivity (0.95-0.96) and specificity (0.58-0.69) values in detecting sleep epochs than nonsleep-staging models and those reported in the literature for regular wrist actigraphy. Conclusions Sleep-staging Fitbit models showed promising performance, especially in differentiating wake from sleep. However, although these models are a convenient and economical means for consumers to obtain gross estimates of sleep parameters and time spent in sleep stages, they are of limited specificity and are not a substitute for PSG.

[1]  Laurel M. Fisher,et al.  A Comparison Study of the Fitbit Activity Monitor and PSG For Assessing Sleep Patterns and Movement in Children , 2016 .

[2]  Wonwoo Byun,et al.  Comparison of Wearable Trackers’ Ability to Estimate Sleep , 2018, International journal of environmental research and public health.

[3]  H. In,et al.  Comparison of Wearable Activity Tracker with Actigraphy for Sleep Evaluation and Circadian Rest-Activity Rhythm Measurement in Healthy Young Adults , 2017, Psychiatry investigation.

[4]  Michael W. Beets,et al.  Comparing measures of free-living sleep in school-aged children. , 2019, Sleep medicine.

[5]  D. Hurley,et al.  Objective measurements of sleep for non‐laboratory settings as alternatives to polysomnography – a systematic review , 2011, Journal of sleep research.

[6]  Job G. Godino,et al.  Measures of sleep and cardiac functioning during sleep using a multi-sensory commercially-available wristband in adolescents , 2016, Physiology & Behavior.

[7]  Z Beattie,et al.  Estimation of sleep stages in a healthy adult population from optical plethysmography and accelerometer signals , 2017, Physiological measurement.

[8]  Gail M. Sullivan,et al.  Using Effect Size-or Why the P Value Is Not Enough. , 2012, Journal of graduate medical education.

[9]  D. Moher,et al.  Preferred Reporting Items for Systematic Reviews and Meta-Analyses: The PRISMA Statement , 2009, BMJ : British Medical Journal.

[10]  K. Diller,et al.  Accuracy of PurePulse photoplethysmography technology of Fitbit Charge 2 for assessment of heart rate during sleep , 2019, Chronobiology international.

[11]  H. Montgomery-Downs,et al.  Movement toward a novel activity monitoring device , 2012, Sleep and Breathing.

[12]  A. Goldstone,et al.  A validation study of Fitbit Charge 2™ compared with polysomnography in adults , 2018, Chronobiology international.

[13]  Rebecca M. C. Spencer,et al.  Reliability of Sleep Measures from Four Personal Health Monitoring Devices Compared to Research-Based Actigraphy and Polysomnography , 2016, Sensors.

[14]  D. Altman,et al.  Measuring inconsistency in meta-analyses , 2003, BMJ : British Medical Journal.

[15]  D. Plante,et al.  Ability of the Fitbit Alta HR to quantify and classify sleep in patients with suspected central disorders of hypersomnolence: A comparison against polysomnography , 2018, Journal of sleep research.

[16]  J. Solet,et al.  Measuring sleep: accuracy, sensitivity, and specificity of wrist actigraphy compared to polysomnography. , 2013, Sleep.

[17]  K. Tsui,et al.  Preliminary Agreement on Tracking Sleep Between a Wrist-Worn Device Fitbit Alta and Consensus Sleep Diary. , 2019, Telemedicine journal and e-health : the official journal of the American Telemedicine Association.

[18]  Gahan Fallone,et al.  Evidence for the validity of a sleep habits survey for adolescents. , 2003, Sleep.

[19]  Kenneth R Diller,et al.  Performance comparison of different interpretative algorithms utilized to derive sleep parameters from wrist actigraphy data , 2019, Chronobiology international.

[20]  D. Dickinson,et al.  A practical validation study of a commercial accelerometer using good and poor sleepers , 2016, Health psychology open.

[21]  Daniel J Buysse,et al.  Field-based Measurement of Sleep: Agreement between Six Commercial Activity Monitors and a Validated Accelerometer , 2020, Behavioral sleep medicine.

[22]  A. Chesson,et al.  Practice parameters for the use of actigraphy in the assessment of sleep and sleep disorders: an update for 2007. , 2007, Sleep.

[23]  S. Drummond,et al.  Pilot Validation of Ambulatory Activity Monitors for Sleep Measurement in Huntington's Disease Gene Carriers. , 2017, Journal of Huntington's disease.

[24]  S. Tokuno,et al.  A validation study of a consumer wearable sleep tracker compared to a portable EEG system in naturalistic conditions. , 2019, Journal of psychosomatic research.

[25]  Michael L. Prairie,et al.  Utility of the Fitbit Flex to evaluate sleep in major depressive disorder: A comparison against polysomnography and wrist-worn actigraphy. , 2017, Journal of affective disorders.

[26]  S. Donegan,et al.  Clinical Epidemiology and Global Health , 2022 .

[27]  Sara Mariani,et al.  Validity of a commercial wearable sleep tracker in adult insomnia disorder patients and good sleepers. , 2017, Journal of psychosomatic research.

[28]  Zilu Liang,et al.  Validity of Consumer Activity Wristbands and Wearable EEG for Measuring Overall Sleep Parameters and Sleep Structure in Free-Living Conditions , 2018, Journal of Healthcare Informatics Research.

[29]  N. Black,et al.  The feasibility of creating a checklist for the assessment of the methodological quality both of randomised and non-randomised studies of health care interventions. , 1998, Journal of epidemiology and community health.

[30]  Walter R. Thompson,et al.  WORLDWIDE SURVEY OF FITNESS TRENDS FOR 2019 , 2018, ACSM'S Health & Fitness Journal.

[31]  Jelle J Goeman,et al.  Plea for routinely presenting prediction intervals in meta-analysis , 2016, BMJ Open.

[32]  P. Igarashi,et al.  HNF1B as a New Transcriptional Activator of the Renal KIR4.1/KIR5.1 Potassium Channel , 2017, The FASEB Journal.

[33]  L. Hedges,et al.  Introduction to Meta‐Analysis , 2009, International Coaching Psychology Review.

[34]  A. Sadeh The role and validity of actigraphy in sleep medicine: an update. , 2011, Sleep medicine reviews.

[35]  Hyun-Sung An,et al.  Concurrent Validity of Wearable Activity Trackers Under Free-Living Conditions , 2017, Journal of strength and conditioning research.

[36]  Billy Sperlich,et al.  Wearable, yes, but able…?: it is time for evidence-based marketing claims! , 2016, British Journal of Sports Medicine.

[37]  C. Sargent,et al.  How well does a commercially available wearable device measure sleep in young athletes? , 2018, Chronobiology international.

[38]  K. Avis,et al.  Comparison of a Commercial Accelerometer with Polysomnography and Actigraphy in Children and Adolescents. , 2015, Sleep.