Systematic review of the validity and reliability of consumer-wearable activity trackers

BackgroundConsumer-wearable activity trackers are electronic devices used for monitoring fitness- and other health-related metrics. The purpose of this systematic review was to summarize the evidence for validity and reliability of popular consumer-wearable activity trackers (Fitbit and Jawbone) and their ability to estimate steps, distance, physical activity, energy expenditure, and sleep.MethodsSearches included only full-length English language studies published in PubMed, Embase, SPORTDiscus, and Google Scholar through July 31, 2015. Two people reviewed and abstracted each included study.ResultsIn total, 22 studies were included in the review (20 on adults, 2 on youth). For laboratory-based studies using step counting or accelerometer steps, the correlation with tracker-assessed steps was high for both Fitbit and Jawbone (Pearson or intraclass correlation coefficients (CC) > =0.80). Only one study assessed distance for the Fitbit, finding an over-estimate at slower speeds and under-estimate at faster speeds. Two field-based studies compared accelerometry-assessed physical activity to the trackers, with one study finding higher correlation (Spearman CC 0.86, Fitbit) while another study found a wide range in correlation (intraclass CC 0.36–0.70, Fitbit and Jawbone). Using several different comparison measures (indirect and direct calorimetry, accelerometry, self-report), energy expenditure was more often under-estimated by either tracker. Total sleep time and sleep efficiency were over-estimated and wake after sleep onset was under-estimated comparing metrics from polysomnography to either tracker using a normal mode setting. No studies of intradevice reliability were found. Interdevice reliability was reported on seven studies using the Fitbit, but none for the Jawbone. Walking- and running-based Fitbit trials indicated consistently high interdevice reliability for steps (Pearson and intraclass CC 0.76–1.00), distance (intraclass CC 0.90–0.99), and energy expenditure (Pearson and intraclass CC 0.71–0.97). When wearing two Fitbits while sleeping, consistency between the devices was high.ConclusionThis systematic review indicated higher validity of steps, few studies on distance and physical activity, and lower validity for energy expenditure and sleep. The evidence reviewed indicated high interdevice reliability for steps, distance, energy expenditure, and sleep for certain Fitbit models. As new activity trackers and features are introduced to the market, documentation of the measurement properties can guide their use in research settings.

[1]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[2]  P S Freedson,et al.  Calibration of the Computer Science and Applications, Inc. accelerometer. , 1998, Medicine and science in sports and exercise.

[3]  Scott E Crouter,et al.  Validity of 10 electronic pedometers for measuring steps, distance, and energy cost. , 2003, Medicine and science in sports and exercise.

[4]  Scott E Crouter,et al.  Spring-levered versus piezo-electric pedometer accuracy in overweight and obese adults. , 2005, Medicine and science in sports and exercise.

[5]  Patricia A Higgins,et al.  Understanding the error of our ways: mapping the concepts of validity and reliability. , 2006, Nursing outlook.

[6]  J. Ioannidis,et al.  The PRISMA Statement for Reporting Systematic Reviews and Meta-Analyses of Studies That Evaluate Health Care Interventions: Explanation and Elaboration , 2009, Annals of Internal Medicine [serial online].

[7]  W. van Mechelen,et al.  Validity and reproducibility of motion sensors in youth: a systematic update. , 2009, Medicine and science in sports and exercise.

[8]  D. Moher,et al.  Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. , 2010, International journal of surgery.

[9]  S. Michie,et al.  A refined taxonomy of behaviour change techniques to help people change their physical activity and healthy eating behaviours: The CALO-RE taxonomy , 2011, Psychology & health.

[10]  H. Montgomery-Downs,et al.  Movement toward a novel activity monitoring device , 2012, Sleep and Breathing.

[11]  Dinesh John,et al.  Validation and comparison of ActiGraph activity monitors. , 2011, Journal of science and medicine in sport.

[12]  Gregory J Welk,et al.  Protocols for evaluating equivalency of accelerometry-based activity monitors. , 2012, Medicine and science in sports and exercise.

[13]  Dinesh John,et al.  ActiGraph and Actical physical activity monitors: a peek under the hood. , 2012, Medicine and science in sports and exercise.

[14]  Guy Faulkner,et al.  Is this Bit Fit? Measuring the Quality of the Fitbit Step-Counter , 2012 .

[15]  H. Montgomery-Downs,et al.  Use of actigraphy for assessment in pediatric sleep research. , 2012, Sleep medicine reviews.

[16]  Konrad Tollmar,et al.  Health Mashups: Presenting Statistical Patterns between Wellbeing Data and Context in Natural Language to Promote Behavior Change , 2013, TCHI.

[17]  John P. Porcari,et al.  The accuracy of various activity trackers in estimating steps taken and energy expenditure , 2013 .

[18]  Yvonne Rogers,et al.  Introduction to the special issue on the theory and practice of embodied interaction in HCI and interaction design , 2013, TCHI.

[19]  Adolfo Muñoz,et al.  The Usefulness of Activity Trackers in Elderly with Reduced Mobility: A Case Study , 2013, MedInfo.

[20]  Raymond C Browning,et al.  A comparison of energy expenditure estimation of several physical activity monitors. , 2013, Medicine and science in sports and exercise.

[21]  César A. Hidalgo,et al.  Unique in the Crowd: The privacy bounds of human mobility , 2013, Scientific Reports.

[22]  Jesse Dallery,et al.  Internet-based contingency management increases walking in sedentary adults. , 2013, Journal of applied behavior analysis.

[23]  George D Fulk,et al.  Accuracy of 2 Activity Monitors in Detecting Steps in People With Stroke and Traumatic Brain Injury , 2013, Physical Therapy.

[24]  Shaw Bronner,et al.  Comparison of steps and energy expenditure assessment in adults of Fitbit Tracker and Ultra to the Actical and indirect calorimetry , 2013, Journal of medical engineering & technology.

[25]  Gabriel A. Koepp,et al.  "Go4Life" exercise counseling, accelerometer feedback, and activity levels in older people. , 2014, Archives of gerontology and geriatrics.

[26]  Misha Pavel,et al.  Comparative assessment of sleep quality estimates using home monitoring technology , 2014, 2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[27]  Gregory J Welk,et al.  Validity of consumer-based physical activity monitors. , 2014, Medicine and science in sports and exercise.

[28]  Donald R. Dengel,et al.  Comparison of FitBit® Ultra to ActiGraph™ GT1M for Assessment of Physical Activity in Young Adults During Treadmill Walking , 2014 .

[29]  Mark V Albert,et al.  Monitoring daily function in persons with transfemoral amputations using a commercial activity monitor: a feasibility study. , 2014, PM & R : the journal of injury, function, and rehabilitation.

[30]  Dinesh John,et al.  ActiGraph™ activity monitors: "the firmware effect". , 2014, Medicine and science in sports and exercise.

[31]  Elizabeth J Lyons,et al.  Behavior Change Techniques Implemented in Electronic Lifestyle Activity Monitors: A Systematic Content Analysis , 2014, Journal of medical Internet research.

[32]  P. Siemonsma,et al.  Validity and Usability of Low-Cost Accelerometers for Internet-Based Self-Monitoring of Physical Activity in Patients With Chronic Obstructive Pulmonary Disease , 2014, Interactive journal of medical research.

[33]  J. Takács,et al.  Validation of the Fitbit One activity monitor device during treadmill walking. , 2014, Journal of science and medicine in sport.

[34]  S. Insana,et al.  Caloric expenditure assessment among older adults: Criterion validity of a novel accelerometry device , 2014, Journal of health psychology.

[35]  Mark A Tully,et al.  The validation of Fibit Zip™ physical activity monitor as a measure of free-living physical activity , 2014, BMC Research Notes.

[36]  K. Banna,et al.  Preliminary efficacy of prize-based contingency management to increase activity levels in healthy adults. , 2014, Journal of applied behavior analysis.

[37]  K. Volpp,et al.  Accuracy of smartphone applications and wearable devices for tracking physical activity data. , 2015, JAMA.

[38]  Remko R. Lengton,et al.  Measuring steps with the Fitbit activity tracker: an inter-device reliability study , 2015, Journal of medical engineering & technology.

[39]  Julie B. Wang,et al.  Wearable Sensor/Device (Fitbit One) and SMS Text-Messaging Prompts to Increase Physical Activity in Overweight and Obese Adults: A Randomized Controlled Trial. , 2015, Telemedicine journal and e-health : the official journal of the American Telemedicine Association.

[40]  Janice J Eng,et al.  Capturing step counts at slow walking speeds in older adults: comparison of ankle and waist placement of measuring device. , 2015, Journal of rehabilitation medicine.

[41]  Mary Beth Rosson,et al.  Use and Adoption Challenges of Wearable Activity Trackers , 2015 .

[42]  Tim Olds,et al.  The validity of consumer-level, activity monitors in healthy adults worn in free-living conditions: a cross-sectional study , 2015, International Journal of Behavioral Nutrition and Physical Activity.

[43]  L. Cadmus-Bertram,et al.  Randomized Trial of a Fitbit-Based Physical Activity Intervention for Women. , 2015, American journal of preventive medicine.

[44]  J. Naslund,et al.  Feasibility of popular m-health technologies for activity tracking among individuals with serious mental illness. , 2015, Telemedicine journal and e-health : the official journal of the American Telemedicine Association.

[45]  K. Patrick,et al.  Technologies to measure and modify physical activity and eating environments. , 2015, American journal of preventive medicine.

[46]  C. Mazzà,et al.  Step Detection and Activity Recognition Accuracy of Seven Physical Activity Monitors , 2015, PloS one.

[47]  J. Schwartz,et al.  Abstract MP11: Fitbit: An Accurate and Reliable Device for Wireless Physical Activity Tracking , 2015 .

[48]  Ian M Colrain,et al.  Validation of Sleep-Tracking Technology Compared with Polysomnography in Adolescents. , 2015, Sleep.

[49]  Michael Popke Weathering the Weather , 2015 .

[50]  C. V. Van Camp,et al.  Increasing physical activity of children during school recess. , 2015, Journal of applied behavior analysis.

[51]  Dinesh John,et al.  Validation of the Fitbit wireless activity tracker for prediction of energy expenditure. , 2015, Journal of physical activity & health.

[52]  L. Phillips,et al.  A Comparison of Accelerometer Accuracy in Older Adults. , 2015, Research in gerontological nursing.

[53]  Fernando Martín-Sánchez,et al.  The use of self-quantification systems for personal health information: big data management activities and prospects , 2015, Health Information Science and Systems.

[54]  Massimiliano de Zambotti,et al.  Evaluation of a consumer fitness-tracking device to assess sleep in adults , 2015, Chronobiology international.

[55]  K. Avis,et al.  Comparison of a Commercial Accelerometer with Polysomnography and Actigraphy in Children and Adolescents. , 2015, Sleep.

[56]  Gregory J Welk,et al.  Comparison of Consumer and Research Monitors under Semistructured Settings. , 2016, Medicine and science in sports and exercise.