Imputation of missing data when measuring physical activity by accelerometry.

PURPOSE We consider the problem of summarizing accelerometer activity count data accumulated over multiple days when the interval during which the monitor is worn varies across subjects and days. Because counts are not recorded while the monitor is not worn, many common estimators of daily physical activity are biased downward.

METHODS Data from the Trial for Activity in Adolescent Girls (TAAG), a multicenter group-randomized trial to reduce the decline in physical activity among middle-school girls, were used to illustrate the bias in estimates of physical activity caused by missing accelerometer data. The effectiveness of two imputation procedures in reducing this bias was investigated in a simulation experiment. Count data for an entire day, or for a segment of the day, were deleted either at random or informatively, with a higher probability of missingness at higher levels of body mass index (BMI) and lower levels of physical activity.

RESULTS When data were deleted at random, estimates of activity computed from the observed data and those computed from a data set in which the missing data had been imputed were equally unbiased; however, the imputation estimates were more precise. When data were deleted systematically, the bias in estimated activity was lower with the imputation procedures. The two imputation techniques, single imputation using the EM algorithm and multiple imputation (MI), performed similarly, with no significant differences in bias or precision.

CONCLUSIONS Researchers are encouraged to take advantage of software that implements missing-value imputation because, when accelerometer data are intermittently missing, the resulting estimates of activity are more precise and less biased than those from an observed-data analysis.
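The two deletion mechanisms described in METHODS can be illustrated with a small simulation. The sketch below is not the authors' procedure and does not use TAAG data: the activity scale, the BMI distribution, the logistic missingness model, and every numeric parameter are assumptions chosen only to make the contrast visible. It deletes whole subject-days and reports the bias of the observed-data mean under each mechanism; it does not implement EM or multiple imputation.

```python
import numpy as np


def simulate_days(n_subjects=2000, n_days=7, seed=0):
    """Simulate hypothetical daily activity values (arbitrary units) per subject."""
    rng = np.random.default_rng(seed)
    bmi = rng.normal(21.0, 4.0, size=n_subjects)
    # Assumption: subjects with higher BMI are, on average, less active.
    subject_mean = 30.0 - 0.8 * (bmi - 21.0) + rng.normal(0.0, 5.0, size=n_subjects)
    activity = subject_mean[:, None] + rng.normal(0.0, 8.0, size=(n_subjects, n_days))
    return bmi, activity


def delete_at_random(activity, rate, rng):
    """Set each subject-day to NaN with the same probability `rate`."""
    missing = rng.random(activity.shape) < rate
    return np.where(missing, np.nan, activity)


def delete_informatively(activity, bmi, rate, rng):
    """Set subject-days to NaN with probability rising with BMI and falling with activity."""
    z_bmi = (bmi - bmi.mean()) / bmi.std()
    z_act = (activity - activity.mean()) / activity.std()
    # Assumed logistic model; `rate` only fixes the intercept, not the exact missing fraction.
    logit = np.log(rate / (1.0 - rate)) + 1.0 * z_bmi[:, None] - 1.0 * z_act
    prob_missing = 1.0 / (1.0 + np.exp(-logit))
    missing = rng.random(activity.shape) < prob_missing
    return np.where(missing, np.nan, activity)


def compare_bias(rate=0.3, seed=0):
    """Return (bias under random deletion, bias under informative deletion)."""
    bmi, activity = simulate_days(seed=seed)
    rng = np.random.default_rng(seed + 1)
    true_mean = activity.mean()
    # Observed-data estimator: average over the subject-days that remain.
    bias_random = np.nanmean(delete_at_random(activity, rate, rng)) - true_mean
    bias_informative = np.nanmean(delete_informatively(activity, bmi, rate, rng)) - true_mean
    return bias_random, bias_informative


if __name__ == "__main__":
    b_random, b_informative = compare_bias()
    print(f"bias, deleted at random:     {b_random:+.2f}")
    print(f"bias, deleted informatively: {b_informative:+.2f}")
```

In this toy setting the observed-data mean stays close to the full-data mean under random deletion and drifts away from it under informative deletion, because the days that go missing are disproportionately low-activity days. The direction and size of the bias depend entirely on the assumed missingness model and on the estimator; the downward bias noted in PURPOSE concerns totals accumulated over partially worn days, which this sketch does not model.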
