A Bayesian Analysis of Body Mass Index Data From Small Domains Under Nonignorable Nonresponse and Selection

Here we analyze body mass index (BMI) data for children and adolescents from the Third National Health and Nutrition Examination Survey (NHANES III). Because of the lack of BMI values for a considerable number of the children and adolescents, and the differential probabilities of selection of these individuals, serious nonresponse and selection bias in inference can be present. To analyze the NHANES III BMI data, a nonignorable nonresponse model has been proposed to estimate the finite population means of small domains formed by crossing age, race and sex within counties. In this approach, the log-BMI values are used to obtain more normally distributed data, and the model includes a spline regression of log-BMI on age, adjusted for race, sex, and the interaction of race and sex. In this work, to assess the status of overweight and obesity in children and adolescents, our new model predicts the more informative finite population percentiles of BMI for these small domains, incorporating additional measures to minimize possible biases. These measures are the most appropriate transformation for the skewed BMI data, incorporation of an intraclass correlation within the households, and inclusion of selection probabilities into the nonignorable nonresponse model to reflect the higher probabilities of selection among black non-Hispanics and Hispanic-Americans. We also consider robustness and sensitivity to the assumptions of the nonignorable nonresponse model by fitting several versions of our proposed model, as well as a very different ignorable nonresponse model that uses a mixture of Student t densities, selection probabilities, and BMI values. It is noteworthy that in the likelihood-based inference literature, we have seen no work by others that includes both nonignorable nonresponse and nonignorable selection. Based on NHANES III data, we show that there are differences in the 85th or 95th percentile for overweight by county, race, and particularly age, and a small difference in sex.

[1]  F. Eugene Yates,et al.  The national center for health statistics , 1973, Annals of Biomedical Engineering.

[2]  G. Cohen,et al.  Are nonrespondents to health surveys less healthy than respondents , 2002 .

[3]  R. Horwitz The planning of observational studies of human populations , 1979 .

[4]  Pla Gw Oral health and nutrition. , 1994 .

[5]  A. Astrup,et al.  Obesity : Preventing and managing the global epidemic , 2000 .

[6]  A. Fagot-Campagna Emergence of Type 2 Diabetes Mellitus in Children: Epidemiological Evidence , 2000, Journal of pediatric endocrinology & metabolism : JPEM.

[7]  Elizabeth A. Stasny,et al.  Hierarchical Models for the Probabilities of a Survey Classification and Nonresponse: An Example from the National Crime Survey , 1991 .

[8]  P. J. Green,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[9]  W. Davis,et al.  Model-based small area estimates of overweight prevalence using sample selection adjustment. , 1999, Statistics in medicine.

[10]  Shumei S. Guo,et al.  2000 CDC Growth Charts for the United States: methods and development. , 2002, Vital and health statistics. Series 11, Data from the National Health Survey.

[11]  C. R. Rao,et al.  Weighted distributions and size-biased sampling with applications to wildlife populations and human families , 1978 .

[12]  W. Dietz,et al.  Guidelines for overweight in adolescent preventive services: recommendations from an expert committee. The Expert Committee on Clinical Guidelines for Overweight in Adolescent Preventive Services. , 1994, The American journal of clinical nutrition.

[13]  E. Olusegun George,et al.  A remark on the shape of the logistic distribution , 1978 .

[14]  Balgobin Nandram,et al.  Hierarchical Bayesian Nonresponse Models for Binary Data From Small Areas With Uncertainty About Ignorability , 2002 .

[15]  J. Copas,et al.  Inference for Non‐random Samples , 1997 .

[16]  T. Hettmansperger,et al.  Robust Nonparametric Statistical Methods , 1998 .

[17]  Katherine M Flegal,et al.  Prevalence and trends in overweight among US children and adolescents, 1999-2000. , 2002, JAMA.

[18]  Mitchell H Gail,et al.  Excess Deaths Associated With Underweight, Overweight, and Obesity: An Evaluation of Potential Bias. , 2005, Vital & health statistics. Series 3, Analytical and epidemiological studies.

[19]  T J Cole,et al.  Growth charts for both cross-sectional and longitudinal data. , 1994, Statistics in medicine.

[20]  Danny Pfeffermann,et al.  Prediction of finite population totals based on the sample distribution , 2004 .

[21]  Roderick J. A. Little,et al.  The NHANES III multiple imputation project , 1996 .

[22]  B. Rosner,et al.  Percentiles for body mass index in U.S. children 5 to 17 years of age. , 1998, The Journal of pediatrics.

[23]  M. Gail,et al.  Cause-specific excess deaths associated with underweight, overweight, and obesity. , 2007, JAMA.

[24]  Plan and operation of the Third National Health and Nutrition Examination Survey, 1988-94. Series 1: programs and collection procedures. , 1994, Vital and health statistics. Ser. 1, Programs and collection procedures.

[25]  L. Fahrmeir,et al.  Alternative regression models to assess increase in childhood BMI , 2008, BMC medical research methodology.

[26]  W. Dietz Health consequences of obesity in youth: childhood predictors of adult disease. , 1998, Pediatrics.

[27]  T. Cole The LMS method for constructing normalized growth standards. , 1990, European journal of clinical nutrition.

[28]  J. Heckman Sample selection bias as a specification error , 1979 .

[29]  D. Cox,et al.  An Analysis of Transformations , 1964 .

[30]  Danny Pfeffermann,et al.  Small Area Estimation , 2011, International Encyclopedia of Statistical Science.

[31]  Roderick J. A. Little,et al.  Statistical Analysis with Missing Data , 1988 .

[32]  D B Allison,et al.  Body mass index as a measure of adiposity among children and adolescents: a validation study. , 1998, The Journal of pediatrics.

[33]  William S. Reece,et al.  Imputation of Missing Values When the Probability of Response Depends on the Variable Being Imputed , 1982 .

[34]  R. Little,et al.  A note about models for selectivity bias. , 1985 .

[35]  Alan E. Gelfand,et al.  Model choice: A minimum posterior predictive loss approach , 1998, AISTATS.

[36]  Calyampudi Radhakrishna Rao,et al.  Linear Statistical Inference and its Applications , 1967 .

[37]  A. Craft,et al.  Implications of childhood obesity for adult health: findings from thousand families cohort study , 2001, BMJ : British Medical Journal.

[38]  Darrell M. Wilson,et al.  Standardized percentile curves of body-mass index for children and adolescents. , 1991, American journal of diseases of children.

[39]  D. Pfeffermann,et al.  Small-Area Estimation Under Informative Probability Sampling of Areas and Within the Selected Areas , 2007 .

[40]  A. Gelman Prior distributions for variance parameters in hierarchical models (comment on article by Browne and Draper) , 2004 .

[41]  Shumei S. Guo,et al.  CDC GROWTH CHARTS FOR THE UNITED STATES: METHODS AND DEVELOPMENT 2000 , 2002 .