Estimating serum concentrations of dioxin-like compounds in the U.S. population effective 2005-2006 and 2007-2008: A multiple imputation and trending approach incorporating NHANES pooled sample data.

Dioxin-like compounds (DLCs) are monitored in the U.S. population using data collected with the National Health and Nutrition Examination Survey (NHANES). Until recently, participants' serum samples have been analyzed individually, and summary statistics defining reference ranges by age, gender, and race/ethnicity have served as the background by which other biomonitoring data can be evaluated. In the most recent NHANES DLC data, 2005-2006 and 2007-2008, participants' sera have been physically pooled prior to laboratory analysis, introducing major challenges to their utility as a reference population: variability among individuals and relations with covariates are lost, and individual design effects cannot be applied. Further, the substantial drop in limits of detection (LODs) in pooled sample biennials prevents reliable comparisons to individual data, and has complicated estimates of change over time. In this study, we address the drawbacks introduced by pooled samples by generating U.S. population reference ranges based on individual-level data adjusted to 2005-2006 and 2007-2008 levels. Using publicly available data, multiple imputation (MI) generated four NHANES biennials (2001-2008) of individual DLC data; we then trended the change over time in each DLC by demographic stratum. NHANES 2003-2004 individuals were adjusted by the trended change over time. Population estimates of toxic equivalency (TEQ) concentrations were calculated using traditional MI survey analysis methods and reference tables provided for 2005-2006 and 2007-2008 by age, race, and gender. Demographic differences in TEQ concentrations and trended change are reported, e.g. TEQ continues to drop in young adults aged 20-39, but distributions appear stable in older adults 60+; Mexican Americans have consistently lowest dioxins, furans, and PCBs, with non-Hispanic Blacks dropping to the same levels as non-Hispanic Whites in dioxins and PCBs and significantly below non-Hispanic Whites in furans by 2007-2008. Additionally, the ratio of 95th percentile to mean in DLC distributions was found to vary by age, between dioxins, furans, and PCBs, and across mean, making a simple ratio approach impractical for describing population concentrations using pooled samples. We discuss the practical implications of the pooled sample method, the performance of this trending solution in the context of other methods, and expected effects of distribution assumptions on variability and TEQ estimates, particularly in largely undetected congeners. These updated reference populations of individuals, along with information on trending, provide a common and valid basis for interpreting other individually sampled biomonitoring data.

[1]  Donald B. Rubin,et al.  Nested multiple imputation of NMES via partially incompatible MCMC , 2003 .

[2]  R. Vermeulen,et al.  Serum metabolomic pertubations among workers exposed to 2,3,7,8‐tetrachlorodibenzo‐p‐dioxin (TCDD) , 2013, Environmental and molecular mutagenesis.

[3]  L. Needham,et al.  Serum dioxin levels in residents of Calcasieu and Lafayette parishes, Louisiana with comparison to the US population , 2008, Journal of Exposure Science and Environmental Epidemiology.

[4]  Hadley Wickham,et al.  ggplot2 - Elegant Graphics for Data Analysis (2nd Edition) , 2017 .

[5]  D. Rubin,et al.  Multiple Imputation for Nonresponse in Surveys , 1989 .

[6]  Han K. Kang,et al.  Health status of Army Chemical Corps Vietnam veterans who sprayed defoliant in Vietnam. , 2006, American journal of industrial medicine.

[7]  D. Rubin,et al.  Fully conditional specification in multivariate imputation , 2006 .

[8]  Jorge Nocedal,et al.  A Limited Memory Algorithm for Bound Constrained Optimization , 1995, SIAM J. Sci. Comput..

[9]  J. Weuve,et al.  Polychlorinated Biphenyl Exposures and Cognition in Older U.S. Adults: NHANES (1999–2002) , 2013, Environmental health perspectives.

[10]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[11]  S. Caudill,et al.  Use of pooled samples from the national health and nutrition examination survey , 2012, Statistics in medicine.

[12]  Stef van Buuren,et al.  MICE: Multivariate Imputation by Chained Equations in R , 2011 .

[13]  C. Tohyama,et al.  The 2005 World Health Organization reevaluation of human and Mammalian toxic equivalency factors for dioxins and dioxin-like compounds. , 2006, Toxicological sciences : an official journal of the Society of Toxicology.

[14]  D. Paustenbach,et al.  Addendum to: Evaluation of PCDD/F and dioxin-like PCB serum concentration data from the 2001–2002 National Health and Nutrition Examination Survey of the United States population , 2007, Journal of Exposure Science and Environmental Epidemiology.

[15]  S. Caudill Confidence interval estimation for pooled-sample biomonitoring from a complex survey design. , 2015, Environment International.

[16]  J. Grzywacz,et al.  A Bayesian multiple imputation method for handling longitudinal pesticide data with values below the limit of detection , 2013, Environmetrics.

[17]  Stef van Buuren,et al.  Flexible Imputation of Missing Data , 2012 .

[18]  A. Calafat,et al.  Polybrominated diphenyl ethers, polychlorinated biphenyls, and persistent pesticides in serum from the national health and nutrition examination survey: 2003-2008. , 2014, Environmental science & technology.

[19]  John Van Hoewyk,et al.  A multivariate technique for multiply imputing missing values using a sequence of regression models , 2001 .

[20]  M. Shima,et al.  Association between blood levels of PCDDs/PCDFs/dioxin-like PCBs and history of allergic and other diseases in the Japanese population , 2013, International Archives of Occupational and Environmental Health.

[21]  D. Naiman,et al.  Perspective on serum dioxin levels in the United States: an evaluation of the NHANES data , 2009, Journal of Exposure Science and Environmental Epidemiology.

[22]  A. Gelman Parameterization and Bayesian Modeling , 2004 .

[23]  M. Pavuk,et al.  Serum concentrations of TCDD and other dioxin-like compounds in US Air Force veterans of Operation Ranch Hand. , 2014, Chemosphere.

[24]  Ken P Kleinman,et al.  Much Ado About Nothing , 2007, The American statistician.

[25]  Jürgen Unützer,et al.  A comparison of imputation methods in a longitudinal randomized clinical trial , 2005, Statistics in medicine.

[26]  M. Kenward,et al.  Multiple imputation for missing data in epidemiological and clinical research: potential and pitfalls , 2009, BMJ : British Medical Journal.

[27]  Thomas Lumley,et al.  Analysis of Complex Survey Samples , 2004 .

[28]  Magda Gasull,et al.  Population variation in biomonitoring data for persistent organic pollutants (POPs): an examination of multiple population-based datasets for application to Australian pooled biomonitoring data. , 2014, Environment international.

[29]  M. Lorber A pharmacokinetic model for estimating exposure of Americans to dioxin-like compounds in the past, present, and future. , 2002, The Science of the total environment.