CRITICAL WINDOW VARIABLE SELECTION FOR MIXTURES: ESTIMATING THE IMPACT OF MULTIPLE AIR POLLUTANTS ON STILLBIRTH.

Understanding the role of time-varying pollution mixtures on human health is critical as people are simultaneously exposed to multiple pollutants during their lives. For vulnerable sub-populations who have well-defined exposure periods (e.g., pregnant women), questions regarding critical windows of exposure to these mixtures are important for mitigating harm. We extend Critical Window Variable Selection (CWVS) to the multipollutant setting by introducing CWVS for Mixtures (CWVSmix), a hierarchical Bayesian method that combines smoothed variable selection and temporally correlated weight parameters to (i) identify critical windows of exposure to mixtures of time-varying pollutants, (ii) estimate the time-varying relative importance of each individual pollutant and their first order interactions within the mixture, and (iii) quantify the impact of the mixtures on health. Through simulation, we show that CWVSmix offers the best balance of performance in each of these categories in comparison to competing methods. Using these approaches, we investigate the impact of exposure to multiple ambient air pollutants on the risk of stillbirth in New Jersey, 2005-2014. We find consistent elevated risk in gestational weeks 2, 16-17, and 20 for non-Hispanic Black mothers, with pollution mixtures dominated by ammonium (weeks 2, 17, 20), nitrate (weeks 2, 17), nitrogen oxides (weeks 2, 16), PM2.5 (week 2), and sulfate (week 20). The method is available in the R package CWVSmix.

[1]  R. Basu,et al.  Association of Air Pollution and Heat Exposure With Preterm Birth, Low Birth Weight, and Stillbirth in the US , 2020, JAMA network open.

[2]  Brent A Coull,et al.  Bayesian distributed lag interaction models to identify perinatal windows of vulnerability in children's health. , 2016, Biostatistics.

[3]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[4]  Federico Ferrari,et al.  Bayesian Factor Analysis for Inference on Interactions , 2019, Journal of the American Statistical Association.

[5]  Montserrat Fuentes,et al.  Spatial‐Temporal Modeling of the Association between Air Pollution Exposure and Preterm Birth: Identifying Critical Windows of Exposure , 2012, Biometrics.

[6]  M. Green Air pollution and health , 1995 .

[7]  Brent A. Coull,et al.  Estimating the health effects of environmental mixtures using Bayesian semiparametric regression and sparsity inducing priors , 2017, The Annals of Applied Statistics.

[8]  Adrian F. M. Smith,et al.  Sampling-Based Approaches to Calculating Marginal Densities , 1990 .

[9]  James G. Scott,et al.  Bayesian Inference for Logistic Models Using Pólya–Gamma Latent Variables , 2012, 1205.0310.

[10]  Kyu Ha Lee,et al.  Lagged kernel machine regression for identifying time windows of susceptibility to exposures of complex mixtures. , 2015, Biostatistics.

[11]  Howard H. Chang,et al.  INTEGRATIVE STATISTICAL METHODS FOR EXPOSURE MIXTURES AND HEALTH. , 2020, The annals of applied statistics.

[12]  R. Horton,et al.  Stillbirths: ending an epidemic of grief , 2016, The Lancet.

[13]  R. Basu,et al.  Association of stillbirth with ambient air pollution in a California cohort study. , 2015, American journal of epidemiology.

[14]  David M Stieb,et al.  Ambient air pollution, birth weight and preterm birth: a systematic review and meta-analysis. , 2012, Environmental research.

[15]  N. Bruce,et al.  Risk of low birth weight and stillbirth associated with indoor air pollution from solid fuel use in developing countries. , 2010, Epidemiologic reviews.

[16]  S. Tong,et al.  Methodological challenges when estimating the effects of season and seasonal exposures on birth outcomes , 2011, BMC medical research methodology.

[17]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Manish Arora,et al.  Modeling the health effects of time‐varying complex environmental mixtures: Mean field variational Bayes for lagged kernel machine regression , 2018, Environmetrics.

[19]  D. Dunson,et al.  IDENTIFYING MAIN EFFECTS AND INTERACTIONS AMONG EXPOSURES USING GAUSSIAN PROCESSES. , 2019, The annals of applied statistics.

[20]  D. Hoyert,et al.  Cause of Fetal Death: Data From the Fetal Death Report, 2014. , 2016, National vital statistics reports : from the Centers for Disease Control and Prevention, National Center for Health Statistics, National Vital Statistics System.

[21]  Montserrat Fuentes,et al.  Bayesian multinomial probit modeling of daily windows of susceptibility for maternal PM2.5 exposure and congenital heart defects , 2016, Statistics in medicine.

[22]  Cunrui Huang,et al.  Ambient air pollution and stillbirth: An updated systematic review and meta-analysis of epidemiological studies. , 2021, Environmental pollution.

[23]  H. Wackernagle,et al.  Multivariate geostatistics: an introduction with applications , 1998 .

[24]  Joshua L Warren,et al.  Assessment of critical exposure and outcome windows in time-to-event analysis with application to air pollution and preterm birth study. , 2015, Biostatistics.

[25]  Daniel F. Schmidt,et al.  High-Dimensional Bayesian Regularised Regression with the BayesReg Package , 2016, 1611.06649.

[26]  Howard H. Chang,et al.  A spatially varying distributed lag model with application to an air pollution and term low birth weight study , 2020, Journal of the Royal Statistical Society. Series C, Applied statistics.

[27]  L. Folinsbee Human health effects of air pollution. , 1993, Environmental health perspectives.

[28]  J. Berger,et al.  Optimal predictive model selection , 2004, math/0406464.

[29]  Ghalib A. Bello,et al.  Extending the Distributed Lag Model framework to handle chemical mixtures , 2017, Environmental research.

[30]  Chris Gennings,et al.  Characterization of Weighted Quantile Sum Regression for Highly Correlated Data in a Risk Analysis Setting , 2014, Journal of Agricultural, Biological, and Environmental Statistics.

[31]  Howard H. Chang,et al.  Application of a Fusion Method for Gas and Particle Air Pollutants between Observational Data and Chemical Transport Model Simulations Over the Contiguous United States for 2005–2014 , 2019, International journal of environmental research and public health.

[32]  Christopher D. Barr,et al.  Protecting Human Health From Air Pollution: Shifting From a Single-pollutant to a Multipollutant Approach , 2010, Epidemiology.

[33]  A. Herring,et al.  Current approaches used in epidemiologic studies to examine short-term multipollutant air pollution exposures. , 2017, Annals of epidemiology.

[34]  T. Hastie,et al.  Learning Interactions via Hierarchical Group-Lasso Regularization , 2015, Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America.

[35]  John Geweke,et al.  Evaluating the accuracy of sampling-based approaches to the calculation of posterior moments , 1991 .

[36]  Gavin Pereira,et al.  Investigating the Impact of Maternal Residential Mobility on Identifying Critical Windows of Susceptibility to Ambient Air Pollution During Pregnancy , 2018, American journal of epidemiology.

[37]  Nazeeba Siddika,et al.  Prenatal ambient air pollution exposure and the risk of stillbirth: systematic review and meta-analysis of the empirical evidence , 2016, Occupational and Environmental Medicine.

[38]  Brent A. Coull,et al.  Kernel Machine and Distributed Lag Models for Assessing Windows of Susceptibility to Mixtures of Time-Varying Environmental Exposures in Children's Health Studies , 2019 .

[39]  Ambarina S. Faiz,et al.  Original Contribution Ambient Air Pollution and the Risk of Stillbirth , 2012 .

[40]  Bhramar Mukherjee,et al.  Distributed lag interaction models with two pollutants , 2018, Journal of the Royal Statistical Society. Series C, Applied statistics.

[41]  E. Colicino,et al.  Per- and poly-fluoroalkyl substances and bone mineral density , 2019, Environmental epidemiology.

[42]  Ghalib A. Bello,et al.  Lagged WQS regression for mixtures with many components , 2020, Environmental research.

[43]  Montserrat Fuentes,et al.  Bayesian spatial–temporal model for cardiac congenital anomalies and ambient air pollution risk assessment , 2012, Environmetrics.

[44]  Joshua L Warren,et al.  Critical window variable selection: estimating the impact of air pollution on very preterm birth. , 2019, Biostatistics.

[45]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .