A Bayesian Hierarchical Modeling Framework for Geospatial Analysis of Adverse Pregnancy Outcomes

Studying the determinants of adverse pregnancy outcomes like stillbirth and preterm birth is of considerable interest in epidemiology. Understanding the role of both individual and community risk factors for these outcomes is crucial for planning appropriate clinical and public health interventions. With this goal, we develop geospatial mixed effects logistic regression models for adverse pregnancy outcomes. Our models account for both spatial autocorrelation and heterogeneity between neighborhoods. To mitigate the low incidence of stillbirth and preterm births in our data, we explore using class rebalancing techniques to improve predictive power. To assess the informative value of the covariates in our models, we use posterior distributions of their coefficients to gauge how well they can be distinguished from zero. As a case study, we model stillbirth and preterm birth in the city of Philadelphia, incorporating both patient-level data from electronic health records (EHR) data and publicly available neighborhood data at the census tract level. We find that patient-level features like self-identified race and ethnicity were highly informative for both outcomes. Neighborhood-level factors were also informative, with poverty important for stillbirth and crime important for preterm birth. Finally, we identify the neighborhoods in Philadelphia at highest risk of stillbirth and preterm birth.

[1]  Michael S Kramer,et al.  The epidemiology of adverse pregnancy outcomes: an overview. , 2003, The Journal of nutrition.

[2]  E. Pebesma,et al.  Classes and Methods for Spatial Data , 2015 .

[3]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[4]  J. Oleson,et al.  Bayesian Point Process Modeling to Quantify Geographic Regions of Excess Stillbirth Risk , 2018, Geographical Analysis.

[5]  J. Martin,et al.  Births: Final Data for 2017. , 2018, National vital statistics reports : from the Centers for Disease Control and Prevention, National Center for Health Statistics, National Vital Statistics System.

[6]  W. Callaghan,et al.  Effects of Maternal Age and Age-Specific Preterm Birth Rates on Overall Preterm Birth Rates - United States, 2007 and 2014. , 2016, MMWR. Morbidity and mortality weekly report.

[7]  Jiaquan Xu,et al.  Deaths: Final Data for 2013. , 2016, National vital statistics reports : from the Centers for Disease Control and Prevention, National Center for Health Statistics, National Vital Statistics System.

[8]  G. Dekker,et al.  Risk Factors for Preterm Birth in an International Prospective Cohort of Nulliparous Women , 2012, PloS one.

[9]  David W. S. Wong,et al.  Comparing implementations of global and local indicators of spatial association , 2018, TEST.

[10]  S. Chib,et al.  Bayesian analysis of binary and polychotomous response data , 1993 .

[11]  Z. Henderson,et al.  CDC Grand Rounds: Public Health Strategies to Prevent Preterm Birth. , 2016, MMWR. Morbidity and mortality weekly report.

[12]  Norman E. Breslow,et al.  Estimation of Disease Rates in Small Areas: A new Mixed Model for Spatial Dependence , 2000 .

[13]  C. Meghea,et al.  Electronic Medical Record Use and Maternal and Child Care and Health , 2016, Maternal and Child Health Journal.

[14]  Duncan Lee,et al.  A comparison of conditional autoregressive models used in Bayesian disease mapping. , 2011, Spatial and spatio-temporal epidemiology.

[15]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[16]  Aparna Lhila Does government provision of healthcare explain the relationship between income inequality and low birthweight? , 2009, Social science & medicine.

[17]  Aki Vehtari,et al.  Understanding predictive information criteria for Bayesian models , 2013, Statistics and Computing.

[18]  Sw. Banerjee,et al.  Hierarchical Modeling and Analysis for Spatial Data , 2003 .

[19]  Xavier Robin,et al.  pROC: an open-source package for R and S+ to analyze and compare ROC curves , 2011, BMC Bioinformatics.

[20]  J. Besag Spatial Interaction and the Statistical Analysis of Lattice Systems , 1974 .

[21]  Montserrat Fuentes,et al.  Spatial‐Temporal Modeling of the Association between Air Pollution Exposure and Preterm Birth: Identifying Critical Windows of Exposure , 2012, Biometrics.

[22]  Dominique Makowski,et al.  Indices of Effect Existence and Significance in the Bayesian Framework , 2019, Front. Psychol..

[23]  I. Greer,et al.  Social inequalities in preterm birth in Scotland 1980–2003: findings from an area‐based measure of deprivation , 2007, BJOG : an international journal of obstetrics and gynaecology.

[24]  W. Grobman,et al.  Associations of neighbourhood crime with adverse pregnancy outcomes among women in Chicago: analysis of electronic health records from 2009 to 2013 , 2018, Journal of Epidemiology & Community Health.

[25]  Michael R Kramer,et al.  Place Matters: Variation in the Black/White Very Preterm Birth Rate across U.S. Metropolitan Areas, 2002–2004 , 2008, Public health reports.

[26]  S. Myers,et al.  The risk of fetal death: current concepts of best gestational age for delivery. , 2013, American journal of obstetrics and gynecology.

[27]  Bradley P. Carlin,et al.  Bayesian measures of model complexity and fit , 2002 .

[28]  Shane T. Jensen,et al.  Spatial modeling of trends in crime over time in Philadelphia , 2019, The Annals of Applied Statistics.

[29]  Kwok Leung Tsui,et al.  A Framework of Rebalancing Imbalanced Healthcare Data for Rare Events' Classification: A Case of Look-Alike Sound-Alike Mix-Up Incident Detection , 2018, Journal of healthcare engineering.

[30]  Marian F MacDorman,et al.  Fetal and perinatal mortality, United States, 2004. , 2007, National vital statistics reports : from the Centers for Disease Control and Prevention, National Center for Health Statistics, National Vital Statistics System.

[31]  Catherine A. Calder,et al.  Beyond Moran's I: Testing for Spatial Dependence Based on the Spatial Autoregressive Model , 2007 .

[32]  Michael R. Kramer,et al.  Metropolitan isolation segregation and Black-White disparities in very preterm birth: a test of mediating pathways and variance explained. , 2010, Social science & medicine.

[33]  W. Youden,et al.  Index for rating diagnostic tests , 1950, Cancer.

[34]  L. D. Levine,et al.  Development and Evaluation of MADDIE: Method to Acquire Delivery Date Information from Electronic Health Records , 2020, medRxiv.

[35]  P. Moran Notes on continuous stochastic phenomena. , 1950, Biometrika.

[36]  M. Boland,et al.  Individual-Level and Neighborhood-Level Risk Factors for Severe Maternal Morbidity , 2021, Obstetrics and gynecology.

[37]  D. Mattison,et al.  Preterm delivery: a public health perspective. , 2001, Paediatric and perinatal epidemiology.

[38]  Hadley Wickham,et al.  ggmap: Spatial Visualization with ggplot2 , 2013, R J..

[39]  Roberto Romero,et al.  Epidemiology and causes of preterm birth , 2008, The Lancet.

[40]  James G. Scott,et al.  BART with targeted smoothing: An analysis of patient-specific stillbirth risk , 2018, The Annals of Applied Statistics.

[41]  Marian F MacDorman,et al.  Fetal and Perinatal Mortality: United States, 2013. , 2015, National vital statistics reports : from the Centers for Disease Control and Prevention, National Center for Health Statistics, National Vital Statistics System.

[42]  James G. Scott,et al.  Bayesian Inference for Logistic Models Using Pólya–Gamma Latent Variables , 2012, 1205.0310.

[43]  Sumio Watanabe,et al.  Asymptotic Equivalence of Bayes Cross Validation and Widely Applicable Information Criterion in Singular Learning Theory , 2010, J. Mach. Learn. Res..

[44]  David E. Jones,et al.  Spatial Analysis of Preterm Birth Demonstrates Opportunities for Targeted Intervention , 2012, Maternal and Child Health Journal.

[45]  Hadley Wickham,et al.  ggplot2 - Elegant Graphics for Data Analysis (2nd Edition) , 2017 .