Optimal Allocation of Water and Sanitation Facilities To Prevent Communicable Diarrheal Diseases in Senegal Under Partial Interference

For several decades, Senegal has faced inadequate water, sanitation, and hygiene (WASH) facilities in households, contributing to persistent, high levels of communicable diarrheal diseases. Unfortunately, the ideal WASH policy where every household in Senegal installs WASH facilities is impossible due to logistical and budgetary concerns. This work proposes to estimate an optimal allocation rule of WASH facilities in Senegal by combining recent advances in personalized medicine and partial interference in causal inference. Our allocation rule helps public health officials in Senegal decide what fraction of total households in a region should get WASH facilities based on block-level and household-level characteristics. We characterize the excess risk of the allocation rule and show that our rule outperforms other allocation policies in Senegal.

[1]  Mohammad Ali,et al.  Doubly robust estimation in observational studies with partial interference , 2018, Stat.

[2]  Eric B. Laber,et al.  A Robust Method for Estimating Optimal Treatment Regimes , 2012, Biometrics.

[3]  William Easterly The cartel of good intentions: The problem of bureaucracy in foreign aid , 2002 .

[4]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[5]  Andreas Christmann,et al.  Support vector machines , 2008, Data Mining and Knowledge Discovery Handbook.

[6]  Donglin Zeng,et al.  Estimating Individualized Treatment Rules Using Outcome Weighted Learning , 2012, Journal of the American Statistical Association.

[7]  Mark J. van der Laan,et al.  Super Learner In Prediction , 2010 .

[8]  R. Tibshirani,et al.  Generalized Additive Models , 1986 .

[9]  T. Speed,et al.  On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9 , 1990 .

[10]  G. Imbens,et al.  The Propensity Score with Continuous Treatments , 2005 .

[11]  S. Esrey,et al.  Interventions for the control of diarrhoeal diseases among young children: improving water supplies and excreta disposal facilities. , 1985, Bulletin of the World Health Organization.

[12]  C. McMichael Water, Sanitation and Hygiene (WASH) in Schools in Low-Income Countries: A Review of Evidence of Impact , 2019, International journal of environmental research and public health.

[13]  Oleg Sofrygin,et al.  Semi-Parametric Estimation and Inference for the Mean Outcome of the Single Time-Point Intervention in a Causally Connected Population , 2016, Journal of causal inference.

[14]  José Manuel Benítez,et al.  Neural Networks in R Using the Stuttgart Neural Network Simulator: RSNNS , 2012 .

[15]  B. Matthews Comparison of the predicted and observed secondary structure of T4 phage lysozyme. , 1975, Biochimica et biophysica acta.

[16]  C. Yilgwan,et al.  Prevalence of diarrhea disease and risk factors in Jos University Teaching Hospital, Nigeria. , 2012, Annals of African medicine.

[17]  S. Beyene,et al.  Prevalence of Diarrhea and Associated Factors among Under Five Years Children in Harena Buluk Woreda Oromia Region, South East Ethiopia, 2018 , 2018, Journal of Public Health International.

[18]  G. Wahba,et al.  A Correspondence Between Bayesian Estimation on Stochastic Processes and Smoothing by Splines , 1970 .

[19]  Bernhard Schölkopf,et al.  A Generalized Representer Theorem , 2001, COLT/EuroCOLT.

[20]  Eric B. Laber,et al.  Interactive model building for Q-learning. , 2014, Biometrika.

[21]  Naoki Egami Spillover Effects in the Presence of Unobserved Networks , 2017, Political Analysis.

[22]  A. Call Delivering Rural Sanitation Programs at Scale, with Equity and Sustainability , 2019 .

[23]  Peter M. Aronow,et al.  Estimating Average Causal Effects Under Interference Between Units , 2013, 1305.6156.

[24]  Susan Murphy,et al.  Inference for non-regular parameters in optimal dynamic treatment regimes , 2010, Statistical methods in medical research.

[25]  M. J. van der Laan,et al.  Statistical Applications in Genetics and Molecular Biology Super Learner , 2010 .

[26]  Prevalence of diarrhoea and associated risk factors among children under five years old in Pader District, northern Uganda , 2020, BMC infectious diseases.

[27]  Davide Viviano,et al.  Policy Targeting under Network Interference , 2019, Review of Economic Studies.

[28]  M. Elizabeth Halloran,et al.  Analysis of Vaccine Studies and Causal Inference , 2019, Handbook of Infectious Disease Data Analysis.

[29]  Tyler J VanderWeele,et al.  On causal inference in the presence of interference , 2012, Statistical methods in medical research.

[30]  Myron Anthony Godinho,et al.  Global, regional, and national progress towards Sustainable Development Goal 3.2 for neonatal and child health: all-cause and cause-specific mortality findings from the Global Burden of Disease Study 2019. , 2021, Lancet.

[31]  T. VanderWeele,et al.  Effect partitioning under interference in two-stage randomized vaccine trials. , 2011, Statistics & probability letters.

[32]  L. Huicho,et al.  Drivers of the reduction in childhood diarrhea mortality 1980-2015 and interventions to eliminate preventable diarrhea deaths by 2030 , 2019, Journal of global health.

[33]  S. Murphy,et al.  Optimal dynamic treatment regimes , 2003 .

[34]  D. Ruppert The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004 .

[35]  R. Feachem,et al.  A case-control study of the impact of improved sanitation on diarrhoea morbidity in Lesotho. , 1990, Bulletin of the World Health Organization.

[36]  Elizabeth L. Ogburn,et al.  Causal Inference for Social Network Data , 2017, Journal of the American Statistical Association.

[37]  G. V. Houngbonon,et al.  Access to electricity and digital inclusion: evidence from mobile call detail records , 2021, Humanities and Social Sciences Communications.

[38]  M. Hudgens,et al.  On inverse probability-weighted estimators in the presence of interference , 2016, Biometrika.

[39]  Tamer Rabie,et al.  Interventions to improve water quality for preventing diarrhoea: systematic review and meta-analysis , 2007, BMJ : British Medical Journal.

[40]  Donald B. Rubin,et al.  Bayesian Inference for Causal Effects: The Role of Randomization , 1978 .

[41]  Alfonso Flores-Lagunes,et al.  Estimating the Effects of Length of Exposure to Instruction in a Training Program: The Case of Job Corps , 2012, Review of Economics and Statistics.

[42]  James M. Robins,et al.  Optimal Structural Nested Models for Optimal Sequential Decisions , 2004 .

[43]  Ingo Steinwart,et al.  Optimal regression rates for SVMs using Gaussian kernels , 2013 .

[44]  Bibhas Chakraborty,et al.  Q‐learning for estimating optimal dynamic treatment rules from observational data , 2012, The Canadian journal of statistics = Revue canadienne de statistique.

[45]  A. Hubbard,et al.  A Randomized Controlled Trial to Measure Spillover Effects of a Combined Water, Sanitation, and Handwashing Intervention in Rural Bangladesh , 2018, American journal of epidemiology.

[46]  Mark J. van der Laan,et al.  Causal Inference for a Population of Causally Connected Units , 2014, Journal of causal inference.

[47]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[48]  G. Gulis,et al.  Examining school-based hygiene facilities: a quantitative assessment in a Ghanaian municipality , 2018, BMC Public Health.

[49]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[50]  Michael E. Sobel,et al.  What Do Randomized Studies of Housing Mobility Demonstrate? , 2006 .

[51]  M. Kosorok,et al.  Adaptive Treatment Strategies in Practice: Planning Trials and Analyzing Data for Personalized Medicine , 2015 .

[52]  D. Rubin Estimating causal effects of treatments in randomized and nonrandomized studies. , 1974 .

[53]  M R Kosorok,et al.  Penalized Q-Learning for Dynamic Treatment Regimens. , 2011, Statistica Sinica.

[54]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[55]  Erica E. M. Moodie,et al.  Doubly Robust Estimation of Optimal Dosing Strategies , 2020 .

[56]  I. Díaz,et al.  Targeted learning ensembles for optimal individualized treatment rules with time-to-event outcomes. , 2017, Biometrika.

[57]  J. Robins,et al.  Double/Debiased Machine Learning for Treatment and Structural Parameters , 2017 .

[58]  Nema Dean,et al.  Q-Learning: Flexible Learning About Useful Utilities , 2013, Statistics in Biosciences.

[59]  I. Okeke Diarrheagenic Escherichia coli in sub-Saharan Africa: status, uncertainties and necessities. , 2010, Journal of infection in developing countries.

[60]  D. Rubin,et al.  Causal Inference for Statistics, Social, and Biomedical Sciences: An Introduction , 2016 .

[61]  J. Bartram,et al.  On-plot drinking water supplies and health: A systematic review. , 2016, International journal of hygiene and environmental health.

[62]  M. Hudgens,et al.  Toward Causal Inference With Interference , 2008, Journal of the American Statistical Association.

[63]  Andreas Ziegler,et al.  ranger: A Fast Implementation of Random Forests for High Dimensional Data in C++ and R , 2015, 1508.04409.

[64]  Mirko S. Winkler,et al.  Prevalence of diarrhoea and risk factors among children under five years old in Mbour, Senegal: a cross-sectional study , 2017, Infectious Diseases of Poverty.

[65]  Edoardo M. Airoldi,et al.  Identification and Estimation of Treatment and Interference Effects in Observational Studies on Networks , 2016, Journal of the American Statistical Association.

[66]  J. Critchley,et al.  Hand washing promotion for preventing diarrhoea , 2015, The Cochrane database of systematic reviews.

[67]  Falco J. Bargagli-Stoffi,et al.  Heterogeneous Treatment and Spillover Effects Under Clustered Network Interference , 2020, 2008.00707.

[68]  Michael G. Hudgens,et al.  Large Sample Randomization Inference of Causal Effects in the Presence of Interference , 2014, Journal of the American Statistical Association.

[69]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[70]  J. Freidman,et al.  Multivariate adaptive regression splines , 1991 .

[71]  M. Kosorok,et al.  Reinforcement learning design for cancer clinical trials , 2009, Statistics in medicine.

[72]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[73]  Le Thi Hoai An,et al.  Solving a Class of Linearly Constrained Indefinite Quadratic Problems by D.C. Algorithms , 1997, J. Glob. Optim..

[74]  B. Chakraborty,et al.  Statistical Methods for Dynamic Treatment Regimes: Reinforcement Learning, Causal Inference, and Personalized Medicine , 2013 .

[75]  Wenbin Lu,et al.  Modelling and estimation for optimal treatment decision with interference , 2019, Stat.

[76]  S. Murphy,et al.  PERFORMANCE GUARANTEES FOR INDIVIDUALIZED TREATMENT RULES. , 2011, Annals of statistics.

[77]  Donglin Zeng,et al.  Personalized Dose Finding Using Outcome Weighted Learning , 2016, Journal of the American Statistical Association.

[78]  Min Zhang,et al.  Estimating optimal treatment regimes from a classification perspective , 2012, Stat.

[79]  Eric B. Laber,et al.  Tree-based methods for individualized treatment regimes. , 2015, Biometrika.

[80]  I NICOLETTI,et al.  The Planning of Experiments , 1936, Rivista di clinica pediatrica.

[81]  D. Rubin [On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9.] Comment: Neyman (1923) and Causal Inference in Experiments and Observational Studies , 1990 .

[82]  Avi Feller,et al.  Analyzing Two-Stage Experiments in the Presence of Interference , 2016, 1608.06805.