Optimal Allocation Rule for Infectious Disease Prevention Under Partial Interference and Outcome-Oriented Targets With Repeated Cross-Sectional Surveys

When developing allocation policies for infectious disease prevention, policymakers often set outcome-oriented targets and measure the policy’s progress with repeated cross-sectional surveys. For example, the United Nations’ Integrated Global Action Plan for Pneumonia and Diarrhea (GAPPD) recommended increasing water, sanitation, and hygiene (WASH) resources to reduce childhood diarrhea incidence and outlined a specific level of diarrhea incidence to achieve. For policymakers, the goal is to design an allocation rule that optimally allocates WASH resources to meet the outcome targets. The paper develops methods to estimate an optimal allocation rule that achieves these pre-defined outcome-oriented targets using repeated cross-sectional surveys. The estimated allocation rule helps policymakers optimally decide what fraction of units in a region should get the resources based on its characteristics. Critically, our estimated policies account for spillover effects within regions in the form of partial interference, and we characterize the policies’ performance in terms of excess risk. We apply our methods to design Senegal’s WASH policy to prevent diarrheal diseases using the 2014-2017 Demographic and Health Survey. We show that our policy not only outperforms competing policies but also provides new insights about how Senegal should design WASH policies to achieve its outcome targets.

[1]  Menggang Yu,et al.  Policy Learning for Optimal Individualized Dose Intervals , 2022, AISTATS.

[2]  G. V. Houngbonon,et al.  Access to electricity and digital inclusion: evidence from mobile call detail records , 2021, Humanities and Social Sciences Communications.

[3]  Guanyi Wang,et al.  Who should get vaccinated? Individualized allocation of vaccines over SIR network , 2020, Journal of Econometrics.

[4]  Falco J. Bargagli Stoffi,et al.  Heterogeneous Treatment and Spillover Effects Under Clustered Network Interference , 2020, SSRN Electronic Journal.

[5]  Erica E. M. Moodie,et al.  Doubly Robust Estimation of Optimal Dosing Strategies , 2020 .

[6]  Hyunseung Kang,et al.  Efficient Semiparametric Estimation of Network Treatment Effects Under Partial Interference , 2020, 2004.08950.

[7]  Brandon M. Greenwell,et al.  Multivariate Adaptive Regression Splines , 2019, Hands-On Machine Learning with R.

[8]  M. Elizabeth Halloran,et al.  Analysis of Vaccine Studies and Causal Inference , 2019, Handbook of Infectious Disease Data Analysis.

[9]  Robert Opiro,et al.  Prevalence of diarrhoea and associated risk factors among children under five years old in Pader District, northern Uganda , 2019, BMC infectious diseases.

[10]  Davide Viviano,et al.  Policy Targeting under Network Interference , 2019, Review of Economic Studies.

[11]  C. McMichael Water, Sanitation and Hygiene (WASH) in Schools in Low-Income Countries: A Review of Evidence of Impact , 2019, International journal of environmental research and public health.

[12]  Wenbin Lu,et al.  Modelling and estimation for optimal treatment decision with interference , 2019, Stat.

[13]  S. Beyene,et al.  Prevalence of Diarrhea and Associated Factors among Under Five Years Children in Harena Buluk Woreda Oromia Region, South East Ethiopia, 2018 , 2018, Journal of Public Health International.

[14]  Francisco Herrera,et al.  Learning from Imbalanced Data Sets , 2018, Springer International Publishing.

[15]  Mohammad Ali,et al.  Doubly robust estimation in observational studies with partial interference , 2018, Stat.

[16]  A. Hubbard,et al.  A Randomized Controlled Trial to Measure Spillover Effects of a Combined Water, Sanitation, and Handwashing Intervention in Rural Bangladesh , 2018, American journal of epidemiology.

[17]  J. Robins,et al.  Double/Debiased Machine Learning for Treatment and Structural Parameters , 2017 .

[18]  Iv'an D'iaz,et al.  Targeted learning ensembles for optimal individualized treatment rules with time-to-event outcomes. , 2017, Biometrika.

[19]  M. J. van der Laan,et al.  Semi-Parametric Estimation and Inference for the Mean Outcome of the Single Time-Point Intervention in a Causally Connected Population , 2016, Journal of causal inference.

[20]  Donglin Zeng,et al.  Personalized Dose Finding Using Outcome Weighted Learning , 2016, Journal of the American Statistical Association.

[21]  Mark J van der Laan,et al.  Optimal Individualized Treatments in Resource-Limited Settings , 2016, The international journal of biostatistics.

[22]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[23]  Eric B. Laber,et al.  Tree-based methods for individualized treatment regimes. , 2015, Biometrika.

[24]  J. Critchley,et al.  Hand washing promotion for preventing diarrhoea , 2015, The Cochrane database of systematic reviews.

[25]  Andreas Ziegler,et al.  ranger: A Fast Implementation of Random Forests for High Dimensional Data in C++ and R , 2015, 1508.04409.

[26]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[27]  D. Rubin,et al.  Causal Inference for Statistics, Social, and Biomedical Sciences: An Introduction , 2016 .

[28]  Toru Kitagawa,et al.  Who should be Treated? Empirical Welfare Maximization Methods for Treatment Choice , 2015 .

[29]  Pierre Magal,et al.  Susceptible-infectious-recovered models revisited: from the individual level to the population level. , 2014, Mathematical biosciences.

[30]  Michael G. Hudgens,et al.  Large Sample Randomization Inference of Causal Effects in the Presence of Interference , 2014, Journal of the American Statistical Association.

[31]  Peter M. Aronow,et al.  Estimating Average Causal Effects Under Interference Between Units , 2013, 1305.6156.

[32]  C. Yilgwan,et al.  Prevalence of diarrhea disease and risk factors in Jos University Teaching Hospital, Nigeria. , 2012, Annals of African medicine.

[33]  Donglin Zeng,et al.  Estimating Individualized Treatment Rules Using Outcome Weighted Learning , 2012, Journal of the American Statistical Association.

[34]  Hilmar Schneider,et al.  Evaluating continuous training programmes by using the generalized propensity score , 2012 .

[35]  Tyler J VanderWeele,et al.  On causal inference in the presence of interference , 2012, Statistical methods in medical research.

[36]  José Manuel Benítez,et al.  Neural Networks in R Using the Stuttgart Neural Network Simulator: RSNNS , 2012 .

[37]  Alfonso Flores-Lagunes,et al.  Estimating the Effects of Length of Exposure to Instruction in a Training Program: The Case of Job Corps , 2012, Review of Economics and Statistics.

[38]  T. VanderWeele,et al.  Effect partitioning under interference in two-stage randomized vaccine trials. , 2011, Statistics & probability letters.

[39]  S. Murphy,et al.  PERFORMANCE GUARANTEES FOR INDIVIDUALIZED TREATMENT RULES. , 2011, Annals of statistics.

[40]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[41]  I. Okeke Diarrheagenic Escherichia coli in sub-Saharan Africa: status, uncertainties and necessities. , 2010, Journal of infection in developing countries.

[42]  K. Hirano,et al.  Asymptotics for Statistical Treatment Rules , 2009 .

[43]  Andreas Christmann,et al.  Support vector machines , 2008, Data Mining and Knowledge Discovery Handbook.

[44]  M. Hudgens,et al.  Toward Causal Inference With Interference , 2008, Journal of the American Statistical Association.

[45]  Tamer Rabie,et al.  Interventions to improve water quality for preventing diarrhoea: systematic review and meta-analysis , 2007, BMJ : British Medical Journal.

[46]  Michael E. Sobel,et al.  What Do Randomized Studies of Housing Mobility Demonstrate? , 2006 .

[47]  Keying Ye,et al.  Applied Bayesian Modeling and Causal Inference From Incomplete-Data Perspectives , 2005, Technometrics.

[48]  G. Imbens,et al.  The Propensity Score with Continuous Treatments , 2005 .

[49]  R. Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004 .

[50]  Jon Kleinberg,et al.  Maximizing the spread of influence through a social network , 2003, KDD '03.

[51]  S. Murphy,et al.  Optimal dynamic treatment regimes , 2003 .

[52]  William Easterly The cartel of good intentions: The problem of bureaucracy in foreign aid , 2002 .

[53]  Duncan J Watts,et al.  A simple model of global cascades on random networks , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[54]  Bernhard Schölkopf,et al.  A Generalized Representer Theorem , 2001, COLT/EuroCOLT.

[55]  Le Thi Hoai An,et al.  Solving a Class of Linearly Constrained Indefinite Quadratic Problems by D.C. Algorithms , 1997, J. Glob. Optim..

[56]  T. Speed,et al.  On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9 , 1990 .

[57]  D. Rubin [On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9.] Comment: Neyman (1923) and Causal Inference in Experiments and Observational Studies , 1990 .

[58]  Stefan Sperlich,et al.  Generalized Additive Models , 2014 .

[59]  Mark S. Granovetter Threshold Models of Collective Behavior , 1978, American Journal of Sociology.

[60]  B. Matthews Comparison of the predicted and observed secondary structure of T4 phage lysozyme. , 1975, Biochimica et biophysica acta.

[61]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[62]  D. Rubin Estimating causal effects of treatments in randomized and nonrandomized studies. , 1974 .

[63]  G. Wahba,et al.  A Correspondence Between Bayesian Estimation on Stochastic Processes and Smoothing by Splines , 1970 .

[64]  I NICOLETTI,et al.  The Planning of Experiments , 1936, Rivista di clinica pediatrica.

[65]  A. Call Delivering Rural Sanitation Programs at Scale, with Equity and Sustainability , 2019 .

[66]  Mark J. van der Laan,et al.  Causal Inference for a Population of Causally Connected Units , 2014, Journal of causal inference.

[67]  Ingo Steinwart,et al.  Optimal regression rates for SVMs using Gaussian kernels , 2013 .

[68]  B. Chakraborty,et al.  Statistical Methods for Dynamic Treatment Regimes: Reinforcement Learning, Causal Inference, and Personalized Medicine , 2013 .

[69]  Mark J. van der Laan,et al.  Super Learner In Prediction , 2010 .

[70]  M. J. van der Laan,et al.  Statistical Applications in Genetics and Molecular Biology Super Learner , 2010 .

[71]  James M. Robins,et al.  Optimal Structural Nested Models for Optimal Sequential Decisions , 2004 .

[72]  L. Breiman Random Forests , 2001, Machine Learning.

[73]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[74]  R. Feachem,et al.  A case-control study of the impact of improved sanitation on diarrhoea morbidity in Lesotho. , 1990, Bulletin of the World Health Organization.

[75]  S. Esrey,et al.  Interventions for the control of diarrhoeal diseases among young children: improving water supplies and excreta disposal facilities. , 1985, Bulletin of the World Health Organization.

[76]  Donald B. Rubin,et al.  Bayesian Inference for Causal Effects: The Role of Randomization , 1978 .

[77]  G. S. Watson,et al.  Smooth regression analysis , 1964 .

[78]  E. Nadaraya On Estimating Regression , 1964 .