Detecting local risk factors for residual malaria in northern Ghana using Bayesian model averaging

BackgroundThere is a need for comprehensive evaluations of the underlying local factors that contribute to residual malaria in sub-Saharan Africa. However, it is difficult to compare the wide array of demographic, socio-economic, and environmental variables associated with malaria transmission using standard statistical approaches while accounting for seasonal differences and nonlinear relationships. This article uses a Bayesian model averaging (BMA) approach for identifying and comparing potential risk and protective factors associated with residual malaria.ResultsThe relative influence of a comprehensive set of demographic, socio-economic, environmental, and malaria intervention variables on malaria prevalence were modelled using BMA for variable selection. Data were collected in Bunkpurugu-Yunyoo, a rural district in northeast Ghana that experiences holoendemic seasonal malaria transmission, over six biannual surveys from 2010 to 2013. A total of 10,022 children between the ages 6 to 59 months were used in the analysis. Multiple models were developed to identify important risk and protective factors, accounting for seasonal patterns and nonlinear relationships. These models revealed pronounced nonlinear associations between malaria risk and distance from the nearest urban centre and health facility. Furthermore, the association between malaria risk and age and some ethnic groups was significantly different in the rainy and dry seasons. BMA outperformed other commonly used regression approaches in out-of-sample predictive ability using a season-to-season validation approach.ConclusionsThis modelling framework offers an alternative approach to disease risk factor analysis that generates interpretable models, can reveal complex, nonlinear relationships, incorporates uncertainty in model selection, and produces accurate predictions. Certain modelling applications, such as designing targeted local interventions, require more sophisticated statistical methods which are capable of handling a wide range of relevant data while maintaining interpretability and predictive performance, and directly characterize uncertainty. To this end, BMA represents a valuable tool for constructing more informative models for understanding risk factors for malaria, as well as other vector-borne and environmentally mediated diseases.

[1]  Nancy Fullman,et al.  Global malaria mortality between 1980 and 2010: a systematic analysis , 2012, The Lancet.

[2]  Monica F. Myers,et al.  Climate change and the resurgence of malaria in the East African highlands , 2002, Nature.

[3]  J. M. Sloughter,et al.  Probabilistic Quantitative Precipitation Forecasting Using Bayesian Model Averaging , 2007 .

[4]  D B Dunson,et al.  Commentary: practical advantages of Bayesian analysis of epidemiologic data. , 2001, American journal of epidemiology.

[5]  Chein-I Chang,et al.  Semi-Supervised Linear Spectral Unmixing Using a Hierarchical Bayesian Model for Hyperspectral Imagery , 2008, IEEE Transactions on Signal Processing.

[6]  P. Amratia,et al.  Spatial heterogeneity can undermine the effectiveness of country-wide test and treat policy for malaria: a case study from Burkina Faso , 2016, Malaria Journal.

[7]  M. Coosemans,et al.  Residual transmission of malaria : an old issue for new approaches , 2013 .

[8]  E. Seto,et al.  Factors determining the heterogeneity of malaria incidence in children in Kampala, Uganda. , 2008, The Journal of infectious diseases.

[9]  Andrew J Tatem,et al.  The global distribution and population at risk of malaria: past, present, and future. , 2004, The Lancet. Infectious diseases.

[10]  Arantxa Roca-Feltrer,et al.  Estimating the potential public health impact of seasonal malaria chemoprevention in African children , 2012, Nature Communications.

[11]  Adrian E. Raftery,et al.  Bayesian model averaging: a tutorial (with comments by M. Clyde, David Draper and E. I. George, and a rejoinder by the authors , 1999 .

[12]  A. Ghani,et al.  Costs and cost-effectiveness of malaria control interventions - a systematic review , 2011, Malaria Journal.

[13]  K. Lindblade,et al.  The impact of distance of residence from a peripheral health facility on pediatric health utilisation in rural western Kenya , 2009, Tropical medicine & international health : TM & IH.

[14]  J. L. Parra,et al.  Very high resolution interpolated climate surfaces for global land areas , 2005 .

[15]  Andrew Jarvis,et al.  Hole-filled SRTM for the globe Version 4 , 2008 .

[16]  F. Nosten,et al.  Past and new challenges for malaria control and elimination: the role of operational research for innovation in designing interventions , 2015, Malaria Journal.

[17]  A. Pandey,et al.  Socio-economic & household risk factors of malaria in tribal areas of Madhya Pradesh, central India , 2015, The Indian journal of medical research.

[18]  J. Oppong Accommodating the rainy season in Third World location-allocation applications , 1996 .

[19]  R. O’Hara,et al.  A review of Bayesian variable selection methods: what, how and which , 2009 .

[20]  Martin Feldkircher,et al.  Bayesian model averaging employing fixed and flexible priors: The BMS package for R , 2015 .

[21]  J. Michaelsen,et al.  A global satellite-assisted precipitation climatology , 2015 .

[22]  Ameet Bakhai,et al.  Comparison of Bayesian model averaging and stepwise methods for model selection in logistic regression , 2004, Statistics in medicine.

[23]  A. Raftery,et al.  Using Bayesian Model Averaging to Calibrate Forecast Ensembles , 2005 .

[24]  S. Adjei,et al.  Public social policy development and implementation: a case study of the Ghana National Health Insurance scheme. , 2007, Health policy and planning.

[25]  L. F. Chaves,et al.  Malaria resurgence in the East African highlands: temperature trends revisited. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[26]  G. Killeen,et al.  Characterizing, controlling and eliminating residual malaria transmission , 2014, Malaria Journal.

[27]  Stefan Kienberger,et al.  Spatial-explicit modeling of social vulnerability to malaria in East Africa , 2014, International Journal of Health Geographics.

[28]  E. Fèvre,et al.  Shrinking a large dataset to identify variables associated with increased risk of Plasmodium falciparum infection in Western Kenya , 2015, Epidemiology and Infection.

[29]  U. Dalrymple,et al.  The effect of malaria control on Plasmodium falciparum in Africa between 2000 and 2015 , 2015, Nature.

[30]  Adel Javanmard,et al.  Confidence intervals and hypothesis testing for high-dimensional regression , 2013, J. Mach. Learn. Res..

[31]  S. Adjei,et al.  Spatial variation of malaria incidence in young children from a geographically homogeneous area with high endemicity. , 2008, The Journal of infectious diseases.

[32]  D. Madigan,et al.  Bayesian Model Averaging in Proportional Hazard Models: Assessing the Risk of a Stroke , 1997 .

[33]  Andrew J Tatem,et al.  Fine-scale malaria risk mapping from routine aggregated case data , 2014, Malaria Journal.

[34]  Nimako Sarpong,et al.  Principal component analysis of socioeconomic factors and their association with malaria in children from the Ashanti Region, Ghana , 2010, Malaria Journal.

[35]  E. Worrall,et al.  The burden of malaria epidemics and cost-effectiveness of interventions in epidemic situations in Africa. , 2004, The American journal of tropical medicine and hygiene.

[36]  Brendan A. Wintle,et al.  The Use of Bayesian Model Averaging to Better Represent Uncertainty in Ecological Models , 2003 .

[37]  B. Uzochukwu,et al.  Influence of education and knowledge on perceptions and practices to control malaria in Southeast Nigeria. , 2006, Social science & medicine.

[38]  D. Guwatudde,et al.  Urban malaria: primary caregivers’ knowledge, attitudes, practices and predictors of malaria incidence in a cohort of Ugandan children , 2003, Tropical medicine & international health : TM & IH.

[39]  Alan M MacEachren,et al.  It’s a long, long walk: accessibility to hospitals, maternity and integrated health centers in Niger , 2012, International Journal of Health Geographics.

[40]  Adrian E. Raftery,et al.  Bayesian Model Averaging , 1998 .

[41]  Michael Hagenlocher,et al.  Mapping malaria risk and vulnerability in the United Republic of Tanzania: a spatial explicit model , 2015, Population Health Metrics.

[42]  Christopher Holmes,et al.  Bayesian Methods for Nonlinear Classification and Regressing , 2002 .

[43]  D. Boakye,et al.  A reduction in malaria transmission intensity in Northern Ghana after 7 years of indoor residual spraying , 2017, Malaria Journal.

[44]  N. Speybroeck,et al.  Ranking Malaria Risk Factors to Guide Malaria Control Efforts in African Highlands , 2009, PloS one.

[45]  Michael A Babyak,et al.  What You See May Not Be What You Get: A Brief, Nontechnical Introduction to Overfitting in Regression-Type Models , 2004, Psychosomatic medicine.

[46]  B. Pond Malaria indicator surveys demonstrate a markedly lower prevalence of malaria in large cities of sub-Saharan Africa , 2013, Malaria Journal.

[47]  Adrian E. Raftery,et al.  Bayesian model averaging: development of an improved multi-class, gene selection and classification tool for microarray data , 2005, Bioinform..

[48]  P. Vounatsou,et al.  Malaria risk in Nigeria: Bayesian geostatistical modelling of 2010 malaria indicator survey data , 2015, Malaria Journal.

[49]  C. Källestål,et al.  Geographical accessibility and spatial coverage modeling of the primary health care network in the Western Province of Rwanda , 2012, International Journal of Health Geographics.

[50]  Identifying risk factors for Plasmodium infection and anaemia in Kinshasa, Democratic Republic of Congo , 2016, Malaria Journal.

[51]  R. Tibshirani,et al.  A SIGNIFICANCE TEST FOR THE LASSO. , 2013, Annals of statistics.

[52]  D. Posada,et al.  Model selection and model averaging in phylogenetics: advantages of akaike information criterion and bayesian approaches over likelihood ratio tests. , 2004, Systematic biology.

[53]  B. Greenwood,et al.  Socio-economic risk factors for malaria in a peri-urban area of The Gambia. , 1995, Transactions of the Royal Society of Tropical Medicine and Hygiene.

[54]  Mevin B. Hooten,et al.  A guide to Bayesian model selection for ecologists , 2015 .

[55]  Fabrice Rossi,et al.  Lasso based feature selection for malaria risk exposure prediction , 2015, MLDM 2015.

[56]  G. Casella,et al.  Penalized regression, standard errors, and Bayesian lassos , 2010 .

[57]  Chris Newbold,et al.  Relation between severe malaria morbidity in children and level of Plasmodium falciparum transmission in Africa , 1997, The Lancet.

[58]  Anna Genell,et al.  Model selection in Medical Research: A simulation study comparing Bayesian Model Averaging and Stepwise Regression , 2010, BMC medical research methodology.

[59]  R. Snow,et al.  A climate-based distribution model of malaria transmission in sub-Saharan Africa. , 1999, Parasitology today.

[60]  B. Greenwood Review: Intermittent preventive treatment – a new approach to the prevention of malaria in children in areas with seasonal malaria transmission , 2006, Tropical medicine & international health : TM & IH.

[61]  Martin Krzywinski,et al.  Points of significance: P values and the search for significance , 2016, Nature Methods.

[62]  Victor A Alegana,et al.  The changing risk of Plasmodium falciparum malaria infection in Africa: 2000–10: a spatial and temporal analysis of transmission intensity , 2014, The Lancet.

[63]  Adrian E. Raftery,et al.  Bayesian Model Averaging: A Tutorial , 2016 .

[64]  F. Vahidnia,et al.  Relationship between HIV risk perception and condom use: Evidence from a population-based survey in Mozambique. , 2006, International family planning perspectives.

[65]  Daniel Chandramohan,et al.  Epidemiology of malaria in the forest-savanna transitional zone of Ghana , 2009, Malaria Journal.

[66]  P. D. de Vries,et al.  Malaria Prevalence, Spatial Clustering and Risk Factors in a Low Endemic Area of Eastern Rwanda: A Cross Sectional Study , 2013, PloS one.

[67]  P. Gething,et al.  Re-examining environmental correlates of Plasmodium falciparum malaria endemicity: a data-intensive variable selection approach , 2015, Malaria Journal.

[68]  J V Tu,et al.  Advantages and disadvantages of using artificial neural networks versus logistic regression for predicting medical outcomes. , 1996, Journal of clinical epidemiology.

[69]  Bani K. Mallick,et al.  Hyperspectral remote sensing of plant biochemistry using Bayesian model averaging with variable and band selection , 2013 .

[70]  R. Snow,et al.  Shrinking the malaria map: progress and prospects , 2010, Lancet.

[71]  Refik Soyer,et al.  Bayesian Methods for Nonlinear Classification and Regression , 2004, Technometrics.

[72]  D. Ayele,et al.  Prevalence and risk factors of malaria in Ethiopia , 2012, Malaria Journal.

[73]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[74]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[75]  L. Kazembe,et al.  Using Structured Additive Regression Models to Estimate Risk Factors of Malaria: Analysis of 2010 Malawi Malaria Indicator Survey Data , 2014, PloS one.

[76]  Sandro Galea,et al.  Urbanization, urbanicity, and health , 2002, Journal of Urban Health.

[77]  S. Richardson,et al.  Variable selection and Bayesian model averaging in case‐control studies , 2001, Statistics in medicine.

[78]  P. Green Reversible jump Markov chain Monte Carlo computation and Bayesian model determination , 1995 .

[79]  J. Rose,et al.  Characterizing the Risk of Infection from Mycobacterium tuberculosis in Commercial Passenger Aircraft Using Quantitative Microbial Risk Assessment , 2009, Risk analysis : an official publication of the Society for Risk Analysis.