Adaptive geostatistical design and analysis for prevalence surveys

Non-adaptive geostatistical designs (NAGDs) offer standard ways of collecting and analysing geostatistical data in which sampling locations are fixed in advance of any data collection. In contrast, adaptive geostatistical designs (AGDs) allow collection of geostatistical data over time to depend on information obtained from previous information to optimise data collection towards the analysis objective. AGDs are becoming more important in spatial mapping, particularly in poor resource settings where uniformly precise mapping may be unrealistically costly and the priority is often to identify critical areas where interventions can have the most health impact. Two constructions are: singleton and batch adaptive sampling. In singleton sampling, locations xi are chosen sequentially and at each stage, xk+1 depends on data obtained at locations x1,…,xk. In batch sampling, locations are chosen in batches of size b>1, allowing each new batch, {x(k+1),…,x(k+b)}, to depend on data obtained at locations x1,…,xkb. In most settings, batch sampling is more realistic than singleton sampling. We propose specific batch AGDs and assess their efficiency relative to their singleton adaptive and non-adaptive counterparts using simulations. We then show how we are applying these findings to inform an AGD of a rolling Malaria Indicator Survey, part of a large-scale, five-year malaria transmission reduction project in Malawi.

[1]  P. Diggle,et al.  Model‐based geostatistics , 2007 .

[2]  Andrew J Tatem,et al.  Assembling a global database of malaria parasite prevalence for the Malaria Atlas Project , 2007, Malaria Journal.

[3]  M. Stein,et al.  Spatial sampling design for prediction with estimated parameters , 2006 .

[4]  David L. Smith,et al.  A World Malaria Map: Plasmodium falciparum Endemicity in 2007 , 2009, PLoS medicine.

[5]  J. Andrew Royle,et al.  An algorithm for the construction of spatial coverage designs with implementation in SPLUS , 1998 .

[6]  Peter J. Diggle,et al.  Bayesian Geostatistical Design , 2006 .

[7]  David Russo,et al.  Design of an Optimal Sampling Network for Estimating the Variogram , 1984 .

[8]  W. G. Müller,et al.  Optimal designs for variogram estimation , 1999 .

[9]  K. Ritter Asymptotic optimality of regular sequence designs , 1996 .

[10]  P. Cazenave,et al.  Total and functional parasite specific IgE responses in Plasmodium falciparum-infected patients exhibiting different clinical status , 2007, Malaria Journal.

[11]  P. Diggle,et al.  The geographic distribution of onchocerciasis in the 20 participating countries of the African Programme for Onchocerciasis Control: (2) pre-control endemicity levels and estimated number infected , 2014, Parasites & Vectors.

[12]  Stamatis Cambanis,et al.  Sampling Designs for Estimation of a Random Process , 1993, Proceedings. IEEE International Symposium on Information Theory.

[13]  Andrew J Tatem,et al.  Correction: A World Malaria Map: Plasmodium falciparum Endemicity in 2007 , 2009, PLoS Medicine.

[14]  Evangelos A. Yfantis,et al.  Efficiency of kriging estimation for square, triangular, and hexagonal grids , 1987 .

[15]  D G Krige,et al.  A statistical approach to some mine valuation and allied problems on the Witwatersrand , 2015 .

[16]  Dianne J Terlouw,et al.  Rolling Malaria Indicator Surveys (rMIS): a potential district-level malaria monitoring and evaluation (M&E) tool for program managers. , 2012, The American journal of tropical medicine and hygiene.

[17]  Agnes M. Herzberg,et al.  Equally spaced design points in polynomial regression: A comparison of systematic sampling methods with the optimal design of experiments , 1984 .

[18]  J. Chilès,et al.  Geostatistics: Modeling Spatial Uncertainty , 1999 .

[19]  D. Dunson,et al.  Bayesian geostatistical modelling with informative sampling locations. , 2011, Biometrika.

[20]  D. Russo,et al.  Optimal spatial sampling design for the estimation of the variogram based on a least squares approach , 1999 .

[21]  Bingbo Gao,et al.  A stratified optimization method for a multivariate marine environmental monitoring network in the Yangtze River estuary and its adjacent sea , 2015, Int. J. Geogr. Inf. Sci..

[22]  S. Cambanis,et al.  Sampling Designs for Estimating Integrals of Stochastic Processes , 1992 .

[23]  Peter J. Diggle,et al.  PrevMap:An R Package for Prevalence Mapping , 2017 .

[24]  Andrew J Tatem,et al.  The global distribution and population at risk of malaria: past, present, and future. , 2004, The Lancet. Infectious diseases.

[25]  Werner G. Müller,et al.  Collecting Spatial Data: Optimum Design of Experiments for Random Fields , 1998 .

[26]  Jinfeng Wang,et al.  A spatial sampling optimization package using MSN theory , 2011, Environ. Model. Softw..

[27]  Alan E Gelfand,et al.  On the effect of preferential sampling in spatial prediction , 2012, Environmetrics.

[28]  James V. Zidek,et al.  Reducing estimation bias in adaptively changing monitoring networks with preferential site selection , 2014, 1412.1303.

[29]  Alex B. McBratney,et al.  The design of optimal sampling schemes for local estimation and mapping of of regionalized variables—I: Theory and method , 1981 .

[30]  Peter J. Diggle,et al.  Statistical Analysis of Spatial and Spatio-Temporal Point Patterns , 2013 .

[31]  David L. Smith,et al.  A Long Neglected World Malaria Map: Plasmodium vivax Endemicity in 2010 , 2012, PLoS neglected tropical diseases.

[32]  Linda M Collins,et al.  Adaptive sampling in research on risk-related behaviors. , 2002, Drug and alcohol dependence.

[33]  P. Diggle,et al.  Model-based geostatistics (with discussion). , 1998 .

[34]  R. Lark,et al.  Developing methods to improve sampling efficiency for automated soil mapping , 2005 .

[35]  Robert Haining,et al.  Sample surveying to estimate the mean of a heterogeneous surface: reducing the error variance through zoning , 2010, Int. J. Geogr. Inf. Sci..

[36]  Werner G. Müller,et al.  A comparison of spatial design methods for correlated observations , 2005 .

[37]  Douglas W. Nychka,et al.  Design of Air-Quality Monitoring Networks , 1998 .

[38]  J R Aboal,et al.  The effect of sampling design on extensive bryomonitoring surveys of air pollution. , 2005, The Science of the total environment.

[39]  James V. Zidek,et al.  A case study in preferential sampling: Long term monitoring of air pollution in the UK , 2014 .

[40]  Peter J. Diggle,et al.  Geostatistical analysis of binomial data: generalised linear or transformed Gaussian modelling? , 2013 .

[41]  P. Diggle,et al.  Geostatistical inference under preferential sampling , 2010 .