Spatiotemporal incidence rate data analysis by nonparametric regression

To monitor the incidence rates of cancers, AIDS, cardiovascular diseases, and other chronic or infectious diseases, some global, national, and regional reporting systems have been built to collect/provide population-based data about the disease incidence. Such databases usually report daily, monthly, or yearly disease incidence numbers at the city, county, state, or country level, and the disease incidence numbers collected at different places and different times are often correlated, with the ones closer in place or time being more correlated. The correlation reflects the impact of various confounding risk factors, such as weather, demographic factors, lifestyles, and other cultural and environmental factors. Because such impact is complicated and challenging to describe, the spatiotemporal (ST) correlation in the observed disease incidence data has complicated ST structure as well. Furthermore, the ST correlation is hidden in the observed data and cannot be observed directly. In the literature, there has been some discussion about ST data modeling. But, the existing methods either impose various restrictive assumptions on the ST correlation that are hard to justify, or ignore partially or entirely the ST correlation. This paper aims to develop a flexible and effective method for ST disease incidence data modeling, using nonparametric local smoothing methods. This method can properly accommodate the ST data correlation. Theoretical justifications and numerical studies show that it works well in practice.

[1]  K. Kafadar,et al.  Smoothing geographical data, particularly rates of disease. , 1996, Statistics in medicine.

[2]  Gerard B. M. Heuvelink,et al.  Space-Time Geostatistics for Geography: A Case Study of Radiation Monitoring Across Parts of Germany , 2010 .

[3]  Olaf Berke,et al.  Exploratory disease mapping: kriging the spatial risk function from regional count data , 2004 .

[4]  Andrew O. Finley,et al.  spBayes for Large Univariate and Multivariate Point-Referenced Spatio-Temporal Data Models , 2013, 1310.8192.

[5]  Yuhong Yang,et al.  Nonparametric Regression with Correlated Errors , 2001 .

[6]  G Christakos,et al.  A study of the breast cancer dynamics in North Carolina. , 1997, Social science & medicine.

[7]  N. Andersson,et al.  International Journal of Health Geographics Epidemiological Geomatics in Evaluation of Mine Risk Education in Afghanistan: Introducing Population Weighted Raster Maps , 2006 .

[8]  Osamu Kurita,et al.  Approximate formulas of average distances associated with regions and their applications to location problems , 1991, Math. Program..

[9]  Naomi Altman,et al.  Kernel Smoothing of Data with Correlated Errors , 1990 .

[10]  R. Webster,et al.  Binomial cokriging for estimating and mapping the risk of childhood cancer. , 1998, IMA journal of mathematics applied in medicine and biology.

[11]  Johan A. K. Suykens,et al.  Kernel Regression in the Presence of Correlated Errors , 2011, J. Mach. Learn. Res..

[12]  Yves Rosseel,et al.  neuRosim: An R Package for Generating fMRI Data , 2011 .

[13]  N. Cressie,et al.  Classes of nonseparable, spatio-temporal stationary covariance functions , 1999 .

[14]  Maged N Kamel Boulos,et al.  International Journal of Health Geographics Open Access towards Evidence-based, Gis-driven National Spatial Health Information Infrastructure and Surveillance Services in the United Kingdom , 2022 .

[15]  A Rodriguez-Bachiller,et al.  Errors in the Measurement of Spatial Distances between Discrete Regions , 1983 .

[16]  Peter J. Diggle,et al.  Spatial and spatio-temporal Log-Gaussian Cox processes:extending the geostatistical paradigm , 2013, 1312.6536.

[17]  C. Loader Bandwidth selection: classical or plug-in? , 1999 .

[18]  Bo Li,et al.  Nonparametric Estimation of Spatial and Space-Time Covariance Function , 2013 .