Refining Coarse-grained Spatial Data using Auxiliary Spatial Data Sets with Various Granularities

We propose a probabilistic model for refining coarse-grained spatial data by utilizing auxiliary spatial data sets. Existing methods require that the spatial granularities of the auxiliary data sets are the same as the desired granularity of target data. The proposed model can effectively make use of auxiliary data sets with various granularities by hierarchically incorporating Gaussian processes. With the proposed model, a distribution for each auxiliary data set on the continuous space is modeled using a Gaussian process, where the representation of uncertainty considers the levels of granularity. The fine-grained target data are modeled by another Gaussian process that considers both the spatial correlation and the auxiliary data sets with their uncertainty. We integrate the Gaussian process with a spatial aggregation process that transforms the fine-grained target data into the coarse-grained target data, by which we can infer the fine-grained target Gaussian process from the coarse-grained data. Our model is designed such that the inference of model parameters based on the exact marginal likelihood is possible, in which the variables of fine-grained target and auxiliary data are analytically integrated out. Our experiments on real-world spatial data sets demonstrate the effectiveness of the proposed model.

[1]  Stephen L. Rathbun,et al.  Spatial modelling in irregularly shaped regions: Kriging estuaries , 1998 .

[2]  Xi Liu,et al.  MUSCAT: Multi-Scale Spatio-Temporal Learning with Application to Climate Modeling , 2018, IJCAI.

[3]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[4]  P. Goovaerts Combining Areal and Point Data in Geostatistical Interpolation: Applications to Soil Science and Medical Geography , 2010, Mathematical geosciences.

[5]  Andrew J Tatem,et al.  Fine-scale malaria risk mapping from routine aggregated case data , 2014, Malaria Journal.

[6]  Licia Capra,et al.  Beyond the Baseline: Establishing the Value in Mobile Phone Based Poverty Estimates , 2016, WWW.

[7]  Jorge Nocedal,et al.  On the limited memory BFGS method for large scale optimization , 1989, Math. Program..

[8]  Jon Wakefield,et al.  Pointless spatial modeling. , 2018, Biostatistics.

[9]  P. Kyriakidis A Geostatistical Framework for Area-to-Point Spatial Interpolation , 2004 .

[10]  Walter Jetz,et al.  Downscaling of species distribution models: a hierarchical approach , 2013 .

[11]  Ming Li,et al.  Forecasting Fine-Grained Air Quality Based on Big Data , 2015, KDD.

[12]  H. Storch,et al.  The Analog Method as a Simple Statistical Downscaling Technique: Comparison with More Complicated Methods , 1999 .

[13]  Roger Woodard,et al.  Interpolation of Spatial Data: Some Theory for Kriging , 1999, Technometrics.

[14]  P. Whetton,et al.  Guidelines for Use of Climate Scenarios Developed from Statistical Downscaling Methods , 2004 .

[15]  Alexandre Boucher,et al.  Super-resolution land cover mapping with indicator geostatistics , 2006 .

[16]  G. Wotling,et al.  Regionalization of extreme precipitation distribution using the principal components of the topographical environment , 2000 .

[17]  A. Azzouz 2011 , 2020, City.

[18]  Bradley A. Miller,et al.  Impact of Multi-Scale Predictor Selection for Modeling Soil Properties , 2015 .

[19]  Jon Wakefield,et al.  Pointless Continuous Spatial Surface Reconstruction , 2017, 1709.09659.

[20]  Hugh Glaser,et al.  Linked Open Government Data: Lessons from Data.gov.uk , 2012, IEEE Intelligent Systems.

[21]  Gianni Bellocchi,et al.  Multiscale regression model to infer historical temperatures in a central Mediterranean sub-regional area , 2010 .

[22]  Licia Capra,et al.  Poverty on the cheap: estimating poverty maps using aggregated mobile communication networks , 2014, CHI.

[24]  Alexander J. Smola,et al.  Who Supported Obama in 2012?: Ecological Inference through Distribution Regression , 2015, KDD.

[25]  Stephan J. Goetz,et al.  Social and political forces as determinants of poverty: A spatial analysis , 2007 .

[26]  Xing Xie,et al.  Discovering regions of different functions in a city using human mobility and POIs , 2012, KDD.

[27]  Marco De Nadai,et al.  A multi-source dataset of urban life in the city of Milan and the Province of Trentino , 2015, Scientific Data.

[28]  Yu Zheng,et al.  U-Air: when urban air quality inference meets big data , 2013, KDD.

[29]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[30]  Alex J. Cannon Quantile regression neural networks: Implementation in R and application to precipitation downscaling , 2011, Comput. Geosci..

[31]  Tsuyoshi Murata,et al.  {m , 1934, ACML.

[32]  Pabitra Mitra,et al.  Statistical downscaling of precipitation using long short-term memory recurrent neural networks , 2018, Theoretical and Applied Climatology.

[33]  Benjamin M. Taylor,et al.  Continuous inference for aggregated point process data , 2017, 1704.05627.

[34]  Sangram Ganguly,et al.  Generating High Resolution Climate Change Projections through Single Image Super-Resolution: An Abridged Version , 2018, IJCAI.

[35]  António Xavier,et al.  Disaggregating statistical data at the field level: An entropy approach , 2018 .

[36]  Alex Pentland,et al.  Once Upon a Crime: Towards Crime Prediction from Demographics and Mobile Data , 2014, ICMI.

[37]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[38]  Richard E. Howitt,et al.  Spatial disaggregation of agricultural production data using maximum entropy , 2003 .

[39]  Subimal Ghosh,et al.  SVM-PGSL coupled approach for statistical downscaling to predict rainfall from GCM output , 2010 .

[40]  Daniel Krewski,et al.  Spatial analysis of air pollution and mortality in California. , 2013, American journal of respiratory and critical care medicine.

[41]  No-Wook Park,et al.  Spatial Downscaling of TRMM Precipitation Using Geostatistics and Fine Scale Environmental Variables , 2013 .

[42]  Kenji Fukumizu,et al.  Variational Learning on Aggregate Outputs with Gaussian Processes , 2018, NeurIPS.

[43]  Daniel Kifer,et al.  Crime Rate Inference with Big Data , 2016, KDD.

[44]  Taha B. M. J. Ouarda,et al.  Automated regression-based statistical downscaling tool , 2008, Environ. Model. Softw..

[45]  Xiaoou Tang,et al.  Learning a Deep Convolutional Network for Image Super-Resolution , 2014, ECCV.

[46]  Sangram Ganguly,et al.  DeepSD: Generating High Resolution Climate Change Projections through Single Image Super-Resolution , 2017, KDD.