Handling Uncertainty in Geo-Spatial Data

An inherent challenge in any dataset containing spatial and/or temporal information is uncertainty arising from various sources of imprecision. Accounting for the impact of this uncertainty is paramount when estimating the reliability (confidence) of any query result derived from the underlying input data. To deal with uncertainty, solutions have been proposed independently in the geo-science and data-science research communities. This interdisciplinary tutorial bridges the gap between the two communities by providing a comprehensive overview of the challenges involved in dealing with uncertain geo-spatial data, surveying solutions from both research communities, and identifying similarities, synergies, and open research problems.
