Uncertain spatial data handling: Modeling, indexing and query

Managing and manipulating uncertainty in spatial databases are important problems for various practical applications of geographic information systems. Unlike the traditional fuzzy approaches in relational databases, in this paper a probability-based method to model and index uncertain spatial data is proposed. In this scheme, each object is represented by a probability density function (PDF) and a general measure is proposed for measuring similarity between the objects. To index objects, an optimized Gaussian mixture hierarchy (OGMH) is designed to support both certain/uncertain data and certain/uncertain queries. An uncertain R-tree is designed with two query filtering schemes, UR1 and UR2, for the special case when the query is certain. By performing a comprehensive comparison among OGMH, UR1, UR2 and a standard R-tree on US Census Bureau TIGER/Line^(R) Southern California landmark point dataset, it is found that UR1 is the best for certain queries. As an example of uncertain query support OGMH is applied to the Mojave Desert endangered species protection real dataset. It is found that OGMH provides more selective, efficient and flexible search than the results provided by the existing trial and error approach for endangered species habitat search. Details of the experiments are given and discussed.

[1]  P. Deb Finite Mixture Models , 2008 .

[2]  Bharat Bhargava,et al.  Advanced Database Systems , 1993, Lecture Notes in Computer Science.

[3]  Bir Bhanu,et al.  Indexing Structure For Handling Uncertain Spatial Data , 2003 .

[4]  Antonin Guttman,et al.  R-trees: a dynamic index structure for spatial searching , 1984, SIGMOD '84.

[5]  Anil K. Jain,et al.  Unsupervised Learning of Finite Mixture Models , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  K. Parker,et al.  90 years and 535 miles: Vegetation changes along the Mexican border , 1987 .

[7]  David G. Stork,et al.  Pattern Classification , 1973 .

[8]  V. B. Robinson A perspective on managing uncertainty in geographic information systems with fuzzy sets , 1998, 1998 IEEE International Conference on Fuzzy Systems Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98CH36228).

[9]  Christos Faloutsos,et al.  Advanced Database Systems , 1997, Lecture Notes in Computer Science.

[10]  Raghu Ramakrishnan,et al.  Database Management Systems , 1976 .

[11]  Markus Schneider,et al.  Uncertainty Management for Spatial Data in Databases: Fuzzy Spatial Data Types , 1999, SSD.

[12]  Agnès Voisard,et al.  Spatial Databases: With Application to GIS , 2001 .

[13]  Gary J. Hunter,et al.  Understanding error in spatial databases , 1992 .

[14]  Hans-Peter Kriegel,et al.  The R*-tree: an efficient and robust access method for points and rectangles , 1990, SIGMOD '90.

[15]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[16]  Bir Bhanu,et al.  Probabilistic Spatial Database Operations , 2003, SSTD.

[17]  B. Bhanu,et al.  Handling Uncertain Spatial Data : Comparisons between Indexing Structures , 2022 .