Measuring consistency with respect to topological dependency constraints

In contrast to the enormous development of database management systems to support spatial databases, very little work has been done in evaluating the quality of spatial data in terms of how much they satisfy a set of topo-semantic integrity constraints, in particular, a set of topological dependency constraints. In the same way, mechanisms for enforcing the satisfaction of those constraints are not necessarily available or even feasible. In this paper we propose measures to evaluate the degree of violation of a topological dependency constraint by geometries stored in a spatial database instance. We also propose how these measures can be aggregated to globally evaluate the data quality of a database instance such that they enable to compare database instances in terms of their constraint satisfaction. We provide an experimental evaluation of those measures using synthetic and real data. We validate our measures by i) analyzing their correlation with the semantic distance of topological relations and ii) checking that the more we randomly modify geometries to make database instances inconsistent, the more our global data quality measure decreases, showing its sensibility to the introduced constraint violations.

[1]  Stephan Mäs,et al.  Categories of Geospatial and Temporal Integrity Constraints , 2009, 2009 International Conference on Advanced Geographic Information Systems & Web Services.

[2]  Alberto H. F. Laender,et al.  Integrity Constraints in Spatial Databases , 2002, Database Integrity.

[3]  Alberto H. F. Laender,et al.  Spatial data integrity constraints in object oriented geographic data modeling , 1999, GIS '99.

[4]  Stephan Mäs,et al.  Reasoning on Spatial Semantic Integrity Constraints , 2007, COSIT.

[5]  M. Andrea Rodríguez,et al.  Defining and Comparing Content Measures of Topological Relations , 2004, GeoInformatica.

[6]  Alberto Del Bimbo,et al.  The computational aspect of retrieval by spatial arrangement , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[7]  Ramez Elmasri,et al.  Fundamentals of Database Systems , 1989 .

[8]  David M. Mark,et al.  Modelling Conceptual Neighbourhoods of Toplogical Line-Region Relations , 1995, Int. J. Geogr. Inf. Sci..

[9]  Dimitris Papadias,et al.  Algorithms for Querying by Spatial Structure , 1998, VLDB.

[10]  Carlos Ordonez,et al.  Measuring referential integrity in distributed databases , 2007, CIMS '07.

[11]  Sophie Cockcroft A Taxonomy of Spatial Data Integrity Constraints , 1997, GeoInformatica.

[12]  Max J. Egenhofer,et al.  Conceptual Neighborhoods of Topological Relations Between Lines , 2008, SDH.

[13]  Leopoldo E. Bertossi,et al.  An inconsistency tolerant approach to querying spatial databases , 2008, GIS '08.

[14]  Anthony Hunter,et al.  Approaches to Measuring Inconsistent Information , 2005, Inconsistency Tolerance.

[15]  Nectaria Tryfona,et al.  A Model for Expressing topological Integrity Constraints in Geographic Databases , 1992, Spatio-Temporal Reasoning.

[16]  M. Egenhofer Categorizing Binary Topological Relations Between Regions, Lines, and Points in Geographic Databases , 1998 .

[17]  Robert Laurini,et al.  A Methodology for Spatial Consistency Improvement of Geographic Databases , 2000, GeoInformatica.

[18]  MAX J. EGENHOFER,et al.  Point Set Topological Relations , 1991, Int. J. Geogr. Inf. Sci..

[19]  Dov M. Gabbay,et al.  Making inconsistency respectable: a logical framework for inconsistency in reasoning , 1991, FAIR.

[20]  Anthony G. Cohn,et al.  A Spatial Logic based on Regions and Connection , 1992, KR.

[21]  Max J. Egenhofer,et al.  Reasoning about Gradual Changes of Topological Relationships , 1992, Spatio-Temporal Reasoning.

[22]  M. Andrea Rodríguez,et al.  Semantic Integrity Constraints for Spatial Databases , 2009, AMW.

[23]  Fabien Robineau,et al.  OpenGIS Simple Features Specification For SQL, Revision 0 , 1997 .

[24]  Ramez Elmasri,et al.  Fundamentals of Database Systems, 5th Edition , 2006 .