Spatial join for high-resolution objects

Modern database applications including computer-aided design (CAD), medical imaging, molecular biology, or multimedia information systems impose new requirements on efficient spatial query processing. One of the most common query types in spatial database management systems is the spatial join. In this paper, we investigate spatial join processing for two sets of very complex spatial objects. We present an approach that is based on a fast filter step performing the spatial join on simple primitives which conservatively approximate the objects. Our main attention is focused on the problem how to generate approximations adequate for high-resolution objects. In this paper, we introduce gray approximations as a general concept which helps to range between replicating and nonreplicating object approximations. The key idea of our approach is to build these replications based on statistical information taking the data distribution of the respective join-partner relation into account. Furthermore, our approach uses compression techniques for the effective storage and retrieval of the decomposed spatial objects. We demonstrate the benefits of our new method for the spatial intersection join on high resolution data. The experimental evaluation on real-world test data points out that our new concept accelerates the spatial intersection join considerably.

[1]  Hans-Peter Kriegel,et al.  A cost model for interval intersection queries on RI-trees , 2002, Proceedings 14th International Conference on Scientific and Statistical Database Management.

[2]  Hans-Peter Kriegel,et al.  Query Processing of Spatial Objects: Complexity versus Redundancy , 1993, SSD.

[3]  Volker Gaede,et al.  Optimal Redundancy in Spatial Database Systems , 1995, SSD.

[4]  D. J. Wheeler,et al.  A Block-sorting Lossless Data Compression Algorithm , 1994 .

[5]  Sridhar Ramaswamy,et al.  Scalable Sweeping-Based Spatial Join , 1998, VLDB.

[6]  Hans-Peter Kriegel,et al.  Efficient processing of spatial joins using R-trees , 1993, SIGMOD Conference.

[7]  Oliver Günther,et al.  Multidimensional access methods , 1998, CSUR.

[8]  Christos Faloutsos,et al.  Analysis of the Clustering Properties of the Hilbert Space-Filling Curve , 2001, IEEE Trans. Knowl. Data Eng..

[9]  Jack A. Orenstein Spatial query processing in an object-oriented database system , 1986, SIGMOD '86.

[10]  David J. DeWitt,et al.  Partition based spatial-merge join , 1996, SIGMOD '96.

[11]  Ming-Ling Lo,et al.  Spatial hash-joins , 1996, SIGMOD '96.

[12]  Michael Stonebraker,et al.  The Sequoia 2000 Benchmark , 1993, SIGMOD Conference.

[13]  Hans-Peter Kriegel,et al.  Managing Intervals Efficiently in Object-Relational Databases , 2000, VLDB.

[14]  Ming-Ling Lo,et al.  Spatial joins using seeded trees , 1994, SIGMOD '94.

[15]  Hugh E. Williams,et al.  A compression scheme for large databases , 2000, Proceedings 11th Australasian Database Conference. ADC 2000 (Cat. No.PR00528).

[16]  Christos Faloutsos,et al.  Analysis of the n-Dimensional Quadtree Decomposition for Arbitrary Hyperectangles , 1997, IEEE Trans. Knowl. Data Eng..

[17]  Jack A. Orenstein Redundancy in spatial databases , 1989, SIGMOD '89.