Using B+-trees for processing of line segments in large spatial databases

Points, lines, and regions are the three basic entities from which vector-based objects in spatial databases are built. Many indexing methods (G-tree, K-D-B-tree, Quad-tree, PMR-tree, Grid file, R-tree, and so on) have been widely discussed for handling point or region data. These traditional methods organize point or region objects in a space into a hashing or hierarchical directory, and they provide efficient access methods for exact retrieval. However, two problems arise when these techniques are applied to line segments. The first is that representing line segments as point or region objects cannot exactly preserve the spatial information about the proximity of line segments. The second is that enclosing line objects in rectangles produces large dead space and overlapping areas in the external and internal nodes of the hierarchical directory. In this paper, we propose an indexing structure for line segments based on the B+-tree to remedy these two problems. Our experimental results show that the approach significantly improves storage efficiency. Retrieval efficiency is also significantly improved compared with the method using the R-tree index scheme. These improvements derive mainly from the proposed data processing techniques and the new indexing method.
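The dead-space and overlap problem can be illustrated with a minimal Python sketch. It is not the indexing method proposed in the paper; the `Segment` type and the helper functions are hypothetical names introduced only for this illustration. The sketch assumes axis-aligned minimum bounding rectangles (MBRs), as used by R-tree style indexes.

```python
from typing import NamedTuple, Tuple


class Segment(NamedTuple):
    """A line segment from (x1, y1) to (x2, y2). Hypothetical helper type."""
    x1: float
    y1: float
    x2: float
    y2: float


def mbr(seg: Segment) -> Tuple[float, float, float, float]:
    """Return the axis-aligned bounding rectangle (xmin, ymin, xmax, ymax)."""
    return (min(seg.x1, seg.x2), min(seg.y1, seg.y2),
            max(seg.x1, seg.x2), max(seg.y1, seg.y2))


def mbr_area(seg: Segment) -> float:
    """Area of the MBR. A segment has zero area, so all of this is dead space."""
    xmin, ymin, xmax, ymax = mbr(seg)
    return (xmax - xmin) * (ymax - ymin)


def mbrs_overlap(a: Segment, b: Segment) -> bool:
    """True if the two MBRs share at least one point."""
    ax1, ay1, ax2, ay2 = mbr(a)
    bx1, by1, bx2, by2 = mbr(b)
    return ax1 <= bx2 and bx1 <= ax2 and ay1 <= by2 and by1 <= ay2


# Two parallel diagonal segments (y = x and y = x + 2) that never meet.
a = Segment(0.0, 0.0, 10.0, 10.0)
b = Segment(0.0, 2.0, 8.0, 10.0)

dead_space_a = mbr_area(a)        # entire rectangle is empty except the line
dead_space_b = mbr_area(b)
false_overlap = mbrs_overlap(a, b)  # rectangles overlap although segments do not
```

Here the two segments are disjoint, yet their rectangles overlap, so a rectangle-based index would have to inspect both entries for a query that touches only one of them.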
