Linear clustering of objects with multiple attributes

There is often a need to map a multi-dimensional space onto a one-dimensional space. For example, this kind of mapping has been proposed to permit the application of one-dimensional indexing techniques to a multi-dimensional index space, such as in a spatial database. This kind of mapping is also of value in assigning physical storage, such as assigning buckets to records that have been indexed on multiple attributes, so as to minimize the disk access effort. In this paper, we discuss the desired properties of such a mapping and evaluate, through analysis and simulation, several mappings that have been proposed in the past. We present a mapping based on Hilbert's space-filling curve, which on average outperforms previously proposed mappings over a variety of operating conditions.
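To make the idea of a Hilbert-curve mapping concrete, the following is a minimal sketch of the widely known iterative conversion from a two-dimensional grid cell to its position along the curve. It is not taken from the paper: the function name `hilbert_index`, the parameter `order`, and the restriction to two dimensions are assumptions made for illustration only.

```python
def hilbert_index(order, x, y):
    """Map cell (x, y) of a 2**order x 2**order grid to its position
    along a Hilbert curve (0 .. 4**order - 1).

    Illustrative sketch only; names and the 2-D restriction are assumptions.
    """
    n = 1 << order          # side length of the grid
    d = 0                   # accumulated distance along the curve
    s = n >> 1              # current quadrant size, halved each round
    while s > 0:
        rx = 1 if (x & s) else 0
        ry = 1 if (y & s) else 0
        # Each quadrant contributes s*s cells; (3*rx) ^ ry orders the quadrants.
        d += s * s * ((3 * rx) ^ ry)
        # Rotate/reflect so the sub-curve has the standard orientation.
        if ry == 0:
            if rx == 1:
                x = n - 1 - x
                y = n - 1 - y
            x, y = y, x
        s >>= 1
    return d


if __name__ == "__main__":
    # List the cells of a 4 x 4 grid in curve order.
    cells = sorted(
        ((x, y) for x in range(4) for y in range(4)),
        key=lambda c: hilbert_index(2, *c),
    )
    print(cells)
```

Sorting records by such an index places cells that are consecutive along the curve next to each other in the grid, which is the locality property that makes this kind of mapping attractive for assigning records to storage buckets.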
