SkyMap: A Trie-Based Index Structure for High-Performance Skyline Query Processing

Skyline queries have become commonplace in many applications. The main problem is to efficiently find the set of Pareto-optimal choices from a large amount of database items. Several algorithms and indexing techniques have been proposed recently, but until now no indexing technique was able to address all problems for skyline queries in realistic applications: fast access, superior scalability even for higher dimensions, and low costs for maintenance in face of data updates. In this paper we design and evaluate a trie-based indexing technique that solves the major efficiency bottlenecks of skyline queries. It scales gracefully even for high dimensional queries, is largely independent of the underlying data distributions, and allows for efficient updates. Our experiments on real and synthetic datasets show a performance increase of up to two orders of magnitude compared to previous indexing techniques.

[1]  R. Nelsen An Introduction to Copulas , 1998 .

[2]  Seung-won Hwang,et al.  Navigation system for product search , 2010, 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010).

[3]  Ilaria Bartolini,et al.  Efficient sort-based skyline evaluation , 2008, TODS.

[4]  Seung-won Hwang,et al.  BSkyTree: scalable skyline computation using a balanced pivot selection , 2010, EDBT '10.

[5]  Yuan Tian,et al.  Z-SKY: an efficient skyline query processing framework based on Z-order , 2010, The VLDB Journal.

[6]  Jian Pei,et al.  Efficient Skyline and Top-k Retrieval in Subspaces , 2007, IEEE Transactions on Knowledge and Data Engineering.

[7]  Bernhard Seeger,et al.  Progressive skyline computation in database systems , 2005, TODS.

[8]  Donald Kossmann,et al.  The Skyline operator , 2001, Proceedings 17th International Conference on Data Engineering.

[9]  Anthony K. H. Tung,et al.  Finding k-dominant skylines in high dimensional space , 2006, SIGMOD Conference.

[10]  Beng Chin Ooi,et al.  Indexing for progressive skyline computation , 2003, Data Knowl. Eng..

[11]  Paolo Viappiani,et al.  Preference-based Search using Example-Critiquing with Suggestions , 2006, J. Artif. Intell. Res..

[12]  Nikos Mamoulis,et al.  Scalable skyline computation using object-based space partitioning , 2009, SIGMOD Conference.

[13]  Jan Chomicki,et al.  Skyline with presorting , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[14]  R. Nelsen An Introduction to Copulas (Springer Series in Statistics) , 2006 .

[15]  Donald Kossmann,et al.  Shooting Stars in the Sky: An Online Algorithm for Skyline Queries , 2002, VLDB.

[16]  H. Sagan Space-filling curves , 1994 .