VMPSP: Efficient Skyline Computation Using VMP-Based Space Partitioning

The skyline query returns a set of interesting points that are not dominated by any other points in the multi-dimensional data sets. This query has already been considerably studied over last several years in preference analysis and multi-criteria decision making applications fields. Space partitioning, the best non-index framework, has been proposed and existing methods based on it do not consider the balance of partitioned subspaces. To overcome this limitation, we first develop a cost evaluation model of space partitioning in skyline computation, propose an efficient approach to compute the skyline set using balanced partitioning. We illustrate the importance of the balance in partitioning. Based on this, we propose a method to construct a balanced partitioning point VMP whose ith attribute value is the median value of all points in ith dimension. We also design a structure RST to reduce dominance tests among those subspaces which are comparable. The experimental evaluation indicates that our algorithm is faster at least several times than existing state-of-the-art algorithms.

[1]  Seung-won Hwang,et al.  QSkycube: Efficient Skycube Computation using Point-Based Space Partitioning , 2010, Proc. VLDB Endow..

[2]  Nikos Mamoulis,et al.  Scalable skyline computation using object-based space partitioning , 2009, SIGMOD Conference.

[3]  Bernhard Seeger,et al.  Progressive skyline computation in database systems , 2005, TODS.

[4]  Seung-won Hwang,et al.  BSkyTree: scalable skyline computation using a balanced pivot selection , 2010, EDBT '10.

[5]  Ken C. K. Lee,et al.  Approaching the Skyline in Z Order , 2007, VLDB.

[6]  Ilaria Bartolini,et al.  Efficient sort-based skyline evaluation , 2008, TODS.

[7]  Jan Chomicki,et al.  Skyline with presorting , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[8]  Donald Kossmann,et al.  The Skyline operator , 2001, Proceedings 17th International Conference on Data Engineering.

[9]  Donald Kossmann,et al.  Shooting Stars in the Sky: An Online Algorithm for Skyline Queries , 2002, VLDB.

[10]  Jarek Gryz,et al.  Maximal Vector Computation in Large Data Sets , 2005, VLDB.

[11]  Ira Assent,et al.  Work-Efficient Parallel Skyline Computation for the GPU , 2015, Proc. VLDB Endow..

[12]  Beng Chin Ooi,et al.  Efficient Progressive Skyline Computation , 2001, VLDB.