Computing Skyline Incrementally in Response to Online Preference Modification

Skyline queries retrieve the most interesting objects from a database with respect to multi-dimensional preferences. Identifying and extracting the relevant data corresponding to multiple criteria provided by users remains a difficult task, especially when the dataset is large. EC 2 Sky, our proposal, focuses on how to answer efficiently skyline queries in the presence of dynamic user preferences and despite large volumes of data. In 2008-2009, Wong et al. showed that the skyline associated with any preference on a particular dimension can be computed, without domination tests, from the skyline points associated with first order preferences on that same dimension. Consequently, they propose to materialize skyline points associated with the most preferred values in a specific data structure called IPO-tree (Implicit Preference Order Tree). However, the size of the IPO-tree is exponential with respect to the number of dimensions. While reusing the merging property proposed by Wong et al. to deal with the refinements of preferences on a single dimension, we propose an incremental method for calculating the skyline points related to several dimensions associated with dynamic preferences. For this purpose, a materialization of linear size which allows a great flexibility for dimension preference updates is defined. This contribution improves notably the execution time and storage size of queries. Experiments on synthetic data highlight the relevance of EC 2 Sky compared to IPO-Tree.

[1]  Marie-Odile Cordier,et al.  Incremental Computation of Skyline Queries with Dynamic Preferences , 2012, DEXA.

[2]  Wei Wang,et al.  Efficient Optimization of Multiple Subspace Skyline Queries , 2008, Journal of Computer Science and Technology.

[3]  Marlene Goncalves,et al.  Evaluating Top-k Skyline Queries over Relational Databases , 2007, DEXA.

[4]  Beng Chin Ooi,et al.  Efficient Progressive Skyline Computation , 2001, VLDB.

[5]  Chedy Raïssi,et al.  Computing closed skycubes , 2010, Proc. VLDB Endow..

[6]  Hirotaka Nakayama,et al.  Theory of Multiobjective Optimization , 1985 .

[7]  Wolf-Tilo Balke,et al.  Exploiting Indifference for Customization of Partial Order Skylines , 2006, 2006 10th International Database Engineering and Applications Symposium (IDEAS'06).

[8]  Xiang Lian,et al.  Efficient Processing of Metric Skyline Queries , 2009, IEEE Transactions on Knowledge and Data Engineering.

[9]  Jarek Gryz,et al.  Algorithms and analyses for maximal vector computation , 2007, The VLDB Journal.

[10]  Raymond Chi-Wing Wong,et al.  Online Skyline Analysis with Dynamic Preferences on Nominal Attributes , 2009, IEEE Transactions on Knowledge and Data Engineering.

[11]  Wenfei Fan,et al.  Keys with Upward Wildcards for XML , 2001, DEXA.

[12]  Jan Chomicki,et al.  Skyline with Presorting: Theory and Optimizations , 2005, Intelligent Information Systems.

[13]  Donald Kossmann,et al.  The Skyline operator , 2001, Proceedings 17th International Conference on Data Engineering.

[14]  G. R. Bitran,et al.  The structure of admissible points with respect to cone dominance , 1979 .

[15]  A. W. Kemp,et al.  Univariate Discrete Distributions , 1993 .

[16]  Jan Chomicki,et al.  Preference elicitation in prioritized skyline queries , 2010, The VLDB Journal.

[17]  Raymond Chi-Wing Wong,et al.  Efficient skyline querying with variable user preferences on nominal attributes , 2008, Proc. VLDB Endow..

[18]  Qing Liu,et al.  Efficient Computation of the Skyline Cube , 2005, VLDB.

[19]  H. T. Kung,et al.  On the Average Number of Maxima in a Set of Vectors and Applications , 1978, JACM.

[20]  Jian Pei,et al.  Catching the Best Views of Skyline: A Semantic Approach Based on Decisive Subspaces , 2005, VLDB.

[21]  Anthony K. H. Tung,et al.  On Efficient Processing of Subspace Skyline Queries on High Dimensional Data , 2007, 19th International Conference on Scientific and Statistical Database Management (SSDBM 2007).

[22]  Jian Pei,et al.  Efficient Skyline and Top-k Retrieval in Subspaces , 2007, IEEE Transactions on Knowledge and Data Engineering.

[23]  Bernhard Seeger,et al.  Progressive skyline computation in database systems , 2005, TODS.

[24]  Yufei Tao,et al.  On Skylining with Flexible Dominance Relation , 2008, 2008 IEEE 24th International Conference on Data Engineering.