Efficient skyline evaluation over partially ordered domains

Although there is a considerable body of work on skyline evaluation over multidimensional data with totally ordered attribute domains, only a few methods consider attributes with partially ordered domains. Existing work maps each partially ordered domain to a total order and then adapts algorithms for totally ordered domains to solve the problem. Nevertheless, these methods either use stronger notions of dominance, which generate false positives, or require expensive dominance checks. In this paper, we propose two new methods that do not have these drawbacks. The first method combines an appropriate mapping of a partial order to a total order, inspired by the lattice theorem, with an off-the-shelf skyline algorithm. The second technique uses an appropriate storage and indexing approach, inspired by column stores, which enables efficient verification of whether a pair of objects are incompatible. We demonstrate that both methods are up to an order of magnitude more efficient than previous work and scale well with different problem parameters, such as the complexity of the partial orders.
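
To make the problem setting concrete, the following is a minimal sketch of skyline evaluation with one totally ordered attribute and one partially ordered attribute. It is a naive quadratic baseline written for illustration only, not either of the two methods proposed here. The example data, the preference edges, and the names `transitive_closure`, `dominates`, and `skyline` are all assumptions introduced for this sketch.

```python
from itertools import product

def transitive_closure(edges):
    """Return every (better, worse) pair implied by the given preference edges."""
    closure = set(edges)
    changed = True
    while changed:
        changed = False
        # Iterate over a snapshot so the set can grow safely inside the loop.
        for (a, b), (c, d) in product(list(closure), repeat=2):
            if b == c and (a, d) not in closure:
                closure.add((a, d))
                changed = True
    return closure

def dominates(p, q, better):
    """True if p is at least as good as q on both attributes and strictly better on one.

    p[0], q[0]: totally ordered attribute (smaller is better).
    p[1], q[1]: partially ordered attribute, compared through `better`.
    """
    strictly = False
    if p[0] > q[0]:
        return False
    if p[0] < q[0]:
        strictly = True
    if p[1] != q[1]:
        # Distinct values must be related in the right direction;
        # incomparable values block dominance.
        if (p[1], q[1]) not in better:
            return False
        strictly = True
    return strictly

def skyline(objects, better):
    """Naive skyline: keep every object that no other object dominates."""
    return [p for p in objects
            if not any(dominates(q, p, better) for q in objects)]

# Hypothetical partial order: a is preferred to b and c, which are both
# preferred to d; b and c are incomparable with each other.
PREFS = {("a", "b"), ("a", "c"), ("b", "d"), ("c", "d")}
better = transitive_closure(PREFS)

# Hypothetical objects: (price, category).
hotels = [(100, "b"), (100, "c"), (120, "a"), (90, "d"), (110, "b"), (130, "d")]
result = skyline(hotels, better)
print(result)
```

On this data, `(100, "b")` and `(100, "c")` both stay in the skyline because `b` and `c` are incomparable, whereas `(110, "b")` and `(130, "d")` are dominated. Every pairwise test here consults the partial order directly, which is the kind of dominance check that becomes expensive as the data and the partial orders grow.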
