Group skyline computation

Given a multi-dimensional dataset of tuples, skyline computation returns a subset of tuples that are not dominated by any other tuples when all dimensions are considered together. Conventional skyline computation, however, is inadequate to answer various queries that need to analyze not just individual tuples of a dataset but also their combinations. In this paper, we study group skyline computation which is based on the notion of dominance relation between groups of the same number of tuples. It determines the dominance relation between two groups by comparing their aggregate values such as sums or averages of elements of individual dimensions, and identifies a set of skyline groups that are not dominated by any other groups. We investigate properties of group skyline computation and develop a group skyline algorithm GDynamic which is equivalent to a dynamic algorithm that fills a table of skyline groups. Experimental results show that GDynamic is a practical group skyline algorithm.

[1]  Nikos Mamoulis,et al.  Scalable skyline computation using object-based space partitioning , 2009, SIGMOD Conference.

[2]  Yufei Tao,et al.  Maintaining sliding window skylines on data streams , 2006, IEEE Transactions on Knowledge and Data Engineering.

[3]  Vagelis Hristidis,et al.  Authority-based keyword search in databases , 2008, TODS.

[4]  Jignesh M. Patel,et al.  Efficient Continuous Skyline Computation , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[5]  Lei Chen,et al.  Continuous monitoring of skylines over uncertain data streams , 2012, Inf. Sci..

[6]  Mikhail J. Atallah,et al.  Computing all skyline probabilities for uncertain data , 2009, PODS.

[7]  Ken C. K. Lee,et al.  Approaching the Skyline in Z Order , 2007, VLDB.

[8]  Ilaria Bartolini,et al.  Efficient sort-based skyline evaluation , 2008, TODS.

[9]  Hongjun Lu,et al.  Stabbing the sky: efficient skyline computation over sliding windows , 2005, 21st International Conference on Data Engineering (ICDE'05).

[10]  Norbert Zeh,et al.  Parallel Computation of Skyline Queries , 2007, 21st International Symposium on High Performance Computing Systems and Applications (HPCS'07).

[11]  Wolf-Tilo Balke,et al.  Highly Scalable Multiprocessing Algorithms for Preference-Based Database Retrieval , 2010, DASFAA.

[12]  Donald Kossmann,et al.  Shooting Stars in the Sky: An Online Algorithm for Skyline Queries , 2002, VLDB.

[13]  Ben Y. Zhao,et al.  Parallelizing Skyline Queries for Scalable Distribution , 2006, EDBT.

[14]  Christian Böhm,et al.  Probabilistic skyline queries , 2009, CIKM.

[15]  Moni Naor,et al.  Optimal aggregation algorithms for middleware , 2001, PODS.

[16]  B. C. Brookes,et al.  Information Sciences , 2020, Cognitive Skills You Need for the 21st Century.

[17]  Jan Chomicki,et al.  Skyline with presorting , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[18]  Jian Pei,et al.  Distance-Based Representative Skyline , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[19]  Donald Kossmann,et al.  The Skyline operator , 2001, Proceedings 17th International Conference on Data Engineering.

[20]  Jarek Gryz,et al.  Maximal Vector Computation in Large Data Sets , 2005, VLDB.

[21]  Jaewoo Kang,et al.  Efficient skycube computation using point and domain-based filtering , 2010, Inf. Sci..

[22]  Bin Jiang,et al.  Probabilistic Skylines on Uncertain Data , 2007, VLDB.

[23]  Marios Hadjieleftheriou,et al.  R-Trees - A Dynamic Index Structure for Spatial Searching , 2008, ACM SIGSPATIAL International Workshop on Advances in Geographic Information Systems.

[24]  Man Lung Yiu,et al.  Group-by skyline query processing in relational engines , 2009, CIKM.

[25]  Chiang Lee,et al.  Top-k Combinatorial Skyline Queries , 2010, DASFAA.

[26]  Xuemin Lin,et al.  Selecting Stars: The k Most Representative Skyline Operator , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[27]  Bernhard Seeger,et al.  Progressive skyline computation in database systems , 2005, TODS.

[28]  Raymond Chi-Wing Wong,et al.  Creating Competitive Products , 2009, Proc. VLDB Endow..

[29]  Jonghyun Park,et al.  Parallel Skyline Computation on Multicore Architectures , 2009, ICDE.

[30]  Christos Doulkeridis,et al.  Angle-based space partitioning for efficient parallel skyline computation , 2008, SIGMOD Conference.