Topologically Sorted Skylines for Partially Ordered Domains

The vast majority of work on skyline queries considers totally ordered domains, whereas in many applications some attributes are partially ordered, as for instance, domains of set values, hierarchies, intervals and preferences. The only work addressing this issue has limited progressiveness and pruning ability, and it is only applicable to static skylines. This paper overcomes these problems with the following contributions. (i) We introduce a generic framework, termed TSS, for handling partially ordered domains using topological sorting. (ii) We propose a novel dominance check that eliminates false hits/misses, further enhancing progressiveness and pruning ability. (iii) We extend our methodology to dynamic skylines with respect to an input query. In this case, the dominance relationships change according to the query specification, and their computation is rather complex. We perform an extensive experimental evaluation demonstrating that TSS is up to 9 times and up to 2 orders of magnitude faster than existing methods in the static and the dynamic case, respectively.

[1]  Werner Kießling,et al.  Foundations of Preferences in Database Systems , 2002, VLDB.

[2]  Qing Liu,et al.  Efficient Computation of the Skyline Cube , 2005, VLDB.

[3]  Bernhard Seeger,et al.  Progressive skyline computation in database systems , 2005, TODS.

[4]  Anthony K. H. Tung,et al.  DADA: a data cube for dominant relationship analysis , 2006, SIGMOD Conference.

[5]  Jan Chomicki,et al.  Skyline with presorting , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[6]  Franco P. Preparata,et al.  Sequencing-by-hybridization revisited: the analog-spectrum proposal , 2004, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[7]  Michael Ian Shamos,et al.  Computational geometry: an introduction , 1985 .

[8]  Rada Chirkova,et al.  Efficiently Querying Large XML Data Repositories: A Survey , 2007, IEEE Transactions on Knowledge and Data Engineering.

[9]  Ilaria Bartolini,et al.  Efficient sort-based skyline evaluation , 2008, TODS.

[10]  Heng Tao Shen,et al.  Multi-source Skyline Query Processing in Road Networks , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[11]  Donald Kossmann,et al.  The Skyline operator , 2001, Proceedings 17th International Conference on Data Engineering.

[12]  Kian-Lee Tan,et al.  Stratified computation of skylines with partially-ordered domains , 2005, SIGMOD '05.

[13]  Bin Jiang,et al.  Probabilistic Skylines on Uncertain Data , 2007, VLDB.

[14]  Bernhard Seeger,et al.  Efficient Computation of Reverse Skyline Queries , 2007, VLDB.

[15]  Jan Chomicki,et al.  Preference formulas in relational queries , 2003, TODS.

[16]  Raymond Chi-Wing Wong,et al.  Mining favorable facets , 2007, KDD '07.

[17]  Beng Chin Ooi,et al.  Efficient Progressive Skyline Computation , 2001, VLDB.

[18]  Donald Kossmann,et al.  Shooting Stars in the Sky: An Online Algorithm for Skyline Queries , 2002, VLDB.

[19]  Dimitris Sacharidis,et al.  Caching Dynamic Skyline Queries , 2008, SSDBM.

[20]  Jarek Gryz,et al.  Algorithms and analyses for maximal vector computation , 2007, The VLDB Journal.

[21]  Peter J. Haas,et al.  Interactive data Analysis: The Control Project , 1999, Computer.

[22]  Alexander Borgida,et al.  Efficient management of transitive relationships in large data and knowledge bases , 1989, SIGMOD '89.

[23]  Cyrus Shahabi,et al.  The spatial skyline queries , 2006, VLDB.