Probabilistic contextual skylines

The skyline query returns the most interesting tuples according to a set of explicitly defined preferences among attribute values. This work relaxes that requirement and allows users to pose meaningful skyline queries without stating their preferences. To compensate for the missing knowledge, we first determine a set of uncertain preferences based on user profiles, i.e., information collected for previous contexts. We then define the probabilistic contextual skyline query (p-CSQ), which returns the tuples that are interesting with high probability. We emphasize that, unlike in past work, the uncertainty lies in the query and not in the data, i.e., it is in the relationships among tuples rather than in their attribute values. Furthermore, due to the nature of this uncertainty, popular skyline methods, which rely on a particular tuple visit order, do not apply to p-CSQs. Therefore, we present novel non-indexed and index-based algorithms for answering p-CSQs. Our experimental evaluation shows that the proposed techniques are significantly more efficient than a standard block nested loops approach.
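
To make the idea of uncertainty in the query concrete, the following is a minimal brute-force sketch and not one of the algorithms proposed here. It assumes that each pair of attribute values carries an independent probability of one value being preferred to the other, and that a tuple's skyline probability is the probability that no other tuple dominates it. The preference table, the tuples, the threshold, and all function names are hypothetical illustrations.

```python
from itertools import product

# Hypothetical uncertain preferences (illustrative, not from the paper):
# for each attribute, (a, b) -> Pr[a is preferred to b].
# Any leftover probability mass means the two values are incomparable.
PREFS = {
    "color": {("red", "blue"): 0.7, ("blue", "red"): 0.2},
    "brand": {("acme", "zeta"): 0.5, ("zeta", "acme"): 0.5},
}

# Hypothetical tuples; the attribute values themselves are certain.
TUPLES = {
    "t1": {"color": "red", "brand": "acme"},
    "t2": {"color": "blue", "brand": "acme"},
    "t3": {"color": "blue", "brand": "zeta"},
}


def possible_worlds(prefs):
    """Yield (probability, order) for every combination of pair outcomes.

    Assumes value pairs are mutually independent; `order` is a set of
    (attribute, better_value, worse_value) facts that hold in that world.
    """
    per_pair = []
    for attr, table in prefs.items():
        for x, y in sorted({tuple(sorted(pair)) for pair in table}):
            p_xy = table.get((x, y), 0.0)
            p_yx = table.get((y, x), 0.0)
            outcomes = [
                (p_xy, (attr, x, y)),
                (p_yx, (attr, y, x)),
                (1.0 - p_xy - p_yx, None),  # incomparable
            ]
            per_pair.append([o for o in outcomes if o[0] > 1e-12])
    for combo in product(*per_pair):
        prob = 1.0
        order = set()
        for p, rel in combo:
            prob *= p
            if rel is not None:
                order.add(rel)
        yield prob, order


def dominates(t, s, order):
    """t dominates s if t is preferred wherever they differ, and they differ somewhere."""
    strict = False
    for attr in t:
        if t[attr] == s[attr]:
            continue
        if (attr, t[attr], s[attr]) not in order:
            return False
        strict = True
    return strict


def skyline_probabilities(tuples, prefs):
    """Sum the probability of every world in which a tuple is not dominated."""
    probs = {name: 0.0 for name in tuples}
    for p, order in possible_worlds(prefs):
        for name, t in tuples.items():
            others = (s for other, s in tuples.items() if other != name)
            if not any(dominates(s, t, order) for s in others):
                probs[name] += p
    return probs


def p_csq(tuples, prefs, threshold):
    """Return the names of tuples whose skyline probability reaches the threshold."""
    probs = skyline_probabilities(tuples, prefs)
    return sorted(name for name, p in probs.items() if p >= threshold)


if __name__ == "__main__":
    print(skyline_probabilities(TUPLES, PREFS))
    print(p_csq(TUPLES, PREFS, threshold=0.4))
```

With these assumed numbers, t1 has a skyline probability of about 0.8, t2 about 0.15, and t3 about 0.5, so a threshold of 0.4 keeps t1 and t3. Enumerating possible worlds is exponential in the number of value pairs, which is why it serves only as an illustration of the query semantics and not as a practical evaluation strategy.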
