Probabilistic Reverse Top-k Queries

Ranking-aware queries are among the most fundamental queries in database management. The top-k query, which returns the k elements with the highest scores under a given ranking function, has been studied for decades. More recently, researchers have also studied the reverse top-k query, which finds all customers who rank a given query object among their top-k favorite elements. In such applications, each customer is described by a vector. However, no existing work has considered the reverse top-k query over uncertain data, which is our focus. In this paper, we propose two methods for the probabilistic reverse top-k query, namely BLS and ALS. As a basic solution, BLS checks each pair of user and product to find the query result. As an advanced solution, ALS uses two pruning rules and historical information to significantly improve efficiency. Both detailed analysis and experiments on real and synthetic data sets illustrate the efficiency of the proposed methods.
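
To make the query concrete, the following is a minimal Python sketch of a pairwise baseline in the spirit of BLS. It is not the paper's algorithm: the abstract does not define the uncertainty model or the exact query semantics, so everything here is an assumption made for illustration. The sketch assumes a linear scoring function where a higher score is better, a certain query object q, mutually independent uncertain products each given as a discrete list of (attribute vector, probability) instances, and a probability threshold for qualifying users. The names score, prob_in_top_k, and reverse_top_k_baseline are hypothetical.

```python
from typing import List, Sequence, Tuple

# One possible instance of an uncertain product: (attribute vector, probability).
Instance = Tuple[Sequence[float], float]


def score(w: Sequence[float], x: Sequence[float]) -> float:
    """Linear ranking function (assumed): weighted sum, higher is better."""
    return sum(wi * xi for wi, xi in zip(w, x))


def prob_in_top_k(w: Sequence[float], q: Sequence[float],
                  products: List[List[Instance]], k: int) -> float:
    """Probability that fewer than k products outscore q for user w.

    Assumes products are independent, so the number of products that beat q
    follows a Poisson binomial distribution, computed by dynamic programming.
    """
    sq = score(w, q)
    # dist[j] = probability that exactly j products beat q (only j < k tracked).
    dist = [1.0] + [0.0] * (k - 1)
    for instances in products:
        # Probability that this product beats q under the assumed model.
        p = sum(pr for x, pr in instances if score(w, x) > sq)
        new = [0.0] * k
        for j in range(k):
            new[j] += dist[j] * (1.0 - p)
            if j + 1 < k:
                new[j + 1] += dist[j] * p
        dist = new
    return sum(dist)


def reverse_top_k_baseline(users: List[Sequence[float]], q: Sequence[float],
                           products: List[List[Instance]], k: int,
                           threshold: float) -> List[int]:
    """Check every (user, product) pair; return indices of qualifying users."""
    return [i for i, w in enumerate(users)
            if prob_in_top_k(w, q, products, k) >= threshold]


if __name__ == "__main__":
    users = [(1.0, 0.0), (0.0, 1.0)]
    q = (0.5, 0.5)
    products = [
        [((0.9, 0.1), 0.5), ((0.1, 0.9), 0.5)],  # uncertain product
        [((0.6, 0.4), 1.0)],                      # certain product
    ]
    print(reverse_top_k_baseline(users, q, products, k=2, threshold=0.6))
```

The cost of this sketch grows with the number of users times the number of product instances, which is the kind of exhaustive pairwise work that pruning rules such as those attributed to ALS would aim to avoid.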
