Interactive Search for One of the Top-k

When a large dataset is given, it is not desirable for a user to read all tuples one-by-one in the whole dataset to find satisfied tuples. The traditional top-k query finds the best k tuples (i.e., the top-k tuples) w.r.t. the user's preference. However, in practice, it is difficult for a user to specify his/her preference explicitly. We study how to enhance the top-k query with user interaction. Specifically, we ask a user several questions, each of which consists of two tuples and asks the user to indicate which one s/he prefers. Based on the feedback, the user's preference is learned implicitly and one of the top-k tuples w.r.t. the learned preference is returned. Here, instead of directly following the top-k query to return all the top-k tuples, since it requires heavy user effort during the interaction (e.g., answering many questions), we reduce the output size to strike for a trade-off between the user effort and the output size. To achieve this, we present an algorithm 2D-PI which asks an asymptotically optimal number of questions in a 2-dimensional space, and two algorithms HD-PI and RH with provable performance guarantee in a d-dimensional space (d >= 2), where they focus on the number of questions asked and the execution time, respectively. Experiments were conducted on synthetic and real datasets, showing that our algorithms outperform existing ones by asking fewer questions within less time to return satisfied tuples.

[1]  Raymond Chi-Wing Wong,et al.  k-Hit Query: Top-k Query with Probabilistic Utility Function , 2015, SIGMOD Conference.

[2]  Li Qian,et al.  Learning User Preferences By Adaptive Pairwise Comparison , 2015, Proc. VLDB Endow..

[3]  Ihab F. Ilyas,et al.  Ranking with Uncertain Scores , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[4]  Yunjun Gao,et al.  On processing reverse k-skyband and ranked reverse skyline queries , 2015, Inf. Sci..

[5]  Raymond Chi-Wing Wong,et al.  FindYourFavorite: An Interactive System for Finding the User's Favorite Tuple in the Database , 2019, SIGMOD Conference.

[6]  Chen Chen,et al.  Sorting-based Interactive Regret Minimization , 2020, APWeb/WAIM.

[7]  Robert D. Nowak,et al.  Active Ranking using Pairwise Comparisons , 2011, NIPS.

[8]  Raymond Chi-Wing Wong,et al.  Strongly Truthful Interactive Regret Minimization , 2019, SIGMOD Conference.

[9]  Brian Eriksson,et al.  Learning to Top-K Search using Pairwise Comparisons , 2013, AISTATS.

[10]  Franz Aurenhammer,et al.  Handbook of Computational Geometry , 2000 .

[11]  Gerard Salton,et al.  Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer , 1989 .

[12]  Donald Kossmann,et al.  The Skyline operator , 2001, Proceedings 17th International Conference on Data Engineering.

[13]  Raymond Chi-Wing Wong,et al.  An experimental survey of regret minimization query and variants: bridging the best worlds between top-k query and skyline query , 2019, The VLDB Journal.

[14]  Raymond Chi-Wing Wong,et al.  Being Happy with the Least: Achieving α-happiness with Minimum Number of Tuples , 2020, 2020 IEEE 36th International Conference on Data Engineering (ICDE).

[15]  M. Tamer Özsu,et al.  Flexible integration of multimedia sub-queries with qualitative preferences , 2007, Multimedia Tools and Applications.

[16]  Alex Thomo,et al.  Computing k-Regret Minimizing Sets , 2014, Proc. VLDB Endow..

[17]  Seung-won Hwang,et al.  Personalized top-k skyline queries in high-dimensional space , 2009, Inf. Syst..

[18]  Melanie Revilla,et al.  Ideal and Maximum Length for a Web Survey , 2017, International Journal of Market Research.

[19]  Raymond Chi-Wing Wong,et al.  k-Regret Minimizing Set: Efficient Algorithms and Hardness , 2017, ICDT.

[20]  Tie-Yan Liu,et al.  Learning to rank for information retrieval , 2009, SIGIR.

[21]  Cheng Long,et al.  Efficient k-Regret Query Algorithm with Restriction-free Bound for any Dimensionality , 2018, SIGMOD Conference.

[22]  Abolfazl Asudeh,et al.  RRR: Rank-Regret Representative , 2018, SIGMOD Conference.

[23]  Richard J. Lipton,et al.  Regret-minimizing representative databases , 2010, Proc. VLDB Endow..

[24]  Wolf-Tilo Balke,et al.  User Interaction Support for Incremental Refinement of Preference-Based Queries , 2007, RCIS.

[25]  Kazuhisa Makino,et al.  Interactive regret minimization , 2012, SIGMOD Conference.

[26]  Bernhard Seeger,et al.  Progressive skyline computation in database systems , 2005, TODS.

[27]  Matthias Grossglauser,et al.  Just Sort It! A Simple and Effective Approach to Active Preference Learning , 2015, ICML.

[28]  Hans-Peter Kriegel,et al.  Efficient User-Adaptable Similarity Search in Large Multimedia Databases , 1997, VLDB.

[29]  Mark de Berg,et al.  Computational geometry: algorithms and applications , 1997 .

[30]  Bernard Chazelle,et al.  An optimal convex hull algorithm in any fixed dimension , 1993, Discret. Comput. Geom..

[31]  Georgia Koutrika,et al.  Preference-Based Query Personalization , 2013, Advanced Query Processing.

[32]  Ilaria Bartolini,et al.  FeedbackBypass: A New Approach to Interactive Similarity Query Processing , 2001, VLDB.

[33]  Bin Jiang,et al.  Mining preferences from superior and inferior examples , 2008, KDD.

[34]  Wolf-Tilo Balke,et al.  Eliciting Matters - Controlling Skyline Sizes by Incremental Integration of User Preferences , 2007, DASFAA.

[35]  Wolf-Tilo Balke,et al.  Interactive skyline queries , 2012, Inf. Sci..