Being Happy with the Least: Achieving α-happiness with Minimum Number of Tuples

When faced with a database containing millions of products, a user may be only interested in a (typically much) smaller representative subset. Various approaches were proposed to create a good representative subset that fits the user’s needs which are expressed in the form of a utility function (e.g., the top-k and diversification query). Recently, a regret minimization query was proposed: it does not require users to provide their utility functions and returns a small set of tuples such that any user’s favorite tuple in this subset is guaranteed to be not much worse than his/her favorite tuple in the whole database. In a sense, this query finds a small set of tuples that makes the user happy (i.e., not regretful) even if s/he gets the best tuple in the selected set but not the best tuple among all tuples in the database.In this paper, we study the min-size version of the regret minimization query; that is, we want to determine the least tuples needed to keep users happy at a given level. We term this problem as the α-happiness query where we quantify the user’s happiness level by a criterion, called the happiness ratio, and guarantee that each user is at least α happy with the set returned (i.e., the happiness ratio is at least α) where α is a real number from 0 to 1. As this is an NP-hard problem, we derive an approximate solution with theoretical guarantee by considering the problem from a geometric perspective. Since in practical scenarios, users are interested in achieving higher happiness levels (i.e., α is closer to 1), we performed extensive experiments for these scenarios, using both real and synthetic datasets. Our evaluations show that our algorithm outperforms the best-known previous approaches in two ways: (i) it answers the α-happiness query by returning fewer tuples to users and, (ii) it answers much faster (up to two orders of magnitude times improvement for large α).

[1]  Yuichi Yoshida,et al.  Regret Ratio Minimization in Multi-Objective Submodular Function Maximization , 2017, AAAI.

[2]  Raymond Chi-Wing Wong,et al.  Geometry approach for k-regret query , 2014, 2014 IEEE 30th International Conference on Data Engineering.

[3]  Stavros Sintos,et al.  Faster Approximation Algorithm for the k-Regret Minimizing Set and Related Problems , 2018, ALENEX.

[4]  Donald Kossmann,et al.  The Skyline operator , 2001, Proceedings 17th International Conference on Data Engineering.

[5]  Pankaj K. Agarwal,et al.  Approximating extent measures of points , 2004, JACM.

[6]  Jirí Matousek,et al.  Ray shooting and parametric search , 1992, STOC '92.

[7]  John N. Tsitsiklis,et al.  Introduction to linear optimization , 1997, Athena scientific optimization and computation series.

[8]  Raymond Chi-Wing Wong,et al.  k-Hit Query: Top-k Query with Probabilistic Utility Function , 2015, SIGMOD Conference.

[9]  Jeffrey Xu Yu,et al.  Diversifying Top-K Results , 2012, Proc. VLDB Endow..

[10]  Cheng Long,et al.  Efficient k-Regret Query Algorithm with Restriction-free Bound for any Dimensionality , 2018, SIGMOD Conference.

[11]  Abolfazl Asudeh,et al.  Efficient Computation of Regret-ratio Minimizing Set: A Compact Maxima Representative , 2017, SIGMOD Conference.

[12]  Laurence A. Wolsey,et al.  An analysis of the greedy algorithm for the submodular set covering problem , 1982, Comb..

[13]  David P. Dobkin,et al.  The quickhull algorithm for convex hulls , 1996, TOMS.

[14]  Mohamed A. Soliman,et al.  Top-k Query Processing in Uncertain Databases , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[15]  José H. Dulá,et al.  An Algorithm for Identifying the Frame of a Pointed Finite Conical Hull , 1998, INFORMS J. Comput..

[16]  Jeff M. Phillips,et al.  Chernoff-Hoeffding Inequality and Applications , 2012, ArXiv.

[17]  Subhash Suri,et al.  Efficient Algorithms for k-Regret Minimizing Sets , 2017, SEA.

[18]  Kazuhisa Makino,et al.  Interactive regret minimization , 2012, SIGMOD Conference.

[19]  Raymond Chi-Wing Wong,et al.  Strongly Truthful Interactive Regret Minimization , 2019, SIGMOD Conference.

[20]  Ashwin Lall,et al.  k-Regret Queries with Nonlinear Utilities , 2015, Proc. VLDB Endow..

[21]  Raymond Chi-Wing Wong,et al.  k-Regret Minimizing Set: Efficient Algorithms and Hardness , 2017, ICDT.

[22]  Alex Thomo,et al.  Computing k-Regret Minimizing Sets , 2014, Proc. VLDB Endow..

[23]  Raymond Chi-Wing Wong,et al.  FindYourFavorite: An Interactive System for Finding the User's Favorite Tuple in the Database , 2019, SIGMOD Conference.