SAQR: An Efficient Scheme for Similarity-Aware Query Refinement

Query refinement techniques enable database systems to automatically adjust a submitted query so that its result satisfies some specified constraints. While current techniques are fairly successful in generating refined queries based on cardinality constraints, they are rather oblivious to the (dis)similarity between the input query and its corresponding refined version. Meanwhile, enforcing a similarity-aware query refinement is a rather challenging task as it would require an exhaustive examination of the large space of possible query refinements. To address this challenge, we propose a novel scheme for efficient Similarity-aware Query Refinement (SAQR). SAQR aims to balance the tradeoff between satisfying the cardinality and similarity constraints imposed on the refined query so that to maximize its overall benefit to the user. To achieve that goal, SAQR implements efficient strategies to minimize the costs incurred in exploring the available search space. In particular, SAQR utilizes both similarity-based and cardinality-based pruning techniques to bound the search space and quickly find a refined query that meets the user expectations. Our experimental evaluation shows the scalability exhibited by SAQR under various workload settings, and the significant benefits it provides.

[1]  Walid G. Aref,et al.  Efficient processing of window queries in the pyramid data structure , 1990, PODS '90.

[2]  Elke A. Rundensteiner,et al.  QRelX: generating meaningful queries that provide cardinality assurance , 2010, SIGMOD Conference.

[3]  Ihab F. Ilyas,et al.  A survey of top-k query processing techniques in relational database systems , 2008, CSUR.

[4]  Luis Gravano,et al.  Evaluating top-k queries over web-accessible databases , 2004, TODS.

[5]  Jianzhong Li,et al.  Probing Queries in Wireless Sensor Networks , 2008, 2008 The 28th International Conference on Distributed Computing Systems.

[6]  Moni Naor,et al.  Optimal aggregation algorithms for middleware , 2001, PODS '01.

[7]  Nick Koudas,et al.  Generating targeted queries for database testing , 2008, SIGMOD Conference.

[8]  Nick Koudas,et al.  Interactive query refinement , 2009, EDBT '09.

[9]  Ahmed Eldawy,et al.  LARS: A Location-Aware Recommender System , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[10]  Quoc Trung Tran,et al.  How to ConQueR why-not questions , 2010, SIGMOD Conference.

[11]  Jian Pei,et al.  Efficient Skyline and Top-k Retrieval in Subspaces , 2007, IEEE Transactions on Knowledge and Data Engineering.

[12]  Anthony K. H. Tung,et al.  Relaxing join and selection queries , 2006, VLDB.

[13]  Surajit Chaudhuri,et al.  Generating Queries with Cardinality Constraints for DBMS Testing , 2006, IEEE Transactions on Knowledge and Data Engineering.

[14]  Ion Muslea,et al.  Machine learning for online query relaxation , 2004, KDD.

[15]  Juliana Freire,et al.  Supporting Exploratory Queries in Databases , 2004, DASFAA.