Paper information - Refinement Driven Processing of Aggregation Constrained Queries

Refinement Driven Processing of Aggregation Constrained Queries

Although existing database systems give users an efficient means to select tuples based on attribute criteria, they provide little means to select tuples based on whether they meet aggregate requirements. For instance, a requirement may be that the cardinality of the query result must be 1000, or that the sum of a particular attribute must be less than $5000. In this work, we term such queries “Aggregation Constrained Queries” (ACQs). ACQs are crucial in many decision support applications to maintain a product’s competitive edge in the fast-moving field of data processing. The challenge in processing ACQs is that users are unfamiliar with the underlying data, which results in queries being either too strict or too broad. Because ACQs are not supported, users have to resort to a frustrating trial-and-error query refinement process. In this paper, we introduce and define the semantics of ACQs. We propose a refinement-based approach, called ACQUIRE, to efficiently process a range of ACQs. Lastly, our experimental analysis demonstrates the superiority of our technique over extensions of existing algorithms. More specifically, ACQUIRE runs up to 2 orders of magnitude faster than the compared techniques while producing a 2X reduction in the amount of refinement made to the input queries.
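To make the trial-and-error refinement process concrete, the following is a minimal sketch of what a user does by hand today: relaxing a selection predicate step by step until a cardinality constraint is met. It is not the ACQUIRE algorithm, whose details the abstract does not give. The function name `refine_threshold`, its parameters, and the `price <= threshold` predicate are all hypothetical and chosen only for illustration.

```python
from typing import List, Optional, Tuple


def refine_threshold(
    prices: List[float],
    start: float,
    target_count: int,
    step: float,
    max_steps: int = 1000,
) -> Optional[Tuple[float, int]]:
    """Naively relax the predicate `price <= threshold` until at least
    `target_count` rows qualify. Illustrative only; not ACQUIRE."""
    threshold = start
    for _ in range(max_steps + 1):
        # Re-evaluate the aggregate (here COUNT) for the current predicate.
        count = sum(1 for p in prices if p <= threshold)
        if count >= target_count:
            return threshold, count
        # Constraint not met: the query is too strict, so broaden it.
        threshold += step
    # Gave up without satisfying the aggregate constraint.
    return None


if __name__ == "__main__":
    result = refine_threshold([10.0, 20.0, 30.0, 40.0, 50.0], 15.0, 3, 10.0)
    print(result)
```

Each iteration re-runs the full aggregate, which is the repeated work that makes manual refinement costly on large data and which a dedicated refinement-based approach would aim to avoid.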

Elke A. Rundensteiner | Samuel Madden | Manasi Vartak | Venkatesh Raghavan
