Group-by and Aggregate Functions in XML Keyword Search

In this paper, we study how to support group-by and aggregate functions in XML keyword search. It goes beyond the simple keyword query, and raises several challenges including: (1) how to address the keyword ambiguity problem when interpreting a keyword query; (2) how to identify duplicated objects and relationships in order to guarantee the correctness of the results of aggregation functions; and (3) how to compute a keyword query with group-by and aggregate functions. We propose an approach to address the above challenges. As a result, our approach enables users to explore the data as much as possible with simple keyword queries. The experimental results on real datasets demonstrate that our approach can support keyword queries with group-by and aggregate functions which are not addressed by the LCA-based approaches while achieving a similar response time to that of LCA-based approaches.

[1]  Curtis E. Dyreson,et al.  MESSIAH: missing element-conscious SLCA nodes search in XML data , 2013, SIGMOD '13.

[2]  Laks V. S. Lakshmanan,et al.  Complex Group-By Queries for XML , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[3]  Tok Wang Ling,et al.  Breaking out of the MisMatch trap , 2014, 2014 IEEE 30th International Conference on Data Engineering.

[4]  Jianxin Li,et al.  Fast ELCA computation for keyword queries on XML data , 2010, EDBT '10.

[5]  Tok Wang Ling,et al.  Object Semantics for XML Keyword Search , 2014, DASFAA.

[6]  Stéphane Bressan,et al.  Discovering Semantics from Data-Centric XML , 2013, DEXA.

[7]  Tok Wang Ling,et al.  Effective XML Keyword Search with Relevance Oriented Ranking , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[8]  Feng Shao,et al.  XRANK: ranked keyword search over XML documents , 2003, SIGMOD '03.

[9]  Tok Wang Ling,et al.  Performing grouping and aggregate functions in XML queries , 2009, WWW '09.

[10]  Sandeep Tata,et al.  SQAK: doing more with keywords , 2008, SIGMOD Conference.

[11]  Yannis Papakonstantinou,et al.  Efficient keyword search for smallest LCAs in XML databases , 2005, SIGMOD '05.

[12]  Jianyong Wang,et al.  Effective keyword search for valuable lcas over xml documents , 2007, CIKM '07.

[13]  Xudong Lin,et al.  Fast SLCA and ELCA Computation for XML Keyword Queries Based on Set Intersection , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[14]  Tok Wang Ling,et al.  From Structure-Based to Semantics-Based: Towards Effective XML Keyword Search , 2013, ER.

[15]  Wenfei Fan,et al.  Keys with Upward Wildcards for XML , 2001, DEXA.

[16]  Yi Chen,et al.  Reasoning and identifying relevant matches for XML keyword search , 2008, Proc. VLDB Endow..

[17]  Berthold Reinwald,et al.  Towards keyword-driven analytical processing , 2007, SIGMOD '07.

[18]  Cong Yu,et al.  Schema-Free XQuery , 2004, VLDB.

[19]  David J. DeWitt,et al.  On supporting containment queries in relational database management systems , 2001, SIGMOD '01.