Effective XML keyword query processing

Keyword search is currently regarded as the most promising approach to information retrieval over XML data, because it relieves users of having to understand complex XML document schemas and write difficult queries in XPath or XQuery. To date, various query processing techniques have been proposed to obtain meaningful keyword search results using LCA (Lowest Common Ancestor) semantics under the tree-based approach. Among the many LCA-based techniques proposed to produce more accurate and meaningful results, SLCA (Smallest LCA) and ELCA (Exclusive LCA) are considered the most popular. However, because of the AND-semantics constraint of LCA-based techniques, SLCA and ELCA return NULL for keyword queries that involve missing elements, and they yield unintended results when the technique returns the root element of the document. To address these issues, we propose an effective XML keyword query processing technique. In this paper we present the proposed technique, which is based on ELCA query semantics and returns meaningful results when an ELCA-based technique would return NULL or the document root element, thereby providing better information discovery over XML data. The proposed technique can also be applied to SLCA-based techniques to obtain similarly meaningful SLCA-based results.
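The two failure cases described above can be illustrated with a small brute-force sketch of SLCA semantics. This is not the technique proposed in the paper; it only shows, under assumed data, why AND semantics produce an empty (NULL) result or the document root. The Dewey labels, the toy keyword index, and the function names `lca` and `slca` are all hypothetical and chosen for illustration.

```python
from itertools import product

def lca(a, b):
    """Lowest common ancestor of two Dewey labels = their longest common prefix."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return a[:n]

def slca(match_lists):
    """Brute-force SLCA: one list of matching Dewey labels per query keyword.

    Exponential in the number of keywords; meant only to illustrate the semantics.
    """
    # AND semantics: a keyword with no match makes the whole result empty.
    if any(not matches for matches in match_lists):
        return []
    # Collect the LCA of every combination that takes one match per keyword.
    candidates = set()
    for combo in product(*match_lists):
        node = combo[0]
        for other in combo[1:]:
            node = lca(node, other)
        candidates.add(node)
    # Keep only candidates that have no proper descendant among the candidates.
    return sorted(
        c for c in candidates
        if not any(d != c and d[:len(c)] == c for d in candidates)
    )

# Hypothetical keyword index over a tiny document whose root has label (0,).
index = {
    "xml":     [(0, 0, 0), (0, 1, 0)],
    "keyword": [(0, 0, 1)],
    "ranking": [(0, 1, 1)],
    "missing": [],  # a term that occurs nowhere in the document
}

specific = slca([index["xml"], index["keyword"]])      # a proper subtree
root_only = slca([index["keyword"], index["ranking"]]) # only the root covers both
empty = slca([index["xml"], index["missing"]])         # NULL result
```

In this toy setting the first query is answered by the subtree `(0, 0)`, the second collapses to the root `(0,)` because its keywords occur only in different branches, and the third is empty because one keyword has no match. These last two are the cases the abstract identifies as problematic.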
