论文信息 - Efficient top-k algorithm for eXtensible Markup Language keyword search

Efficient top-k algorithm for eXtensible Markup Language keyword search

The ability to compute top- k matches to eXtensible Markup Language (XML) queries is gaining importance owing to the increasing of large XML repositories. Current work on top- k match to XML queries mainly focuses on employing XPath, XQuery or NEXI as the query language, whereas little work has concerned on top- k match to XML keyword search. In this study, the authors propose a novel two-layer-based index construction and associated algorithm for efficiently computing top- k results for XML keyword search. Our core contribution, the two-layer-based inverted Index and associated algorithm for XML keyword search take both score-sorted-sequence and Dewey ID-sorted-sequence into consideration, and thus gain performance benefits during querying process. The authors have conducted expensive experiments and our experimental results show efficiency advantages compared with existing approaches.

Hang Yu | Zhi-Hong Deng | Ning Gao

[1] Yannis Papakonstantinou,et al. Efficient keyword search for smallest LCAs in XML databases , 2005, SIGMOD '05.

[2] Gerhard Weikum,et al. TopX: efficient and versatile top-k query processing for semistructured data , 2007, The VLDB Journal.

[3] Laks V. S. Lakshmanan,et al. FleXPath: flexible structure and full-text querying for XML , 2004, SIGMOD '04.

[4] Gerhard Weikum,et al. An Efficient and Versatile Query Engine for TopX Search , 2005, VLDB.

[5] Shiwei Tang,et al. Adaptive Top-k Algorithm in SLCA-Based XML Keyword Search , 2010, 2010 12th International Asia-Pacific Web Conference.

[6] Uzi Vishkin,et al. On Finding Lowest Common Ancestors: Simplification and Parallelization , 1988, AWOC.

[7] Feng Shao,et al. XRANK: ranked keyword search over XML documents , 2003, SIGMOD '03.

[8] Divesh Srivastava,et al. Keyword proximity search in XML trees , 2006, IEEE Transactions on Knowledge and Data Engineering.

[9] Nicholas Kushmerick,et al. Expressive retrieval from XML documents , 2001, SIGIR '01.

[10] Cong Yu,et al. Schema-Free XQuery , 2004, VLDB.

[11] Yi Chen,et al. Identifying meaningful return information for XML keyword search , 2007, SIGMOD '07.

[12] Tok Wang Ling,et al. Effective XML Keyword Search with Relevance Oriented Ranking , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[13] Jianyong Wang,et al. Effective keyword search for valuable lcas over xml documents , 2007, CIKM '07.