Generating Query Facets Using Knowledge Bases

A query facet is a significant list of information nuggets that explains an underlying aspect of a query. Existing algorithms mine facets of a query by extracting frequent lists contained in top search results. The coverage of facets and facet items mined by this kind of method may be limited, because only a small number of search results are used. To solve this problem, we propose mining query facets by using knowledge bases, which contain high-quality structured data. Specifically, we first generate facets based on the properties of the entities which are contained in Freebase and correspond to the query. Second, we mine initial query facets from search results and then expand them by finding similar entities in Freebase. Experimental results show that our proposed method can significantly improve the coverage of facet items over the state-of-the-art algorithms.
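The two steps described above can be illustrated with a minimal sketch. It is not the authors' implementation: the in-memory dictionaries `TOY_KB` and `ENTITY_TYPES` are hypothetical stand-ins for Freebase, the function names are invented for illustration, and "similar entities" is assumed here to mean entities that carry every type shared by the seed items, which the abstract does not specify.

```python
# Hypothetical toy knowledge base (NOT Freebase data):
# query entity -> property name -> list of property values.
TOY_KB = {
    "watch": {
        "brands": ["Rolex", "Omega", "Seiko"],
        "styles": ["dive", "dress", "pilot"],
        "invented_in": ["Europe"],
    },
}

# Hypothetical type assignments used to judge entity similarity.
ENTITY_TYPES = {
    "Rolex": {"watch_brand", "company"},
    "Omega": {"watch_brand", "company"},
    "Seiko": {"watch_brand", "company"},
    "Casio": {"watch_brand", "company"},
    "Toyota": {"car_brand", "company"},
}


def facets_from_properties(query_entity, kb, min_items=2):
    """Step 1 (sketch): treat each multi-valued property of the
    entity matching the query as one candidate facet."""
    props = kb.get(query_entity, {})
    return {
        name: list(values)
        for name, values in props.items()
        if len(values) >= min_items
    }


def expand_facet(seed_items, entity_types):
    """Step 2 (sketch): expand a facet mined from search results with
    entities that have every type shared by the known seed items.
    This similarity rule is an assumption made for illustration."""
    known = [entity_types[item] for item in seed_items if item in entity_types]
    if not known:
        return list(seed_items)
    shared = set.intersection(*known)
    if not shared:
        return list(seed_items)
    seeds = set(seed_items)
    additions = [
        entity
        for entity, types in entity_types.items()
        if entity not in seeds and shared <= types
    ]
    return list(seed_items) + additions


property_facets = facets_from_properties("watch", TOY_KB)
expanded = expand_facet(["Rolex", "Omega"], ENTITY_TYPES)
```

In this toy setting, the single-valued property `invented_in` is dropped because a one-item list is not a useful facet, and `Toyota` is not added to the brand facet because it lacks the `watch_brand` type shared by the seeds.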
