Leveraging Procedural Knowledge for Task-oriented Search

Many search engine users attempt to satisfy an information need by issuing multiple queries, with the expectation that each result will contribute some portion of the required information. Previous research has shown that structured or semi-structured descriptive knowledge bases (such as Wikipedia) can be used to improve search quality and experience for general or entity-centric queries. However, such resources do not have sufficient coverage of procedural knowledge, i.e. what actions should be performed and what factors should be considered to achieve some goal; such procedural knowledge is crucial when responding to task-oriented search queries. This paper provides a first attempt to bridge the gap between two evolving research areas: development of procedural knowledge bases (such as wikiHow) and task-oriented search. We investigate whether task-oriented search can benefit from existing procedural knowledge (search task suggestion) and whether automatic procedural knowledge construction can benefit from users' search activities (automatic procedural knowledge base construction). We propose to create a three-way parallel corpus of queries, query contexts, and task descriptions, and reduce both problems to sequence labeling tasks. We propose a set of textual features and structural features to identify key search phrases from task descriptions, and then adapt similar features to extract wikiHow-style procedural knowledge descriptions from search queries and relevant text snippets. We compare our proposed solution with baseline algorithms, commercial search engines, and the (manually-curated) wikiHow procedural knowledge; experimental results show an improvement of +0.28 to +0.41 in terms of Precision@8 and mean average precision (MAP).

[1]  Matthai Philipose,et al.  Mining models of human activities from the web , 2004, WWW '04.

[2]  Eugene Agichtein,et al.  Ready to buy or just browsing?: detecting web searcher goals from interaction data , 2010, SIGIR.

[3]  Ying Li,et al.  Detecting online commercial intention (OCI) , 2006, WWW '06.

[4]  Karen L. Myers A Procedural Knowledge Approach to Task-Level Control , 1996, AIPS.

[5]  Daniel Borrajo,et al.  From Unstructured Web Knowledge to Plan Descriptions , 2011, Information Retrieval and Mining in Distributed Environments.

[6]  Enhong Chen,et al.  Context-aware query suggestion by mining click-through and session data , 2008, KDD.

[7]  Daniel E. Rose,et al.  Understanding user goals in web search , 2004, WWW '04.

[8]  Mounia Lalmas,et al.  Penguins in sweaters, or serendipitous entity search on user-generated content , 2013, CIKM.

[9]  Somnath Banerjee,et al.  Question Classification and Answering from Procedural Text in English , 2012 .

[10]  Nathanael Chambers,et al.  Unsupervised Learning of Narrative Event Chains , 2008, ACL.

[11]  Jiawei Han,et al.  On building entity recommender systems using user click log and freebase knowledge , 2014, WSDM.

[12]  Zhenyu Liu,et al.  Automatic identification of user goals in Web search , 2005, WWW '05.

[13]  Abdur Chowdhury,et al.  A picture of search , 2006, InfoScale '06.

[14]  M.P. Georgeff,et al.  Procedural knowledge , 1986, Proceedings of the IEEE.

[15]  Ricardo A. Baeza-Yates,et al.  Query Recommendation Using Query Logs in Search Engines , 2004, EDBT Workshops.

[16]  Sung-Hyon Myaeng,et al.  Automatic construction of a large-scale situation ontology by mining how-to instructions from the web , 2010, J. Web Semant..

[17]  Ryen W. White,et al.  Supporting Complex Search Tasks , 2014, CIKM.

[18]  Aristides Gionis,et al.  Answers, not links: extracting tips from yahoo! answers to address how-to web queries , 2012, WSDM '12.

[19]  James Allan,et al.  Entity query feature expansion using knowledge base links , 2014, SIGIR.

[20]  Michael Gamon,et al.  Active objects: actions for entity-centric search , 2012, WWW.

[21]  Gary Marchionini,et al.  Exploratory search , 2006, Commun. ACM.

[22]  Roger C. Schank,et al.  Scripts, plans, goals and understanding: an inquiry into human knowledge structures , 1978 .

[23]  Simon Buckingham Shum,et al.  Knowledge Representation with Ontologies: The Present and Future , 2004, IEEE Intell. Syst..

[24]  Ryen W. White,et al.  Task tours: helping users tackle complex search tasks , 2012, CIKM.

[25]  John R. Anderson Cognitive Psychology and Its Implications , 1980 .

[26]  Filip Radlinski,et al.  Inferring query intent from reformulations and clicks , 2010, WWW '10.

[27]  Tie-Yan Liu,et al.  Actively predicting diverse search intent from user browsing behaviors , 2010, WWW '10.

[28]  Ewan Klein,et al.  A semantic web of know-how: linked data for community-centric tasks , 2014, WWW '14 Companion.

[29]  Kenneth Ward Church,et al.  Query suggestion using hitting time , 2008, CIKM '08.

[30]  Francesco Bonchi,et al.  From machu_picchu to "rafting the urubamba river": anticipating information needs via the entity-query graph , 2013, WSDM '13.

[31]  Katsumi Tanaka,et al.  Investigating users' query formulations for cognitive search intents , 2014, SIGIR.

[32]  ChengXiang Zhai,et al.  Unsupervised identification of synonymous query intent templates for attribute intents , 2013, CIKM.

[33]  Ravi Kumar,et al.  A web of concepts , 2009, PODS.

[34]  Yiqun Liu,et al.  Overview of the NTCIR-11 IMine Task , 2014, NTCIR.

[35]  Benjamin Rey,et al.  Generating query substitutions , 2006, WWW '06.

[36]  Nick Craswell,et al.  Random walks on the click graph , 2007, SIGIR.

[37]  Patrick Saint-Dizier,et al.  Investigating the Structure of Procedural Texts for Answering How-to Questions , 2008, LREC.

[38]  Jacek Gwizdka,et al.  Search behaviors in different task types , 2010, JCDL '10.

[39]  Jun Ota,et al.  Automatic modeling of user's real world activities from the web for semantic IR , 2010, SEMSEARCH '10.

[40]  Ryen W. White,et al.  Assessing the scenic route: measuring the value of search trails in web logs , 2010, SIGIR.

[41]  Andrei Broder,et al.  A taxonomy of web search , 2002, SIGF.

[42]  Ying Li,et al.  QUADS: question answering for decision support , 2014, SIGIR.