Automatically Constructing Concept Hierarchies of Health-Related Human Goals

To realize the vision of intelligent agents on the web, agents need to be capable of understanding people's behavior. Such an understanding would enable them to better predict and support human activities on the web. If agents had access to knowledge about human goals, they could, for instance, recognize people's goals from their actions or reason about people's goals. In this work, we study to what extent it is feasible to automatically construct concept hierarchies of domain-specific human goals. This process consists of the following two steps: (1) extracting human goal instances from a search query log and (2) inferring hierarchical structures by applying clustering techniques. To compare resulting concept hierarchies, we manually construct a golden standard and calculate taxonomic overlaps. In our experiments, we achieve taxonomic overlaps of up to ~51% for the health domain and up to ~60% for individual health subdomains. In an illustration scenario, we provide a prototypical implementation to automatically complement goal concept hierarchies by means-ends relations, i.e. relating goals to actions which potentially contribute to their accomplishment. Our findings are particularly relevant for knowledge engineers interested in (i) acquiring knowledge about human goals as well as (ii) automating the process of constructing goal concept hierarchies.

[1]  Marvin Minsky,et al.  Commonsense-based interfaces , 2000, CACM.

[2]  Rajeev Sangal,et al.  Proceedings of the 20th international joint conference on Artifical intelligence , 2007 .

[3]  Enrico Motta,et al.  The Semantic Web - ISWC 2005, 4th International Semantic Web Conference, ISWC 2005, Galway, Ireland, November 6-10, 2005, Proceedings , 2005, SEMWEB.

[4]  Moritz Tenorth,et al.  Understanding and executing instructions for everyday manipulation tasks from the World Wide Web , 2010, 2010 IEEE International Conference on Robotics and Automation.

[5]  Abdur Chowdhury,et al.  A picture of search , 2006, InfoScale '06.

[6]  Philipp Cimiano,et al.  Ontology learning and population from text - algorithms, evaluation and applications , 2006 .

[7]  Xiaoxin Yin,et al.  Building taxonomy of web search intents for name entity queries , 2010, WWW '10.

[8]  Markus Strohmaier,et al.  Acquiring knowledge about human goals from Search Query Logs , 2012, Inf. Process. Manag..

[9]  Jun Ota,et al.  Automatic modeling of user's real world activities from the web for semantic IR , 2010, SEMSEARCH '10.

[10]  Oren Etzioni,et al.  Open Information Extraction from the Web , 2007, CACM.

[11]  Andrei Broder,et al.  A taxonomy of web search , 2002, SIGF.

[12]  Shoji Kurakake,et al.  Task Knowledge Based Retrieval for Service Relevant to Mobile User's Activity , 2005, SEMWEB.

[13]  Zellig S. Harris,et al.  Mathematical structures of language , 1968, Interscience tracts in pure and applied mathematics.

[14]  N. Garc'ia-Pedrajas,et al.  CIXL2: A Crossover Operator for Evolutionary Algorithms Based on Population Features , 2005, J. Artif. Intell. Res..

[15]  Carlo Strapparava,et al.  Adaptive Hypermedia and Adaptive Web-Based Systems, 5th International Conference, AH 2008, Hannover, Germany, July 29 - August 1, 2008. Proceedings , 2008, AH.

[16]  H. Lieberman Common Consensus : a web-based game for collecting commonsense goals , 2007 .

[17]  Henry Lieberman,et al.  GOOSE: A Goal-Oriented Search Engine with Commonsense , 2002, AH.

[18]  Sandra Carberry,et al.  Techniques for Plan Recognition , 2001, User Modeling and User-Adapted Interaction.

[19]  Hugo Liu,et al.  ConceptNet — A Practical Commonsense Reasoning Tool-Kit , 2004 .

[20]  Henry Lieberman,et al.  The why UI: using goal networks to improve user interfaces , 2010, IUI '10.

[21]  Steffen Staab,et al.  Measuring Similarity between Ontologies , 2002, EKAW.

[22]  Luc De Raedt,et al.  Machine Learning: ECML 2001 , 2001, Lecture Notes in Computer Science.

[23]  Bill C. Hardgrave,et al.  Object-oriented curricula in academic programs , 2000, CACM.

[24]  Douglas B. Lenat,et al.  CYC: a large-scale investment in knowledge infrastructure , 1995, CACM.

[25]  Steffen Staab,et al.  Learning Concept Hierarchies from Text Corpora using Formal Concept Analysis , 2005, J. Artif. Intell. Res..

[26]  Catherine Havasi,et al.  ConceptNet 3 : a Flexible , Multilingual Semantic Network for Common Sense Knowledge , 2007 .

[27]  Peter D. Turney Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL , 2001, ECML.