IR based Task-Model Learning: Automating the hierarchical structuring of tasks

Task-models concretize general requests to support users in real-world scenarios. In this paper, we present an IR based algorithm (IRTML) to automate the construction of hierarchically structured task-models. In contrast to other approaches, our algorithm is capable of assigning general tasks closer to the top and specific tasks closer to the bottom. Connections between tasks are established by extending Turney’s PMI-IR measure. To evaluate our algorithm, we manually created a ground truth in the health-care domain consisting of 14 domains. We compared the IRTML algorithm to three state-of-the-art algorithms to generate hierarchical structures, i.e. BiSection K-means, Formal Concept Analysis and Bottom-Up Clustering. Our results show that IRTML achieves a 25.9% taxonomic overlap with the ground truth, a 32.0% improvement over the compared algorithms.

[1]  Kenneth Ward Church,et al.  Word Association Norms, Mutual Information, and Lexicography , 1989, ACL.

[2]  Giovanna Castellano,et al.  Computational Intelligence techniques for Web personalization , 2008, Web Intell. Agent Syst..

[3]  Paola Velardi,et al.  Monitoring the status of a research community through a Knowledge Map , 2008, Web Intell. Agent Syst..

[4]  Jeff A. Bilmes,et al.  Structure Learning on Large Scale Common Sense Statistical Models of Human State , 2008, AAAI.

[5]  Kenneth Ward Church,et al.  Using Statistics in Lexical Analysis , 2003, Lexical Acquisition: Exploiting On-Line Resources to Build a Lexicon.

[6]  Peter D. Turney Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL , 2001, ECML.

[7]  Paul M. B. Vitányi,et al.  The Google Similarity Distance , 2004, IEEE Transactions on Knowledge and Data Engineering.

[8]  Shoji Kurakake,et al.  Construction and Use of Role-Ontology for Task-Based Service Navigation System , 2006, International Semantic Web Conference.

[9]  Henry A. Kautz,et al.  Sensor-Based Understanding of Daily Life via Large-Scale Use of Common Sense , 2006, AAAI.

[10]  Moritz Tenorth,et al.  Understanding and executing instructions for everyday manipulation tasks from the World Wide Web , 2010, 2010 IEEE International Conference on Robotics and Automation.

[11]  Doug Downey,et al.  Web-scale information extraction in knowitall: (preliminary results) , 2004, WWW '04.

[12]  J. Jenkins,et al.  Word association norms , 1964 .

[13]  Abdur Chowdhury,et al.  A picture of search , 2006, InfoScale '06.

[14]  Jerry R. Hobbs,et al.  DAML-S: Semantic Markup for Web Services , 2001, SWWS.

[15]  Tsau Young Lin,et al.  A semi-supervised efficient learning approach to extract biological relationships from web-based biomedical digital library , 2006, Web Intell. Agent Syst..

[16]  Kevin Crowston,et al.  Organizing Business Knowledge: The MIT Process Handbook , 2003 .

[17]  Pawan Lingras,et al.  Interval set clustering of web users using modified Kohonen self-organizing maps based on the properties of rough sets , 2004, Web Intell. Agent Syst..

[18]  Jun Ota,et al.  Automatic modeling of user's real world activities from the web for semantic IR , 2010, SEMSEARCH '10.

[19]  George A. Vouros,et al.  Learning subsumption hierarchies of ontology concepts from texts , 2010, Web Intell. Agent Syst..

[20]  Steffen Staab,et al.  Measuring Similarity between Ontologies , 2002, EKAW.

[21]  Matthai Philipose,et al.  Mining models of human activities from the web , 2004, WWW '04.

[22]  Jeff A. Bilmes,et al.  Learning Large Scale Common Sense Models of Everyday Life , 2007, AAAI.

[23]  Richi Nayak,et al.  Mining world knowledge for analysis of search engine content , 2007, Web Intell. Agent Syst..

[24]  Markus Strohmaier,et al.  Acquiring knowledge about human goals from Search Query Logs , 2012, Inf. Process. Manag..

[25]  David Snchez Domain Ontology Learning from the Web , 2008 .

[26]  Bijan Parsia,et al.  Task Computing - The Semantic Web Meets Pervasive Computing , 2003, SEMWEB.

[27]  Shoji Kurakake,et al.  Task Knowledge Based Retrieval for Service Relevant to Mobile User's Activity , 2005, SEMWEB.

[28]  Philipp Cimiano,et al.  Ontology learning and population from text - algorithms, evaluation and applications , 2006 .

[29]  Mark Klein,et al.  Towards High-Precision Service Retrieval , 2002, SEMWEB.