Automatically Extracting Procedural Knowledge from Instructional Texts using Natural Language Processing

Procedural knowledge is the knowledge required to perform certain tasks, and forms an important part of expertise. A major source of procedural knowledge is natural language instructions. While these readable instructions have been useful learning resources for human, they are not interpretable by machines. Automatically acquiring procedural knowledge in machine interpretable formats from instructions has become an increasingly popular research topic due to their potential applications in process automation. However, it has been insufficiently addressed. This paper presents an approach and an implemented system to assist users to automatically acquire procedural knowledge in structured forms from instructions. We introduce a generic semantic representation of procedures for analysing instructions, using which natural language techniques are applied to automatically extract structured procedures from instructions. The method is evaluated in three domains to justify the generality of the proposed semantic representation as well as the effectiveness of the implemented automatic system.

[1]  Alois Ferscha,et al.  Constructing Ambient Intelligence - AmI 2007 Workshops Darmstadt, Germany, November 7-10, 2007 Revised Papers , 2008, AmI Workshops.

[2]  Guy Lapalme,et al.  Choosing Rhetorical Structures To Plan Instructional Texts , 2000, Comput. Intell..

[3]  Beatrice Santorini,et al.  The Penn Treebank: An Overview , 2003 .

[4]  Keith Vander Linden,et al.  Automatically Eliciting Task Models from Written Task Narratives , 2002, CADUI.

[5]  Yolanda Gil,et al.  TellMe: learning procedures from tutorial instruction , 2011, IUI '11.

[6]  Shijian Lu,et al.  Automatically Generating Effective Online Help , 2005 .

[7]  Nicholas J. Belkin,et al.  Identifying and improving retrieval for procedural questions , 2007, Inf. Process. Manag..

[8]  Patrick Saint-Dizier,et al.  Investigating the Structure of Procedural Texts for Answering How-to Questions , 2008, LREC.

[9]  Sophia Ananiadou,et al.  The C-value/NC-value Method of Automatic Recognition for Multi-Word Terms , 1998, ECDL.

[10]  Yolanda Gil,et al.  A formal framework for combining natural instruction and demonstration for end-user programming , 2011, IUI '11.

[11]  Shijian Lu,et al.  Automated knowledge acquisition for instructional text generation , 2002, SIGDOC '02.

[12]  Max Mühlhäuser,et al.  Smart Products: An Introduction , 2007, AmI Workshops.

[13]  Jian Tang,et al.  Handling uncertainties in workflow applications , 1996, CIKM '96.

[14]  Ziqi Zhang,et al.  A Comprehensive Solution to Procedural Knowledge Acquisition using Information Extraction , 2010, KDIR.

[15]  Harry Bunt,et al.  Question Answering: What's Next? , 2005 .

[16]  George Hripcsak,et al.  Technical Brief: Agreement, the F-Measure, and Reliability in Information Retrieval , 2005, J. Am. Medical Informatics Assoc..