From Language to Knowledge: Starting Hawk

Abstract : This report describes work completed by the MIT Computer Science and Artificial Intelligence Laboratory in support of DARPA's Rapid Knowledge Formation (RKF) program over the period from July 2000 to September 2003. The primary focus of the RKF program is to develop new technology to automate the task of transforming raw human- understandable information into encoded, machine-understandable information. The project described in this report addresses a central subtask of this task: converting natural language text into an encoded representation that can support computer inference. The technical approach taken in this effort is based on two key insights: First, we can make the translation task manageable by breaking it into successive stages of isolating information, then standardizing it, then encoding it, with each stage facilitated by proven components of natural language processing technology. Second, we can gain leverage during the translation process by exploiting human interaction at a number of distinct points along the way.

[1]  Norbert E. Fuchs,et al.  Attempto Controlled English (ACE)Language ManualVersion 3.0 , 1999 .

[2]  Joseph D. Novak,et al.  Learning creating and using knowledge: Concept maps as facilitative tools , 1998 .

[3]  M. Ross Quillian,et al.  The teachable language comprehender: a simulation program and theory of language , 1969, CACM.

[4]  Ramanathan V. Guha,et al.  Cyc: toward programs with common sense , 1990, CACM.

[5]  Jimmy J. Lin,et al.  Question answering from the web using knowledge annotation and knowledge mining techniques , 2003, CIKM '03.

[6]  Boris Katz,et al.  Using English for Indexing and Retrieving , 1991 .

[7]  Jimmy J. Lin,et al.  Viewing the Web as a Virtual Database for Question Answering , 2004, New Directions in Question Answering.

[8]  Jimmy J. Lin,et al.  Omnibase: Uniform Access to Heterogeneous Data for Question Answering , 2002, NLDB.

[9]  Boris Katz,et al.  Annotating the World Wide Web using Natural Language , 1997, RIAO.

[10]  N. Blackstone Essential Cell Biology: An Introduction to the Molecular Biology of the Cell.Bruce Alberts , Dennis Bray , Alexander Johnson , Julian Lewis , Martin Raff , Keith Roberts , Peter Walter , 1998 .

[11]  B. Alberts,et al.  Molecular Biology of the Cell, Third Edition , 1994 .

[12]  Boris Katz,et al.  Using empirical methods for evaluating expression and content similarity , 2004, 37th Annual Hawaii International Conference on System Sciences, 2004. Proceedings of the.

[13]  Kevin Knight,et al.  Building a Large-Scale Knowledge Base for Machine Translation , 1994, AAAI.

[14]  Joseph D. Novak,et al.  Learning How to Learn , 1984 .

[15]  Gary W. King,et al.  A Knowledge Acquisition Tool for Course of Action Analysis , 2003, IAAI.