Acquisition of Linguistic Patterns for Knowledge-based Information Extraction

In this paper we present a new method of automatic acquisition of linguistic patterns for Information Extraction, as implemented in the CICERO system. Our approach combines lexico-semantic information available from the WordNet database with collocating data extracted from training corpora. Due to the open-domain nature of the WordNet information and the immediate availability of large collections of texts, our method can be easily ported to open-domain Information Extraction.

[1]  Eric Brill,et al.  A Simple Rule-Based Part of Speech Tagger , 1992, HLT.

[2]  Dekang Lin,et al.  PRINCIPAR - An Efficient, Broad-coverage, Principle-based Parser , 1994, COLING.

[3]  Ralph M. Weischedel,et al.  BEN: description of the PLUM system as used for MUC-6 , 1995, MUC.

[4]  Ellen Riloff,et al.  Learning Dictionaries for Information Extraction by Multi-Level Bootstrapping , 1999, AAAI/IAAI.

[5]  Ralph Grishman,et al.  New York University PROTEUS system: MUC-4 test results and analysis , 1992, MUC.

[6]  Sanda Harabagiu Testing Gricean Constraints on a WordNet-based Coherence Evaluation System , 1996 .

[7]  Lynette Hirschman,et al.  MITRE: Description of the Alembic System Used for MUC-6 , 1995, MUC.

[8]  Sasa Buvac Quantificational Logic of Context , 1996, AAAI/IAAI, Vol. 1.

[9]  Dan I. Moldovan,et al.  Acquisition of Linguistic Patterns for Knowledge-Based Information Extraction , 1995, IEEE Trans. Knowl. Data Eng..

[10]  John McCarthy,et al.  Notes on Formalizing Context , 1993, IJCAI.

[11]  Joseph Weizenbaum,et al.  and Machine , 1977 .

[12]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[13]  Herbert Gish,et al.  BBN: Description of the PLUM System as Used for MUC-5 , 2005, MUC.

[14]  David Fisher,et al.  Description of the UMass system as used for MUC-6 , 1995, MUC.

[15]  George A. Miller WordNet: A Lexical Database for English , 1992, HLT.

[16]  Raymond J. Mooney,et al.  Relational Learning of Pattern-Match Rules for Information Extraction , 1999, CoNLL.

[17]  Ellen Riloff,et al.  Automatically Constructing a Dictionary for Information Extraction Tasks , 1993, AAAI.

[18]  Alan W. Biermann,et al.  The Role of WordNet in The Creation of a Trainable Message Understanding System , 1997, AAAI/IAAI.

[19]  Douglas E. Appelt,et al.  The SRI MUC-5 JV-FASTUS In-formation Extraction System , 1993 .

[20]  Michael Collins,et al.  A New Statistical Parser Based on Bigram Lexical Dependencies , 1996, ACL.

[21]  David Stallard,et al.  The Mapping Unit Approach to Subcategorization , 1991, HLT.

[22]  Rebecca N. Wright,et al.  Finite-State Approximation of Phrase Structure Grammars , 1991, ACL.

[23]  David Fisher,et al.  CRYSTAL: Inducing a Conceptual Dictionary , 1995, IJCAI.

[24]  Jerry R. Hobbs,et al.  Localizing Expression Of Ambiguity , 1988, ANLP.