Generating Extraction Patterns from a Large Semantic Network and an Untagged Corpus

This paper presents a module dedicated to the elaboration of linguistic resources for a versatile Information Extraction system. In order to decrease the time spent on the elaboration of resources for the IE system and guide the end-user in a new domain, we suggest to use a machine learning system that helps defining new templates and associated resources. This knowledge is automatically derived from the text collection, in interaction with a large semantic network.

[1]  Ellen Riloff,et al.  Little words can make a big difference for text classification , 1995, SIGIR '95.

[2]  Emmanuel Morin Projecting Corpus-Based Semantic Links on a Thesaurus , 1999, ACL.

[3]  Douglas E. Appelt,et al.  FASTUS: A Finite-state Processor for Information Extraction from Real-world Text , 1993, IJCAI.

[4]  Fabio Ciravegna,et al.  Adaptive Information Extraction from Text by Rule Induction and Generalisation , 2001, IJCAI.

[5]  Maria T. Pazienza,et al.  Information Extraction , 2002, Lecture Notes in Computer Science.

[6]  David Fisher,et al.  CRYSTAL: Inducing a Conceptual Dictionary , 1995, IJCAI.

[7]  Richard M. Schwartz,et al.  Nymble: a High-Performance Learning Name-finder , 1997, ANLP.

[8]  Dayne Freitag,et al.  Machine Learning for Information Extraction in Informal Domains , 2000, Machine Learning.

[9]  Ellen Riloff Bootstrapping for text learning tasks , 1999 .

[10]  K. Minton Extraction Patterns for Information Extraction Tasks : A Survey , 1999 .

[11]  Ralph Grishman,et al.  Scenario customization for information extraction , 2000 .

[12]  Marie Candito Organisation modulaire et parametrable de grammaires electroniques lexicalisees application du francais et a l'italien , 1999 .

[13]  Ellen Riloff,et al.  Automatically Constructing a Dictionary for Information Extraction Tasks , 1993, AAAI.

[14]  Thierry Poibeau Deriving a multi-domain information extraction system from a rough ontology , 2001, IJCAI.

[15]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[16]  Filippo Neri,et al.  Machine Learning for Information Extraction , 1997, SCIE.

[17]  Alan W. Biermann,et al.  The Role of WordNet in The Creation of a Trainable Message Understanding System , 1997, AAAI/IAAI.

[18]  Gregory Grefenstette Evaluating the adequacy of a multilingual transfer dictionary for the cross language information retrieval , 1998 .

[19]  Max Silberztein,et al.  Dictionnaires électroniques et analyse automatique de textes : le système intex , 1993 .

[20]  Thierry Poibeau,et al.  Inferring Knowledge from a Large Semantic Network , 2002, COLING.

[21]  Ellen Riloff,et al.  Learning Dictionaries for Information Extraction by Multi-Level Bootstrapping , 1999, AAAI/IAAI.