RULE-BASED NAMED ENTITY RECOGNITION FOR GREEK FINANCIAL TEXTS

The identification and classification of proper names (named entity recognition) is considered an important task in the area of Information Retrieval and Extraction. A typical named entity recognition (NER) system mainly consists of a lexicon and a grammar. When moving to a new domain, these lexical resources should be customised, either manually or exploiting machine learning techniques. In this paper, we present a NER system based on hand crafted lexical resources. The system is part of a Greek information extraction system and was tested on a Greek corpus of financial news with satisfactory results.

[1]  David D. McDonald Internal and External Evidence in the Identification and Semantic Categorization of Proper Names , 1993 .

[2]  James Pustejovsky,et al.  Corpus processing for lexical acquisition , 1996 .

[3]  Maria Teresa Pazienza,et al.  Information Extraction A Multidisciplinary Approach to an Emerging Information Technology , 1997, Lecture Notes in Computer Science.

[4]  Sam Coates-Stephens,et al.  The Analysis and Acquisition of Proper Names for the Understanding of Free Text , 1992, Comput. Humanit..

[5]  Yorick Wilks,et al.  University of Sheffield: Description of the LaSIE System as Used for MUC-6 , 1995, MUC.

[6]  Ralph Grishman,et al.  Information Extraction: Techniques and Challenges , 1997, SCIE.

[7]  Ralph Grishman,et al.  NYU: Description of the MENE Named Entity System as Used in MUC-7 , 1998, MUC.

[8]  Marc Moens,et al.  Named Entity Recognition without Gazetteers , 1999, EACL.

[9]  Georgios Paliouras,et al.  USING MACHINE LEARNING TECHNIQUES FOR PART-OF-SPEECH TAGGING IN THE GREEK LANGUAGE , 2000 .

[10]  Georgios Paliouras,et al.  Named-Entity Recognition from Greek and English Texts , 1999, J. Intell. Robotic Syst..

[11]  Maria Liakata,et al.  Named Entity Recognition in Greek Texts , 2000, LREC.

[12]  Marc Moens,et al.  Description of the LTG System Used for MUC-7 , 1998, MUC.

[13]  Douglas E. Appelt,et al.  SRI International FASTUS SystemMUC-6 Test Results and Analysis , 1995, MUC.

[14]  Yorick Wilks,et al.  University of Sheffield: description of the LaSIE system as used for MUC-6 , 1995, MUC.

[15]  Richard M. Schwartz,et al.  Nymble: a High-Performance Learning Name-finder , 1997, ANLP.