Named Entity Recognition by Neural Sliding Window

Named Entity Recognition (NER) is an important subtask of document processing such as Information Extraction. This paper describes a NER algorithm which uses a Multi-Layer Perceptron (MLP) to find and classify entities in natural language text. In particular we use the MLP to implement a new supervised context-based NER approach called Sliding Window Neural (SWiN). The SWiN method is a good solution for domains where the documents are grammatically ill-formed and it is difficult to exploit the features derived from linguistic analysis. Experiments indicate good accuracy compared with traditional approaches and demonstrate the system's portability.

[1]  Andrew McCallum,et al.  Information Extraction with HMMs and Shrinkage , 1999 .

[2]  Raymond J. Mooney,et al.  Relational Learning of Pattern-Match Rules for Information Extraction , 1999, CoNLL.

[3]  Stephen Soderland,et al.  Learning Information Extraction Rules for Semi-Structured and Free Text , 1999, Machine Learning.

[4]  Georgios Paliouras,et al.  Learning Decision Trees for Named-Entity Recognition and Classification , 2000 .

[5]  Fabio Ciravegna,et al.  Adaptive Information Extraction from Text by Rule Induction and Generalisation , 2001, IJCAI.

[6]  Maria Leonor Pacheco,et al.  of the Association for Computational Linguistics: , 2001 .

[7]  Ralph Grishman,et al.  Exploiting Diverse Knowledge Sources via Maximum Entropy in Named Entity Recognition , 1998, VLC@COLING/ACL.

[8]  Nancy Chinchor,et al.  MUC-4 evaluation metrics , 1992, MUC.

[9]  Dan Roth,et al.  Relational Learning via Propositional Algorithms: An Information Extraction Case Study , 2001, IJCAI.

[10]  Ricardo Baeza-Yates,et al.  Information Retrieval: Data Structures and Algorithms , 1992 .

[11]  Hwee Tou Ng,et al.  Named Entity Recognition with a Maximum Entropy Approach , 2003, CoNLL.

[12]  Jian Su,et al.  Named Entity Recognition using an HMM-based Chunk Tagger , 2002, ACL.

[13]  Dayne Freitag,et al.  Machine Learning for Information Extraction in Informal Domains , 2000, Machine Learning.

[14]  Baohua Gu,et al.  Recognizing Biomedical Named Entities in the Absence of Human Annotated Corpora , 2007, 2007 International Conference on Natural Language Processing and Knowledge Engineering.

[15]  Venkata Subramaniam,et al.  Information Retrieval: Data Structures & Algorithms , 1992 .

[16]  Ralph Grishman,et al.  A Decision Tree Method for Finding and Classifying Names in Japanese Texts , 1998, VLC@COLING/ACL.

[17]  Richard M. Schwartz,et al.  An Algorithm that Learns What's in a Name , 1999, Machine Learning.

[18]  Dayne Freitag,et al.  Information Extraction from HTML: Application of a General Machine Learning Approach , 1998, AAAI/IAAI.

[19]  Hideki Isozaki,et al.  Efficient Support Vector Classifiers for Named Entity Recognition , 2002, COLING.

[20]  Iris Hendrickx,et al.  Memory-based one-step named-entity recognition: Effects of seed list features, classifier stacking, and unannotated data , 2003, CoNLL.