A New Fuzzy Support Vector Machine Method for Named Entity Recognition

Recognizing and extracting exact name entities, like Persons, Locations, Organizations, Dates and Times are very useful to mining information from electronics resources and text. Learning to extract these types of data is called Named Entity Recognition (NER) task. Proper named entity recognition and extraction is important to solve most problems in hot research area such as Question Answering and Summarization Systems, Information Retrieval and Information Extraction, Machine Translation, Video Annotation, Semantic Web Search and Bioinformatics. In this paper we have improved the precision in NER from text using the new proposed method that calls FSVM. In our method we have employed Support Vector Machine as one of the best machine learning algorithm for classification and contribute a new fuzzy membership function thus removing the Support Vector Machinepsilas weakness points in NER precision and multi classification. The design of our method is a kind of One-Against-All multi classification technique to solve the traditional binary classifier in SVM.

[1]  Ralph Grishman,et al.  NYU: Description of the MENE Named Entity System as Used in MUC-7 , 1998, MUC.

[2]  Lucja Iwanska,et al.  Wayne State University: description of the UNO natural language processing system as used for MUC-6 , 1995, MUC.

[3]  Nina Wacholder,et al.  Disambiguation of Proper Names in Text , 1997, ANLP.

[4]  Douglas E. Appelt,et al.  SRI International FASTUS SystemMUC-6 Test Results and Analysis , 1995, MUC.

[5]  Richard M. Schwartz,et al.  Nymble: a High-Performance Learning Name-finder , 1997, ANLP.

[6]  Key-Sun Choi,et al.  Unsupervised Named Entity Classification Models and their Ensembles , 2002, COLING.

[7]  Yoram Singer,et al.  Unsupervised Models for Named Entity Classification , 1999, EMNLP.

[8]  Rohini K. Srihari,et al.  A Hybrid Approach for Named Entity and Sub-Type Tagging , 2000, ANLP.

[9]  Yue-Shi Lee,et al.  Extracting Named Entities Using Support Vector Machines , 2006, KDLL.

[10]  Ralph Grishman,et al.  The NYU System for MUC-6 or Where’s the Syntax? , 1995, MUC.

[11]  Stéphane Bressan,et al.  Association rules mining for name entity recognition , 2003, Proceedings of the Fourth International Conference on Web Information Systems Engineering, 2003. WISE 2003..

[12]  Marc Moens,et al.  Description of the LTG System Used for MUC-7 , 1998, MUC.

[13]  Ralph Grishman,et al.  Exploiting Diverse Knowledge Sources via Maximum Entropy in Named Entity Recognition , 1998, VLC@COLING/ACL.

[14]  Frédéric Béchet,et al.  Tagging Unknown Proper Names Using Decision Trees , 2000, ACL.

[15]  Mark Smith,et al.  University of Durham: description of the LOLITA system as used in MUC-6 , 1995, MUC.