Named Entity Recognition for Malayalam language: A CRF based approach

Named Entity Recognition is an important application area of Natural Language Processing. It is the process of identifying the designators which are present in a sentence called as named entities. Named Entity Recognition can be performed using rule based approaches, machine learning based approaches and hybrid approaches. This paper proposes a method for Named Entity Recognition of Malayalam language using one of the supervised machine learning approach called Conditional Random field approach.

[1]  Richard M. Schwartz,et al.  Nymble: a High-Performance Learning Name-finder , 1997, ANLP.

[2]  Noémie Elhadad,et al.  Unsupervised biomedical named entity recognition: Experiments with clinical and biological texts , 2013, J. Biomed. Informatics.

[3]  Pabitra Mitra,et al.  A Hybrid Named Entity Recognition System for South and South East Asian Languages , 2008, IJCNLP.

[4]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[5]  Stéphane Bressan,et al.  Association rules mining for name entity recognition , 2003, Proceedings of the Fourth International Conference on Web Information Systems Engineering, 2003. WISE 2003..

[6]  Elizabeth Sherly,et al.  A Hybrid Statistical Approach for Named Entity Recognition for Malayalam Language , 2013 .

[7]  Ralph Grishman,et al.  Exploiting Diverse Knowledge Sources via Maximum Entropy in Named Entity Recognition , 1998, VLC@COLING/ACL.

[8]  Frédéric Béchet,et al.  Tagging Unknown Proper Names Using Decision Trees , 2000, ACL.

[9]  Pabitra Mitra,et al.  A Hybrid Approach for Named Entity Recognition in Indian Languages , 2008 .

[10]  Sobha Lalitha Devi,et al.  Domain Focused Named Entity Recognizer for Tamil Using Conditional Random Fields , 2008, IJCNLP.

[12]  Sivaji Bandyopadhyay,et al.  CRF based Name Entity Recognition (NER) in Manipuri: A highly agglutinative Indian Language , 2011, 2011 2nd National Conference on Emerging Trends and Applications in Computer Science.

[13]  Marti A. Hearst Trends & Controversies: Support Vector Machines , 1998, IEEE Intell. Syst..

[14]  Yue-Shi Lee,et al.  Extracting Named Entities Using Support Vector Machines , 2006, KDLL.

[15]  Jian Su,et al.  Named Entity Recognition using an HMM-based Chunk Tagger , 2002, ACL.

[16]  Yoram Singer,et al.  Unsupervised Models for Named Entity Classification , 1999, EMNLP.

[17]  Rayner Alfred,et al.  Malay named entity recognition based on rule-based approach , 2014 .

[18]  Key-Sun Choi,et al.  Unsupervised Named Entity Classification Models and their Ensembles , 2002, COLING.