Named entity recognition for Hindi language: A survey

Abstract Named Entity Recognition (NER) is an important task that is used as a pre-processing step in various natural language processing (NLP) applications. It is useful as it mostly increases the performance of NLP applications. A large number of researchers are focusing this problem using various techniques such as rule based, machine learning based and hybrid approaches. In the recent times, deep learning algorithms are also emerging for developing NER models. It is very challenging to build NER system for Hindi language in particular because it is an ambiguous, morphologically rich and resource scarce language. In this paper, we present a state-of-the-art survey covering various approaches of NER for mainly Hindi Language.

[1]  Andy Way,et al.  Improved Named Entity Recognition using Machine Translation-based Cross-lingual Information , 2016, Computación y Sistemas.

[2]  Mukund Sanglikar,et al.  Named Entity Recognition System for Hindi Language: A Hybrid Approach , 2011 .

[3]  Sivaji Bandyopadhyay,et al.  Named Entity Recognition using Support Vector Machine: A Language Independent Approach , 2010 .

[4]  Mark A. Greenwood,et al.  Using a Named Entity Tagger to Generalise Surface Matching Text Patterns for Question Answering , 2003 .

[5]  Navanath Saharia Phone-based identification of language in code-mixed social network data , 2017 .

[6]  Ameya Prabhu,et al.  Towards Deep Learning in Hindi NER: An approach to tackle the Labelled Data Sparsity , 2016, ICON.

[7]  Kamal Sarkar,et al.  An HMM Based Named Entity Recognition System for Indian Languages: The JU System at ICON 2013 , 2014, ArXiv.

[8]  Deepti Chopra,et al.  Named Entity Recognition in Hindi Using Hidden Markov Model , 2016, 2016 Second International Conference on Computational Intelligence & Communication Technology (CICT).

[9]  Pabitra Mitra,et al.  A Semi-supervised Approach for Maximum Entropy Based Hindi Named Entity Recognition , 2009, PReMI.

[10]  Yavrajdeep Kaur,et al.  Named Entity Recognition (NER) System for Hindi Language Using Combination of Rule Based Approach and List Look Up Approach , 2015 .

[11]  Ramesh C. Poonia,et al.  A Literature Review on Dedicated Short Range Communication for Intelligent Transport , 2013 .

[12]  Pabitra Mitra,et al.  A composite kernel for named entity recognition , 2010, Pattern Recognit. Lett..

[13]  Sivaji Bandyopadhyay,et al.  A Conditional Random Field Approach for Named Entity Recognition in Bengali and Hindi , 2009 .

[14]  Deepti Chopra Hindi Named Entity Recognition By Aggregating Rule Based Heuristics and Hidden Markov Model , 2012 .

[15]  Pabitra Mitra,et al.  A Hybrid Feature Set based Maximum Entropy Hindi Named Entity Recognition , 2008, IJCNLP.

[16]  Linesh Raja,et al.  Source Redundancy Management and Host Intrusion Detection in Wireless Sensor Networks , 2018 .

[17]  Linesh Raja,et al.  The Realistic Mobility Evaluation of Vehicular Ad-Hoc Network for Indian Automotive Networks , 2014 .

[18]  Vijander Singh,et al.  Tier Application in Multi-Cloud Databases to Improve Security and Service Availability , 2019 .

[19]  Vasudeva Varma,et al.  Named entity recognition an aid to improve multilingual entity filling in language-independent approach , 2012, IKM4DR '12.

[20]  Ajit Kumar,et al.  Hindi to Punjabi machine translation system based on statistical approach , 2018 .

[21]  Neethu John,et al.  AMRITA_CEN@FIRE-2014: Named Entity Recognition for Indian Languages using Rich Features , 2014, FIRE.

[22]  Namita Mittal,et al.  Discourse Based Sentiment Analysis for Hindi Reviews , 2013, PReMI.

[23]  Deepti Chopra,et al.  Named Entity Recognition using Hidden Markov Model (HMM) , 2012 .

[24]  K. P. Soman,et al.  Entity Extraction of Hindi-English and Tamil-English Code-Mixed Social Media Text , 2016, FIRE Workshop.

[25]  Wei Li,et al.  Rapid development of Hindi named entity recognition using conditional random fields and feature induction , 2003, TALIP.