HMM BASED POS TAGGER FOR HINDI

Part of Speech tagging in Indian Languages is still an open problem. We still lack a clear approach in implementing a POS tagger for Indian Languages. In this paper we describe our efforts to build a Hidden Markov Model based Part of Speech Tagger. We have used IL POS tag set for the development of this tagger. We have achieved the accuracy of 92%.

[1]  Dipti Misra Sharma,et al.  AnnCorra : Annotating Corpora Guidelines For POS And Chunk Annotation For Indian Languages , 2008 .

[2]  M. Selvam,et al.  Improvement of Rule Based Morphological Analysis and POS Tagging in Tamil Language via Projection and Induction Techniques , 2022 .

[3]  Sivaji Bandyopadhyay,et al.  Lexicon Development and POS Tagging Using a Tagged Bengali News Corpus , 2007, FLAIRS.

[4]  Sudeshna Sarkar,et al.  Automatic Part-of-Speech Tagging for Bengali: An Approach for Morphologically Rich Languages in a Poor Resource Scenario , 2007, ACL.

[5]  Anirudh Mani,et al.  Part of Speech Tagging and Chunking with Conditional Random Fields , 2022 .

[6]  Pushpak Bhattacharyya,et al.  Morphological Richness Offsets Resource Demand - Experiences in Constructing a POS Tagger for Hindi , 2006, ACL.

[7]  Pushpak Bhattacharyya,et al.  Hindi POS Tagger Using Naive Stemming : Harnessing Morphological Information Without Extensive Linguistic Knowledge , 2008 .

[8]  ABOUT IIT BOMBAY & , 2022 .

[9]  Akshar Bharati,et al.  Natural language processing : a Paninian perspective , 1996 .

[10]  Brendan S. Gillon Review of Natural language processing: a Paninian perspective by Akshar Bharati, Vineet Chaitanya, and Rajeev Sangal. Prentice-Hall of India 1995. , 1995 .

[11]  K. P. Soman,et al.  Tamil POS Tagging using Linear Programming , 2009 .

[12]  Sumam Mary Idicula,et al.  Development of a POS Tagger for Malayalam - An Experience , 2009, 2009 International Conference on Advances in Recent Technologies in Communication and Computing.

[13]  Avinesh Pvs,et al.  Part-Of-Speech Tagging and Chunking using Conditional Random Fields and Transformation Based Learning , 2006 .

[14]  Nisheeth Joshi,et al.  Part of Speech Tagging of Marathi Text Using Trigram Method , 2013, ArXiv.

[15]  K. P. Soman,et al.  POS Tagger and Chunker for Tamil Language , 2009 .