Text Classification for Marathi Documents using Supervised Learning Methods

The evolution of information technology has led to the collection of a large number of text documents. Most research so far has focused on English text documents, yet millions of documents now exist in Indian regional languages. Classifying these documents manually is expensive and time-consuming; automatic classification can help in better management and retrieval of these documents. The literature survey shows that little work has been done on the classification of Marathi text documents. This paper presents an efficient Marathi text classification system using supervised learning methods and ontology-based classification.
