Investigation of BPNN & RBFN in text classification by Active search

Need of automatic text classification increases with the availability of huge amount of text in internet, news, institutes and organization. The proposed work comprised to deal with the major challenge of getting labeled data for training in classifier, since the availability of labeled data requires the involvement of annotator, is expensive and time consuming. A novel semi supervised text classification algorithm is proposed which makes use of web assisted data by Active search, the proposed algorithm investigates results by applying term weighting method (term frequency)tf and (term frequency.relevance frequency)tf.rf on BPNN (Back Propagation Neural Network)and RBFN (Radial Basis Function Network)classifiers and compared on test data and standard data Mini Newsgroup. Experimental results state that Both BPNN and RBF network performance is comparable for test data in the proposed framework, though RBF Network performance is better and more consistent than BPNN on standard mini newsgroup dataset on the basis of Micro averaged F1measure.

[1]  Mahak Motwani,et al.  Comparative Study and Analysis of Supervised and Unsupervised Term Weighting Methods on Text Classification , 2013 .

[2]  Dik Lun Lee,et al.  Feature reduction for neural network based text categorization , 1999, Proceedings. 6th International Conference on Advanced Systems for Advanced Applications.

[3]  Jian Su,et al.  Supervised and Traditional Term Weighting Methods for Automatic Text Categorization , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Shiwei Tang,et al.  A Comparative Study on Feature Weight in Text Categorization , 2004, APWeb.

[5]  John Mark,et al.  Introduction to radial basis function networks , 1996 .

[6]  Zenglin Xu,et al.  Semi-supervised text categorization by active search , 2008, CIKM '08.

[7]  Maria-Florina Balcan,et al.  A discriminative model for semi-supervised learning , 2010, J. ACM.

[8]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[9]  Hwee Tou Ng,et al.  Feature selection, perceptron learning, and a usability case study for text categorization , 1997, SIGIR '97.

[10]  Minoru Nakayama,et al.  Subject Categorization for Web Educational Resources using MLP , 2003, ESANN.

[11]  Padmini Srinivasan,et al.  Automatic Text Categorization Using Neural Networks , 1997 .

[12]  Yanchun Zhang,et al.  Enhancing text classification using synopses extraction , 2003, Proceedings of the Fourth International Conference on Web Information Systems Engineering, 2003. WISE 2003..

[13]  Michael W. Berry,et al.  Survey of Text Mining , 2003, Springer New York.

[14]  Andreas Hotho,et al.  A Brief Survey of Text Mining , 2005, LDV Forum.