A Comparison between a KNN Based Approach and a PNN Algorithm for a Multi-label Classification Problem

Techniques for categorization and clustering, range from support vector machines, neural networks to Bayesian inference and algebraic methods. The k-Nearest Neighbor Algorithm (KNN) is a popular example of the latter class of these algorithms. Recently, a slight modification of it has been proposed so that the Multi-Label k-Nearest Neighbor Algorithm (ML-KNN) can deal better with multi-label classification problems. In this paper we are interested in automatic text categorization, which are becoming more and more important as the amount of text in electronic format grows and the access to it becomes more necessary and widespread. We proposed a Probabilistic Neural Network Algorithm (PNN) tailored to also deal with multi-label classification problems, and compared it against the ML-KNN algorithm. Our implementation surpass the ML-KNN algorithm in four metrics typically used in the literature for multi-label categorization problems.

[1]  Dionisis Cavouras,et al.  Comparative evaluation of probabilistic neural network versus support vector machines classifiers in discriminating ERP signals of depressive patients from healthy controls , 2003, 3rd International Symposium on Image and Signal Processing and Analysis, 2003. ISPA 2003. Proceedings of the.

[2]  Chenn-Jung Huang,et al.  A comparative study of feature selection methods for probabilistic neural networks in cancer classification , 2003, Proceedings. 15th IEEE International Conference on Tools with Artificial Intelligence.

[3]  Fabrizio Sebastiani,et al.  Machine learning in automated text categorization , 2001, CSUR.

[4]  Kok Wai Wong,et al.  Comparing the Performance of Different Neural Networks Architectures for the Prediction of Mineral Prospectivity , 2005, 2005 International Conference on Machine Learning and Cybernetics.

[5]  D. F. Specht,et al.  Probabilistic neural networks for classification, mapping, or associative memory , 1988, IEEE 1988 International Conference on Neural Networks.

[6]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[7]  Jian-xiong Dong,et al.  Fast SVM training algorithm with decomposition on very large data sets , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  David G. Stork,et al.  Pattern Classification , 1973 .

[9]  P. K. Patra,et al.  Probabilistic neural network for pattern classification , 2002, Proceedings of the 2002 International Joint Conference on Neural Networks. IJCNN'02 (Cat. No.02CH37290).

[10]  Xiulan Hao,et al.  An Effective Method To Improve kNN Text Classifier , 2007 .

[11]  Zhi-Hua Zhou,et al.  ML-KNN: A lazy learning approach to multi-label learning , 2007, Pattern Recognit..

[12]  Wee Ser,et al.  Probabilistic neural-network structure determination for pattern classification , 2000, IEEE Trans. Neural Networks Learn. Syst..

[13]  W. Jatmiko,et al.  Optimized probabilistic neural networks in recognizing fragrance mixtures using higher number of sensors , 2005, IEEE Sensors, 2005..

[14]  Donald F. Specht,et al.  Probabilistic neural networks , 1990, Neural Networks.