论文信息 - A Normalized Probabilistic Expectation-Maximization Neural Network for Minimizing Bayesian Misclassification Cost Risk

A Normalized Probabilistic Expectation-Maximization Neural Network for Minimizing Bayesian Misclassification Cost Risk

Abstract In this paper, we propose a normalized semi-supervised probabilistic expectation-maximization neural network (PEMNN) that minimizes Bayesian misclassification cost risk. Using simulated and real-world datasets, we compare the proposed PEMNN with supervised cost sensitive probabilistic neural network (PNN), discriminant analysis (DA), mathematical integer programming (MIP) model and support vector machines (SVM) for different misclassification cost asymmetries and class biases. The results of our experiments indicate that the PEMNN performs better when class data distributions are normal or uniform. However, when class data distribution is exponential the performance of PEMNN deteriorates giving slight advantage to competing MIP, DA, PNN and SVM techniques. For real-world data with non-parametric distributions and mixed decision-making attributes (continuous and categorical), the PEMNN outperforms the PNN.

Parag C. Pendharkar | P. Pendharkar

[1] Antonie Stam,et al. A mixed integer programming algorithm for minimizing the training sample misclassification cost in two-group classification , 1997, Ann. Oper. Res..

[2] อนิรุธ สืบสิงห์,et al. Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[3] Casimir A. Kulikowski,et al. Computer Systems That Learn: Classification and Prediction Methods from Statistics, Neural Nets, Machine Learning and Expert Systems , 1990 .

[4] D. Rubin,et al. Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[5] A. Stam,et al. Classification performance of mathematical programming techniques in discriminant analysis: Results for small and medium sample sizes , 1990 .

[6] Yiming Ying,et al. Support Vector Machine Soft Margin Classifiers: Error Analysis , 2004, J. Mach. Learn. Res..

[7] Jasni Mohamad Zain,et al. The Design of Pre-Processing Multidimensional Data Based on Component Analysis , 2011, Comput. Inf. Sci..

[8] Jiawei Han,et al. Data Mining: Concepts and Techniques , 2000 .

[9] S. M. Bajgier,et al. AN EXPERIMENTAL COMPARISON OF STATISTICAL AND LINEAR PROGRAMMING APPROACHES TO THE DISCRIMINANT PROBLEM , 1982 .

[10] Richard O. Duda,et al. Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[11] Parag C. Pendharkar,et al. Probabilistic Approaches for Credit Screening and bankruptcy Prediction , 2011, Intell. Syst. Account. Finance Manag..