Learning with imbalanced datasets using fuzzy ARTMAP-based neural network models

One of the main difficulties in real-world data classification and analysis tasks is that the data distribution can be imbalanced. In this paper, a variant of the supervised learning neural network from the Adaptive Resonance Theory (ART) family, i.e., Fuzzy ARTMAP (FAM) which is equipped with a conflict-resolving facility, is proposed to classify an imbalanced dataset that represents a real problem in the semiconductor industry. The FAM model is combined with the Dynamic Decay Adjustment (DDA) algorithm to form a hybrid FAMDDA network. The classification results of FAM and FAMDDA are presented, compared, and analyzed using several classification metrics. The outcomes positively indicate the effectiveness of the proposed FAMDDA network in undertaking classification problems with imbalanced datasets.

[1]  Yanchun Zhang,et al.  Toward breast cancer survivability prediction models through improving training space , 2009, Expert Syst. Appl..

[2]  Alok R. Chaturvedi,et al.  Acquiring implicit knowledge in a complex domain , 1993 .

[3]  David Hinkley,et al.  Bootstrap Methods: Another Look at the Jackknife , 2008 .

[4]  Rong Yan,et al.  Imbalanced RankBoost for efficiently ranking large-scale image/video collections , 2009, CVPR.

[5]  Niall M. Adams,et al.  Off-the-peg and bespoke classifiers for fraud detection , 2008, Comput. Stat. Data Anal..

[6]  Yok-Yen Nguwi,et al.  An unsupervised self-organizing learning with support vector ranking for imbalanced datasets , 2010, Expert Syst. Appl..

[7]  Stephen Grossberg,et al.  A massively parallel architecture for a self-organizing neural pattern recognition machine , 1988, Comput. Vis. Graph. Image Process..

[8]  Huiru Zheng,et al.  A Bayesian Approach to Improving Decision Making in Support Vector Machine and its Application in Bioinformatics , 2009, 2009 Fifth International Conference on Natural Computation.

[9]  Haibo He,et al.  Learning from Imbalanced Data , 2009, IEEE Transactions on Knowledge and Data Engineering.

[10]  Michael R. Berthold,et al.  Building precise classifiers with automatic rule extraction , 1995, Proceedings of ICNN'95 - International Conference on Neural Networks.

[11]  Sheng Chen,et al.  A Kernel-Based Two-Class Classifier for Imbalanced Data Sets , 2007, IEEE Transactions on Neural Networks.

[12]  Nathalie Japkowicz,et al.  The class imbalance problem: A systematic study , 2002, Intell. Data Anal..

[13]  Stan Matwin,et al.  Machine Learning for the Detection of Oil Spills in Satellite Radar Images , 1998, Machine Learning.

[14]  Stephen Grossberg,et al.  Fuzzy ARTMAP: A neural network architecture for incremental supervised learning of analog multidimensional maps , 1992, IEEE Trans. Neural Networks.

[15]  CHEE PENG LIM,et al.  An Incremental Adaptive Network for On-line Supervised Learning and Probability Estimation , 1997, Neural Networks.