A classification procedure for highly imbalanced class sizes

This article develops an effective procedure for handling two-class classification problems with highly imbalanced class sizes. In many imbalanced two-class problems, the majority class represents “normal” cases, while the minority class represents “abnormal” cases, detection of which is critical to decision making. When the class sizes are highly imbalanced, conventional classification methods tend to strongly favor the majority class, resulting in very low or even no detection of the minority class. The research objective of this article is to devise a systematic procedure to substantially improve the power of detecting the minority class so that the resulting procedure can help screen the original data set and select a much smaller subset for further investigation. A procedure is developed that is based on ensemble classifiers, where each classifier is constructed from a resized training set with reduced dimension space. In addition, how to find the best values of the decision variables in the proposed classification procedure is specified. The proposed method is compared to a set of off-the-shelf classification methods using two real data sets. The prediction results of the proposed method show remarkable improvements over the other methods. The proposed method can detect about 75% of the minority class units, while the other methods turn out much lower detection rates.

[1]  S. Mark Spearing,et al.  Design of a piezoelectric-based structural health monitoring system for damage detection in composite materials , 2002, SPIE Smart Structures and Materials + Nondestructive Evaluation and Health Monitoring.

[2]  Vivian West,et al.  Computing, Artificial Intelligence and Information Technology Ensemble strategies for a medical diagnostic decision support system: A breast cancer diagnosis application , 2005 .

[3]  Jerome P. Lynch,et al.  A summary review of wireless sensors and sensor networks for structural health monitoring , 2006 .

[4]  Shan Shi,et al.  Assessment of Corrosion of Steel in Concrete Structures by Magnetic Based NDE Techniques , 1996 .

[5]  Robert Tibshirani,et al.  Bias, Variance and Prediction Error for Classification Rules , 1996 .

[6]  Stan Matwin,et al.  Machine Learning for the Detection of Oil Spills in Satellite Radar Images , 1998, Machine Learning.

[7]  Emily K. Lada,et al.  A wavelet-based procedure for process fault detection , 2002 .

[8]  Adam Kowalczyk,et al.  Extreme re-balancing for SVMs: a case study , 2004, SKDD.

[9]  Pedro M. Domingos MetaCost: a general method for making classifiers cost-sensitive , 1999, KDD '99.

[10]  Sameer Singh,et al.  Novelty detection: a review - part 1: statistical approaches , 2003, Signal Process..

[11]  I. Johnstone,et al.  Adapting to Unknown Smoothness via Wavelet Shrinkage , 1995 .

[12]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[13]  Deborah Estrin,et al.  A wireless sensor network For structural monitoring , 2004, SenSys '04.

[14]  Naoki Saito,et al.  Simultaneous noise suppression and signal compression using a library of orthonormal bases and the minimum-description-length criterion , 1994, Defense, Security, and Sensing.

[15]  Kok-Meng Lee,et al.  Effects of Classification Methods on Color-Based Feature Detection With Food Processing Applications , 2007, IEEE Transactions on Automation Science and Engineering.

[16]  Michiel C. van Wezel,et al.  Improved customer choice predictions using ensemble methods , 2005, Eur. J. Oper. Res..

[17]  V. Giurgiutiu Tuned Lamb Wave Excitation and Detection with Piezoelectric Wafer Active Sensors for Structural Health Monitoring , 2005 .

[18]  Feng Niu,et al.  Fault Region Localization: Product and Process Improvement Based on Field Performance and Manufacturing Measurements , 2006, IEEE Transactions on Automation Science and Engineering.

[19]  Francesco Marcelloni,et al.  Reducing Power Consumption in Wireless Sensor Networks Using a Novel Approach to Data Aggregation , 2008, Comput. J..

[20]  Tom Fawcett,et al.  Adaptive Fraud Detection , 1997, Data Mining and Knowledge Discovery.

[21]  E. Anderson Customer Satisfaction and Word of Mouth , 1998 .

[22]  Michael J. Pazzani,et al.  Reducing Misclassification Costs , 1994, ICML.

[23]  Fadil Santosa,et al.  Nondestructive evaluation of plates using eddy current methods , 1998 .

[24]  Robert E. Green,et al.  Ultrasonic and Acoustic Emission Detection of Fatigue Damage. , 1980 .

[25]  Hoon Sohn,et al.  Overview of Piezoelectric Impedance-Based Health Monitoring and Path Forward , 2003 .

[26]  Charles E. Heckler,et al.  Applied Multivariate Statistical Analysis , 2005, Technometrics.

[27]  Charles R. Farrar,et al.  A summary review of vibration-based damage identification methods , 1998 .

[28]  Leo Breiman,et al.  Bias, Variance , And Arcing Classifiers , 1996 .

[29]  David L. Donoho,et al.  De-noising by soft-thresholding , 1995, IEEE Trans. Inf. Theory.

[30]  Roger G. Schroeder,et al.  Six sigma: A goal-theoretic perspective , 2003 .

[31]  Salvatore J. Stolfo,et al.  Toward Scalable Learning with Non-Uniform Class and Cost Distributions: A Case Study in Credit Card Fraud Detection , 1998, KDD.

[32]  Bernhard Schölkopf,et al.  Support Vector Novelty Detection Applied to Jet Engine Vibration Spectra , 2000, NIPS.

[33]  Mikhail F. Kanevski,et al.  A Comparison of One-Class Classifiers for Novelty Detection in Forensic Case Data , 2007, IDEAL.

[34]  R. Westbrook Product/Consumption-Based Affective Responses and Postpurchase Processes , 1987 .

[35]  L M Rogers DETECTION AND MONITORING OF CRACKS IN OFFSHORE STRUCTURES BY ACOUSTIC EMISSION , 1986 .

[36]  Nathalie Japkowicz,et al.  A Novelty Detection Approach to Classification , 1995, IJCAI.

[37]  Jiannong Cao,et al.  Distributed Processing in Wireless Sensor Networks for Structural Health Monitoring , 2007, UIC.

[38]  J. J. Chen,et al.  Classification ensembles for unbalanced class sizes in predictive toxicology , 2005, SAR and QSAR in environmental research.

[39]  C. Liang,et al.  Truss Structure Integrity Identification Using PZT Sensor-Actuator , 1995 .