The role of temporal feature extraction and bagging of MLP neural networks for solving the WCCI 2008 Ford Classification Challenge

This paper presents an approach for solving WCCI 2008's Ford Classification Challenge Problem. The solution is based on the creation of new input variables through temporal feature extraction and on the combination via bagging of an ensemble of 30 multi-layer perceptrons trained on sets divided by multiple random sampling of the labeled data. Signal power, signal to noise ratio and signal frequency were some of the meaningful features extracted for improving the system's performance. The data sampling strategy produced a robust median MLP response and allowed for the definition of the appropriate decision threshold. The performance measured on the 30 test samples (statistically independent from the training data) reached an average of Max_KS2 = 0.91, AUC_ROC = 0.99 and accuracy of 95.6% for Ford_A and Max_KS2 = 0.88, AUC_ROC = 0.98 and accuracy of 94.1% for Ford_B. These results have been confirmed on the competition for the noiseless data and have degraded around 15% for the noisy data.

[1]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery: An Overview , 1996, Advances in Knowledge Discovery and Data Mining.

[2]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[3]  Silvio Romero de Lemos Meira,et al.  Knowledge Reuse in Data Mining Projects and Its Practical Applications , 2009, ICEIS.

[4]  Tom Fawcett,et al.  Robust Classification for Imprecise Environments , 2000, Machine Learning.

[5]  Ian Witten,et al.  Data Mining , 2000 .

[6]  Germano C. Vasconcelos,et al.  The Power of Sampling and Stacking for the PAKDD-2007 Cross-Selling Problem , 2010, Strategic Advancements in Utilizing Data Mining and Warehousing Technologies.

[7]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[8]  Germano C. Vasconcelos,et al.  A systematic solution for the NN3 Forecasting Competition problem based on an ensemble of MLP neural networks , 2008, 2008 19th International Conference on Pattern Recognition.

[9]  Michael Y. Hu,et al.  Forecasting with artificial neural networks: The state of the art , 1997 .

[10]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[11]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[12]  Melody Y. Kiang,et al.  A comparative assessment of classification methods , 2003, Decis. Support Syst..

[13]  Tom Fawcett,et al.  ROC Graphs: Notes and Practical Considerations for Researchers , 2007 .

[14]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques with Java implementations , 2002, SGMD.

[15]  G. Kane Parallel Distributed Processing: Explorations in the Microstructure of Cognition, vol 1: Foundations, vol 2: Psychological and Biological Models , 1994 .

[16]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[17]  David West,et al.  Neural network credit scoring models , 2000, Comput. Oper. Res..

[18]  W. J. Conover,et al.  Practical Nonparametric Statistics , 1972 .