A data sampling and attribute selection strategy for improving decision tree construction

Abstract Decision trees are efficient means for building classification models due to the compressibility, simplicity and ease of interpretation of their results. However, during the construction phase of decision trees, the outputs are often large trees that are affected by many uncertainties in the data (particularity, noise and residual variation). Combining attribute selection and data sampling presents one of the most promising research directions to overcome decision tree construction problems. However, the search space composed of all possible combinations of subsets of training samples and attributes is extremely large. In this paper, a novel approach is presented that allows generating an optimized decision tree by selecting an optimal couple of training samples and attributes subsets for training. As the search space of candidate couples of training samples and attributes subsets is extremely large, we use particle swarm optimization to make the search of an “optimal” solution tractable. The selected optimized solution helps in avoiding over-fitting and complexity problems suffered in the construction phase of decision trees. We conducted an extensive experimental evaluation on 22 datasets from the UCI Machine Learning Repository. The obtained results show that the proposed approach outperforms state-of-the-art classical as well as evolutionary decision tree construction methods in terms of simplicity, accuracy, and F-measure. We further evaluate our approach on a real-world engineering application for condition monitoring of rotating machinery under severe non-stationary conditions. The obtained results showed that the proposed approach allowed to optimize the use of instantaneous angular speed to diagnose gears defects.

[1]  Donato Malerba,et al.  A Comparative Analysis of Methods for Pruning Decision Trees , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Nour El Islem Karabadji,et al.  Improving memory-based user collaborative filtering with evolutionary multi-objective optimization , 2018, Expert Syst. Appl..

[3]  Diego Cabrera,et al.  Attribute clustering using rough set theory for feature selection in fault severity classification of rotating machinery , 2017, Expert Syst. Appl..

[4]  Xinhua Zhuang,et al.  Binary linear decision tree with genetic algorithm , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[5]  Diego Cabrera,et al.  A review on data-driven fault severity assessment in rolling bearings , 2018 .

[6]  Antonio J. Nebro,et al.  jMetal: A Java framework for multi-objective optimization , 2011, Adv. Eng. Softw..

[7]  L. Renaudin,et al.  Natural roller bearing fault detection by angular measurement of true instantaneous angular speed , 2010 .

[8]  Didier Remond,et al.  Hybrid Scheme for Wind Turbine Condition Monitoring Based on Instantaneous Angular Speed and Pattern Recognition , 2018 .

[9]  Nour El Islem Karabadji,et al.  An evolutionary scheme for decision tree construction , 2017, Knowl. Based Syst..

[10]  Robert B. Randall,et al.  Instantaneous Angular Speed (IAS) processing and related angular applications , 2014 .

[11]  Urszula Boryczka,et al.  Ant Colony Decision Trees - A New Method for Constructing Decision Trees Based on Ant Colony Optimization , 2010, ICCCI.

[12]  Nour El Islem Karabadji,et al.  Evolutionary mining of skyline clusters of attributed graph data , 2020, Inf. Sci..

[13]  Jérôme Antoni,et al.  Precision of the IAS monitoring system based on the elapsed time method in the spectral domain , 2012 .

[14]  Ahmad Taher Azar,et al.  Supervised hybrid feature selection based on PSO and rough sets for medical diagnosis , 2014, Comput. Methods Programs Biomed..

[15]  P. V. G. Bradbeer,et al.  The Construction and Evaluation of Decision Trees: a Comparison of Evolutionary and Concept Learning Methods , 1997, Evolutionary Computing, AISB Workshop.

[16]  Nour El Islem Karabadji,et al.  Decision Tree Selection in an Industrial Machine Fault Diagnostics , 2012, MEDI.

[17]  I. Bratko,et al.  Learning decision rules in noisy domains , 1987 .

[18]  Voicu Groza,et al.  Optimizing Particle Swarm Optimization algorithm , 2014, 2014 IEEE 27th Canadian Conference on Electrical and Computer Engineering (CCECE).

[19]  Didier Remond,et al.  Simplified Dynamic Model of a Wind Turbine Shaft Line Operating in Non-stationary Conditions Applied to the Analysis of IAS as a Machinery Surveillance Tool , 2018 .

[20]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[21]  Radoslaw Zimroz,et al.  A new feature for monitoring the condition of gearboxes in non-stationary operating conditions , 2009 .

[22]  Enrico Zio,et al.  Artificial intelligence for fault diagnosis of rotating machinery: A review , 2018, Mechanical Systems and Signal Processing.

[23]  Fredrik Sandin,et al.  Online feature learning for condition monitoring of rotating machinery , 2017, Eng. Appl. Artif. Intell..

[24]  S. Raghavan,et al.  Diversification for better classification trees , 2006, Comput. Oper. Res..

[25]  J. Ross Quinlan,et al.  Simplifying Decision Trees , 1987, Int. J. Man Mach. Stud..

[26]  Nour El Islem Karabadji,et al.  Improved decision tree construction based on attribute selection and data sampling for fault diagnosis in rotating machines , 2014, Eng. Appl. Artif. Intell..

[27]  B. Chandra,et al.  Moving towards efficient decision tree construction , 2009, Inf. Sci..

[28]  Urszula Boryczka,et al.  Collective data mining in the ant colony decision tree approach , 2016, Inf. Sci..

[29]  Alex Alves Freitas,et al.  Inducing decision trees with an ant colony optimization algorithm , 2012, Appl. Soft Comput..

[30]  John Mingers,et al.  An Empirical Comparison of Pruning Methods for Decision Tree Induction , 1989, Machine Learning.

[31]  Md Zahidul Islam,et al.  Optimizing the number of trees in a decision forest to discover a subforest with high ensemble accuracy using a genetic algorithm , 2016, Knowl. Based Syst..

[32]  Yudong Zhang,et al.  Binary PSO with mutation operator for feature selection using decision tree applied to spam detection , 2014, Knowl. Based Syst..

[33]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.