Applying genetic programming technique in classification trees

Classification problems are often encountered in many applications. In the past, classification trees were often generated by decision-tree methods and commonly used to solve classification problems. In this paper, we have proposed an algorithm based on genetic programming to search for an appropriate classification tree according to some criteria. The classification tree obtained can be transferred into a rule set, which can then be fed into a knowledge base to support decision making and facilitate daily operations. Two new genetic operators, elimination and merge, are designed in the proposed approach to remove redundancy and subsumption, thus producing more accurate and concise decision rules than that without using them. Experimental results from the credit card data also show the feasibility of the proposed algorithm.

[1]  Christopher J. Neely,et al.  Technical Trading Rules in the European Monetary System , 1998 .

[2]  Tzung-Pei Hong,et al.  Automatically integrating multiple rule sets in a distributed-knowledge environment , 1998, IEEE Trans. Syst. Man Cybern. Part C.

[3]  Tzung-Pei Hong,et al.  Learning discriminant functions with fuzzy attributes for classification using genetic programming , 2002, Expert systems with applications.

[4]  Christopher J. Neely,et al.  Is Technical Analysis in the Foreign Exchange Market Profitable? A Genetic Programming Approach , 1996, Journal of Financial and Quantitative Analysis.

[5]  Gary Riley,et al.  Expert Systems: Principles and Programming , 2004 .

[6]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[7]  Xing Zhang,et al.  A new approach to classification based on association rule mining , 2006, Decis. Support Syst..

[8]  Shu-Heng Chen Evolutionary Computation in Economics and Finance , 2002 .

[9]  Tzu-Wen Kuo,et al.  Evolutionary Computation in Economics and Finance: A Bibliography , 2002 .

[10]  金田 重郎,et al.  C4.5: Programs for Machine Learning (書評) , 1995 .

[11]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[12]  P. R. Balasubramanian,et al.  KnowledgeScope: managing knowledge in context , 2003, Decis. Support Syst..

[13]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[14]  Christopher J. Neely,et al.  Technical Analysis and Central Bank Intervention , 2000 .

[15]  P. Nordin Genetic Programming III - Darwinian Invention and Problem Solving , 1999 .

[16]  Melody Y. Kiang,et al.  A comparative assessment of classification methods , 2003, Decis. Support Syst..

[17]  David R. Cox The analysis of binary data , 1970 .

[18]  Dorothy E. Leidner,et al.  Review: Knowledge Management and Knowledge Management Systems: Conceptual Foundations and Research Issues , 2001, MIS Q..

[19]  Yen-Liang Chen,et al.  A data mining approach for retail knowledge discovery with consideration of the effect of shelf-space adjacency on sales , 2006, Decis. Support Syst..

[20]  Shu-Heng Chen,et al.  Genetic Algorithms and Genetic Programming in Computational Finance , 2002 .

[21]  Tzung-Pei Hong,et al.  Integrating fuzzy knowledge by genetic algorithms , 1998, IEEE Trans. Evol. Comput..

[22]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[23]  John R. Koza,et al.  Genetic Programming III: Darwinian Invention & Problem Solving , 1999 .

[24]  Dominique Bertrand,et al.  Comparison of multilayer perceptron and probabilistic neural networks in artificial vision. Application to the discrimination of seeds , 1997 .

[25]  Hitoshi Iba,et al.  Genetic Programming of Polynomial Models for Financial Forecasting , 2002 .

[26]  Asoke K. Nandi,et al.  Genetic programming techniques for hand written digit recognition , 2004, Signal Process..