Automated Design of Classification Algorithms

Data classification provides effective solutions to various real-world problems in areas such as disease diagnosis, network intrusion detection, and financial forecasting, among others. Classification algorithms such as induction algorithms, e.g., ID3, and genetic programming are used to produce classifiers. The design of these classification algorithms is time-consuming, requiring many person hours, and is an optimization problem. This chapter examines the automated design of genetic programming as a classification algorithm. The study compares the performance of genetic algorithms and grammatical evolution in automating the design of genetic programming for classifier induction. The performance of the classifiers produced by automated design is compared to that produced by manually designed genetic programming algorithms for binary and multi-class classification in various areas including network intrusion detection and financial forecasting. The automated design required less design time and produced classifiers that performed better than the manually designed classifiers. Grammatical evolution was found to produce better-performing classifiers for binary classification and genetic algorithms for multi-class classification.