Recursive Pattern based Hybrid Supervised Training

We propose, theorize and implement the Recursive Pattern-based Hybrid Supervised (RPHS) learning algorithm. The algorithm makes use of the concept of pseudo global optimal solutions to evolve a set of neural networks, each of which can solve correctly a subset of patterns. The pattern-based algorithm uses the topology of training and validation data patterns to find a set of pseudo-optima, each learning a subset of patterns. It is therefore well adapted to the pattern set provided. We begin by showing that finding a set of local optimal solutions is theoretically equivalent, and more efficient, to finding a single global optimum in terms of generalization accuracy and training time. We also highlight that, as each local optimum is found by using a decreasing number of samples, the efficiency of the training algorithm is increased. We then compare our algorithm, both theoretically and empirically, with different recursive and subset based algorithms. On average, the RPHS algorithm shows better generalization accuracy, with improvement of up to 60% when compared to traditional methods. Moreover, certain versions of the RPHS algorithm also exhibit shorter training time when compared to other recent algorithms in the same domain. In order to increase the relevance of this paper to practitioners, we have added pseudo code, remarks, parameter and algorithmic considerations where appropriate.

[1]  John H. Holland,et al.  Genetic Algorithms and the Optimal Allocation of Trials , 1973, SIAM J. Comput..

[2]  K. Lang,et al.  Learning to tell two spirals apart , 1988 .

[3]  Kalyanmoy Deb,et al.  Don't Worry, Be Messy , 1991, ICGA.

[4]  Hsin-Chia Fu,et al.  A divide-and-conquer methodology for modular supervised neural network design , 1994, Proceedings of 1994 IEEE International Conference on Neural Networks (ICNN'94).

[5]  Nelson F. F. Ebecken,et al.  A clustering algorithm for extracting rules from supervised neural network models in data mining tasks , 2000, Int. J. Comput. Syst. Signals.

[6]  Katsumi Yoshida,et al.  Learning M-of-N Concepts for Medical Diagnosis Using Neural Networks , 2000, J. Adv. Comput. Intell. Intell. Informatics.

[7]  Efstratios F. Georgopoulos,et al.  Exchange-Rates Forecasting: A Hybrid Algorithm Based on Genetically Optimized Adaptive Neural Networks , 2002 .

[8]  Jun Liu,et al.  Incremental Neural Network Training with an Increasing Input Dimension , 2004 .

[9]  Chunyu Bao,et al.  Task Decomposition Using Pattern Distributor , 2004 .

[10]  Antonio González Muñoz,et al.  Including a simplicity criterion in the selection of the best rule in a genetic fuzzy learning algorithm , 2001, Fuzzy Sets Syst..

[11]  Alex A. Freitas,et al.  Extracting comprehensible rules from neural networks via genetic algorithms , 2000, 2000 IEEE Symposium on Combinations of Evolutionary Computation and Neural Networks. Proceedings of the First IEEE Symposium on Combinations of Evolutionary Computation and Neural Networks (Cat. No.00.

[12]  Jacek M. Zurada,et al.  Extraction of rules from artificial neural networks for nonlinear regression , 2002, IEEE Trans. Neural Networks.

[13]  Karim Faez,et al.  An Efficient Human Face Recognition System Using Pseudo Zernike Moment Invariant and Radial Basis Function Neural Network , 2003, Int. J. Pattern Recognit. Artif. Intell..

[14]  Urszula Markowska-Kaczmar,et al.  Fuzzy logic and evolutionary algorithm - two techniques in rule extraction from neural networks , 2005, Neurocomputing.

[15]  Bernard Widrow,et al.  Neural networks: applications in industry, business and science , 1994, CACM.

[16]  A. E. Eiben,et al.  Introduction to Evolutionary Computing , 2003, Natural Computing Series.

[17]  Fangming Zhu,et al.  Class decomposition for GA-based classifier agents - a Pitt approach , 2004, IEEE Trans. Syst. Man Cybern. Part B.

[18]  Karim Faez,et al.  A hybrid learning RBF neural network for human face recognition with pseudo Zernike moment invariant , 2002, Proceedings of the 2002 International Joint Conference on Neural Networks. IJCNN'02 (Cat. No.02CH37290).

[19]  Peng Li,et al.  A Hierarchical Incremental Learning Approach to Task Decomposition , 2002 .

[20]  Steven Guan,et al.  Parallel growing and training of neural networks using output parallelism , 2002, IEEE Trans. Neural Networks.

[21]  Joachim Diederich,et al.  Survey and critique of techniques for extracting rules from trained artificial neural networks , 1995, Knowl. Based Syst..

[22]  Keinosuke Fukunaga,et al.  Introduction to Statistical Pattern Recognition , 1972 .

[23]  Hiroaki Satoh,et al.  Minimal generation gap model for GAs considering both exploration and exploitation , 1996 .

[24]  Mikko Lehtokangas Modelling with constructive backpropagation , 1999, Neural Networks.

[25]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[26]  Bernhard Sendhoff,et al.  Evolutionary Multi-objective Optimization for Simultaneous Generation of Signal-Type and Symbol-Type Representations , 2005, EMO.

[27]  Urszula Markowska-Kaczmar,et al.  Discovering the Mysteries of Neural Networks , 2004, Int. J. Hybrid Intell. Syst..

[28]  Xiaogang Ruan,et al.  A neurocomputing model for real coded genetic algorithm with the minimal generation gap , 2004, Neural Computing & Applications.

[29]  Andrzej Cichocki,et al.  Neural networks for optimization and signal processing , 1993 .

[30]  Edward Keedwell,et al.  CREATING RULES FROM TRAINED NEURAL NETWORKS USING GENETIC ALGORITHMS , 2000 .

[31]  Xin Yao,et al.  A review of evolutionary artificial neural networks , 1993, Int. J. Intell. Syst..

[32]  Olli Simula,et al.  An approach to automated interpretation of SOM , 2001, WSOM.

[33]  Sushmita Mitra,et al.  Neuro-fuzzy rule generation: survey in soft computing framework , 2000, IEEE Trans. Neural Networks Learn. Syst..

[34]  A. E. Eiben,et al.  Global conver-gence of genetic algorithms: an infinite Markov chain analysis , 1991 .

[35]  Lipo Wang,et al.  Rule extraction by genetic algorithms based on a simplified RBF neural network , 2001, Proceedings of the 2001 Congress on Evolutionary Computation (IEEE Cat. No.01TH8546).

[36]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[37]  Ronald J. Patton,et al.  Interpretation of Trained Neural Networks by Rule Extraction , 2001, Fuzzy Days.

[38]  Nelson F. F. Ebecken,et al.  Applying a clustering genetic algorithm for extracting rules from a supervised neural network , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[39]  Urszula Markowska-Kaczmar The influence of parameters in evolutionary based rule extraction method from neural network , 2005, 5th International Conference on Intelligent Systems Design and Applications (ISDA'05).

[40]  C. Robert,et al.  Electroencephalogram processing using neural networks , 2002, Clinical Neurophysiology.

[41]  Michael C. Mozer,et al.  Template-based procedures for neural network interpretation , 1999, Neural Networks.

[42]  Phil Husbands,et al.  Evolution of central pattern generators for bipedal walking in a real-time physics environment , 2002, IEEE Trans. Evol. Comput..

[43]  Urszula Markowska-Kaczmar,et al.  Rule Extraction from Neural Network by Genetic Algorithm with Pareto Optimization , 2004, ICAISC.

[44]  Wolfgang Banzhaf,et al.  Dynamic Subset Selection Based on a Fitness Case Topology , 2004, Evolutionary Computation.

[45]  Zbigniew Michalewicz,et al.  Genetic Algorithms + Data Structures = Evolution Programs , 1996, Springer Berlin Heidelberg.

[46]  Ed Keedwell,et al.  Creating rules from trained networks using genetic algorithms , 2000, Int. J. Comput. Syst. Signals.

[47]  Berend Jan van der Zwaag Handwritten Digit Recognition: A Neural Network Demo , 2001, Fuzzy Days.

[48]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[49]  Jun Liu,et al.  Incremental Ordered Neural Network Training , 2002 .

[50]  Steven Guan,et al.  An incremental approach to genetic-algorithms-based classification , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[51]  Hongjun Lu,et al.  NeuroRule: A Connectionist Approach to Data Mining , 1995, VLDB.

[52]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[53]  Joydeep Ghosh,et al.  Symbolic Interpretation of Artificial Neural Networks , 1999, IEEE Trans. Knowl. Data Eng..

[54]  Hajime Kita,et al.  A parallel and modular multi-sieving neural network architecture for constructive learning , 1995 .

[55]  Guido Bologna,et al.  Symbolic Rule Extraction from the DIMLP Neural Network , 1998, Hybrid Neural Systems.

[56]  Byoung-Tak Zhang,et al.  Genetic Programming with Active Data Selection , 1998, SEAL.