An incremental neural learning framework and its application to vehicle diagnostics

Abstract This paper presents a framework for incremental neural learning (INL) that allows a base neural learning system to incrementally learn new knowledge from only new data without forgetting the existing knowledge. Upon subsequent encounters of new data examples, INL utilizes prior knowledge to direct its incremental learning. A number of critical issues are addressed including when to make the system learn new knowledge, how to learn new knowledge without forgetting existing knowledge, how to perform inference using both the existing and the newly learnt knowledge, and how to detect and deal with aged learnt systems. To validate the proposed INL framework, we use backpropagation (BP) as a base learner and a multi-layer neural network as a base intelligent system. INL has several advantages over existing incremental algorithms: it can be applied to a broad range of neural network systems beyond the BP trained neural networks; it retains the existing neural network structures and weights even during incremental learning; the neural network committees generated by INL do not interact with one another and each sees the same inputs and error signals at the same time; this limited communication makes the INL architecture attractive for parallel implementation. We have applied INL to two vehicle fault diagnostics problems: end-of-line test in auto assembly plants and onboard vehicle misfire detection. These experimental results demonstrate that the INL framework has the capability to successfully perform incremental learning from unbalanced and noisy data. In order to show the general capabilities of INL, we also applied INL to three general machine learning benchmark data sets. The INL systems showed good generalization capabilities in comparison with other well known machine learning algorithms.

[1]  Paul E. Utgoff,et al.  Decision Tree Induction Based on Efficient Tree Restructuring , 1997, Machine Learning.

[2]  Robert E. Schapire,et al.  Theoretical Views of Boosting and Applications , 1999, ALT.

[3]  Christian-A. Bohn An incremental unsupervised learning scheme for function approximation , 1997, Proceedings of International Conference on Neural Networks (ICNN'97).

[4]  José Carlos Príncipe,et al.  Incremental backpropagation learning networks , 1996, IEEE Trans. Neural Networks.

[5]  Ludmila I. Kuncheva,et al.  Switching between selection and fusion in combining classifiers: an experiment , 2002, IEEE Trans. Syst. Man Cybern. Part B.

[6]  Nikola Kasabov,et al.  Dynamic Evolving Neuro-Fuzzy Inference System (DENFIS): On-line learning and Application for Time-Series Prediction , 2000 .

[7]  Christopher G. Atkeson,et al.  Constructive Incremental Learning from Only Local Information , 1998, Neural Computation.

[8]  Lokendra Shastri,et al.  Incremental class learning approach and its application to handwritten digit recognition , 2002, Inf. Sci..

[9]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[10]  Harris Drucker,et al.  Boosting Performance in Neural Networks , 1993, Int. J. Pattern Recognit. Artif. Intell..

[11]  Nikola Kasabov,et al.  Evolving Connectionist Systems: Methods and Applications in Bioinformatics, Brain Study and Intelligent Machines , 2002, IEEE Transactions on Neural Networks.

[12]  Chu Kiong Loo,et al.  Accurate and reliable diagnosis and classification using probabilistic ensemble simplified fuzzy ARTMAP , 2005, IEEE Transactions on Knowledge and Data Engineering.

[13]  Hong Guo,et al.  Neural Learning from Unbalanced Data , 2004, Applied Intelligence.

[14]  Phayung Meesad,et al.  Pattern classification by an incremental learning fuzzy neural network , 1999, IJCNN'99. International Joint Conference on Neural Networks. Proceedings (Cat. No.99CH36339).

[15]  Ah-Hwee Tan,et al.  Rule Extraction: From Neural Architecture to Symbolic Representation , 1995 .

[16]  Richard Granger,et al.  Incremental Learning from Noisy Data , 1986, Machine Learning.

[17]  Paul E. Utgoff,et al.  Incremental Induction of Decision Trees , 1989, Machine Learning.

[18]  Robert E. Schapire,et al.  The strength of weak learnability , 1990, Mach. Learn..

[19]  Hong Guo,et al.  Neural Learning from Unbalanced Data Using Noise Modeling , 2001, IEA/AIE.

[20]  G. V. Puskorius,et al.  A signal processing framework based on dynamic neural networks with application to problems in adaptation, filtering, and classification , 1998, Proc. IEEE.

[21]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[22]  Nikola K. Kasabov,et al.  DENFIS: dynamic evolving neural-fuzzy inference system and its application for time-series prediction , 2002, IEEE Trans. Fuzzy Syst..

[23]  Hyeyoung Park,et al.  On-Line Learning Theory of Soft Committee Machines with Correlated Hidden Units : Steepest Gradient Descent and Natural Gradient Descent , 2002, cond-mat/0212006.

[24]  Yi Lu Murphey,et al.  SVM learning from large training data set , 2003, Proceedings of the International Joint Conference on Neural Networks, 2003..

[25]  Andries Petrus Engelbrecht,et al.  Incremental learning using sensitivity analysis , 1999, IJCNN'99. International Joint Conference on Neural Networks. Proceedings (Cat. No.99CH36339).

[26]  Yoav Freund,et al.  Data filtering and distribution modeling algorithms for machine learning , 1993 .

[27]  Saad,et al.  On-line learning in soft committee machines. , 1995, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[28]  Christian Lebiere,et al.  The Cascade-Correlation Learning Architecture , 1989, NIPS.

[29]  John E. Moody,et al.  Fast Learning in Multi-Resolution Hierarchies , 1988, NIPS.

[30]  Toshio Fukuda,et al.  A neural network architecture for incremental learning , 1995, Neurocomputing.

[31]  Jiun-Hung Chen,et al.  Reducing SVM classification time using multiple mirror classifiers , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[32]  Hong Guo,et al.  Automotive signal diagnostics using wavelets and machine learning , 2000, IEEE Trans. Veh. Technol..

[33]  J. Elman Learning and development in neural networks: the importance of starting small , 1993, Cognition.

[34]  Yi Lu,et al.  A fuzzy system for automotive fault diagnosis: fast rule generation and self-tuning , 2000, IEEE Trans. Veh. Technol..

[35]  L. Breiman Arcing Classifiers , 1998 .

[36]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[37]  Timur Ash,et al.  Dynamic node creation in backpropagation networks , 1989 .

[38]  Stephen Grossberg,et al.  Fuzzy ARTMAP: an adaptive resonance architecture for incremental learning of analog maps , 1992, [Proceedings 1992] IJCNN International Joint Conference on Neural Networks.

[39]  Yi Lu Murphey,et al.  Incremental Learning in a Fuzzy Intelligent System , 1999, IJCAI.

[40]  Holger Schwenk Yoshua Adaptive Boosting of Neural Networks for Character Recognition , 1997 .

[41]  John M. Zelle,et al.  Growing layers of perceptrons: introducing the Extentron algorithm , 1992, [Proceedings 1992] IJCNN International Joint Conference on Neural Networks.

[42]  Fabio Scotti,et al.  Exploiting application locality to design low-complexity, highly performing, and power-aware embedded classifiers , 2006, IEEE Transactions on Neural Networks.

[43]  Sorin Draghici,et al.  The constraint based decomposition (CBD) training architecture , 2001, Neural Networks.

[44]  Tsu-Chang Lee,et al.  Structure level adaptation for artificial neural networks , 1991 .

[45]  L. Breiman Arcing classifier (with discussion and a rejoinder by the author) , 1998 .

[46]  Leslie G. Valiant,et al.  A theory of the learnable , 1984, CACM.

[47]  Chih-Jen Lin,et al.  A comparison of methods for multiclass support vector machines , 2002, IEEE Trans. Neural Networks.

[48]  Marcus Frean,et al.  The Upstart Algorithm: A Method for Constructing and Training Feedforward Neural Networks , 1990, Neural Computation.

[49]  Geoffrey E. Hinton,et al.  Proceedings of the 1988 Connectionist Models Summer School , 1989 .

[50]  J. Nadal,et al.  Learning in feedforward layered networks: the tiling algorithm , 1989 .

[51]  Stephen Grossberg,et al.  Fuzzy ARTMAP: A neural network architecture for incremental supervised learning of analog multidimensional maps , 1992, IEEE Trans. Neural Networks.

[52]  CHEE PENG LIM,et al.  An Incremental Adaptive Network for On-line Supervised Learning and Probability Estimation , 1997, Neural Networks.

[53]  Carla E. Brodley,et al.  Linear Machine Decision Trees , 1991 .

[54]  R. Brits,et al.  A clustering approach to incremental learning for feedforward neural networks , 2001, IJCNN'01. International Joint Conference on Neural Networks. Proceedings (Cat. No.01CH37222).

[55]  T. Ash,et al.  Dynamic node creation in backpropagation networks , 1989, International 1989 Joint Conference on Neural Networks.