Online metric learning for an adaptation to confidence drift

One of the main aims of lifelong learning architectures is to efficiently and reliably cope with the stability-plasticity dilemma. A viable solution of this dilemma combines a static offline classifier, which preserves ground knowledge that should be respected during training, with an incremental online learning of new or specific information encountered during use. A feasible realisation has been published lately based on intuitive distance-based classifiers using the concept of metric learning (Fischer et al.: Combining offline and online classifiers for life-long learning (OOL), IJCNN'15). One crucial aspect of such a system is how to combine the offline and online model. A generic approach, taken in OOL, uses a dynamic classifier selection strategy based on confidences of both classifiers. This can cause problems in the case of confidence drift, especially when the validity of the confidence estimation of the static offline classifier changes. This pitfall occurs in the context of metric learning whenever the metric tensor of the online system becomes orthogonal to the metric of the offline system, hence the respective internal data description mismatch. We propose an efficient metric learning strategy which allows an online adaptation of an invalid confidence estimation of the OOL architecture in case of confidence drift.

[1]  Xin Yao,et al.  DDD: A New Ensemble Approach for Dealing with Concept Drift , 2012, IEEE Transactions on Knowledge and Data Engineering.

[2]  Kevin W. Bowyer,et al.  Combination of multiple classifiers using local accuracy estimates , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[3]  Marcus A. Maloof,et al.  Dynamic Weighted Majority: An Ensemble Method for Drifting Concepts , 2007, J. Mach. Learn. Res..

[4]  Heiko Wersing,et al.  Interactive online learning for obstacle classification on a mobile robot , 2015, 2015 International Joint Conference on Neural Networks (IJCNN).

[5]  Robi Polikar,et al.  Guest Editorial Learning in Nonstationary and Evolving Environments , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[6]  Martin A. Riedmiller,et al.  Incremental GRLVQ: Learning relevant features for 3D object recognition , 2008, Neurocomputing.

[7]  Vasant Honavar,et al.  Learn++: an incremental learning algorithm for supervised neural networks , 2001, IEEE Trans. Syst. Man Cybern. Part C.

[8]  Jeffrey Queißer,et al.  Using context for the combination of off-line and on-line learning , 2012 .

[9]  Bernhard Sendhoff,et al.  Alleviating Catastrophic Forgetting via Multi-Objective Learning , 2006, The 2006 IEEE International Joint Conference on Neural Network Proceedings.

[10]  Michael Biehl,et al.  Insightful stress detection from physiology modalities using Learning Vector Quantization , 2015, Neurocomputing.

[11]  Ye Xu,et al.  An incremental learning vector quantization algorithm for pattern classification , 2010, Neural Computing and Applications.

[12]  Thomas Villmann,et al.  Limited Rank Matrix Learning, discriminative dimension reduction and visualization , 2012, Neural Networks.

[13]  Qiang Yang,et al.  Lifelong Machine Learning Systems: Beyond Learning Algorithms , 2013, AAAI Spring Symposium: Lifelong Machine Learning.

[14]  Luiz Eduardo Soares de Oliveira,et al.  Dynamic selection of classifiers - A comprehensive review , 2014, Pattern Recognit..

[15]  Koby Crammer,et al.  Adaptive regularization of weight vectors , 2009, Machine Learning.

[16]  Michael Biehl,et al.  Adaptive Relevance Matrices in Learning Vector Quantization , 2009, Neural Computation.

[17]  Gert Cauwenberghs,et al.  Incremental and Decremental Support Vector Machine Learning , 2000, NIPS.

[18]  Motoaki Kawanabe,et al.  Machine Learning in Non-Stationary Environments - Introduction to Covariate Shift Adaptation , 2012, Adaptive computation and machine learning.

[19]  Sebastian Thrun,et al.  Lifelong robot learning , 1993, Robotics Auton. Syst..

[20]  Jerzy Stefanowski,et al.  Reacting to Different Types of Concept Drift: The Accuracy Updated Ensemble Algorithm , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[21]  Marco Wiering,et al.  2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) , 2011, IJCNN 2011.

[22]  Horst-Michael Groß,et al.  A life-long learning vector quantization approach for interactive learning of multiple categories , 2012, Neural Networks.

[23]  Amaury Habrard,et al.  Robustness and generalization for metric learning , 2012, Neurocomputing.

[24]  Heiko Wersing,et al.  Certainty-based prototype insertion/deletion for classification with metric adaptation , 2015, ESANN.

[25]  Heiko Wersing,et al.  Combining offline and online classifiers for life-long learning , 2015, 2015 International Joint Conference on Neural Networks (IJCNN).

[26]  Katsumi Inoue,et al.  Learning revised models for planning in adaptive systems , 2013, 2013 35th International Conference on Software Engineering (ICSE).

[27]  Yoshua Bengio,et al.  An Empirical Investigation of Catastrophic Forgeting in Gradient-Based Neural Networks , 2013, ICLR.

[28]  Nicolai Petkov,et al.  Adaptive Matrices and Filters for Color Texture Classification , 2012, Journal of Mathematical Imaging and Vision.

[29]  Heiko Wersing,et al.  A biologically motivated visual memory architecture for online learning of objects , 2008, Neural Networks.

[30]  Marc Sebban,et al.  A Survey on Metric Learning for Feature Vectors and Structured Data , 2013, ArXiv.

[31]  Michael Biehl,et al.  Analysis of Flow Cytometry Data by Matrix Relevance Learning Vector Quantization , 2013, PloS one.

[32]  Amar Mitiche,et al.  Classifier combination for hand-printed digit recognition , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[33]  Frank-Michael Schleif,et al.  Adaptive conformal semi-supervised vector quantization for dissimilarity data , 2014, Pattern Recognit. Lett..

[34]  Slobodan Vucetic,et al.  Learning Vector Quantization with adaptive prototype addition and removal , 2009, 2009 International Joint Conference on Neural Networks.

[35]  Klaus Obermayer,et al.  Soft Learning Vector Quantization , 2003, Neural Computation.

[36]  Sebastian Thrun,et al.  Online Speed Adaptation Using Supervised Learning for High-Speed, Off-Road Autonomous Driving , 2007, IJCAI.

[37]  Gregory Ditzler,et al.  Incremental Learning of Concept Drift from Streaming Imbalanced Data , 2013, IEEE Transactions on Knowledge and Data Engineering.

[38]  David A. Clifton,et al.  A review of novelty detection , 2014, Signal Process..

[39]  Cesare Alippi,et al.  Just-In-Time Classifiers for Recurrent Concepts , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[40]  Horst-Michael Groß,et al.  A vision architecture for unconstrained and incremental learning of multiple categories , 2009, Memetic Comput..

[41]  Robi Polikar,et al.  COMPOSE: A Semisupervised Learning Framework for Initially Labeled Nonstationary Streaming Data , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[42]  B.V. Dasarathy,et al.  A composite classifier system design: Concepts and methodology , 1979, Proceedings of the IEEE.

[43]  Hamid Beigy,et al.  Using a classifier pool in accuracy based tracking of recurring concepts in data stream classification , 2013, Evol. Syst..

[44]  Heiko Wersing,et al.  Rapid Online Learning of Objects in a Biologically Motivated Recognition Architecture , 2005, DAGM-Symposium.

[45]  Haibo He,et al.  Incremental Learning From Stream Data , 2011, IEEE Transactions on Neural Networks.

[46]  Thomas Villmann,et al.  Distance Measures for Prototype Based Classification , 2013, BrainComp.

[47]  Atsushi Sato,et al.  Generalized Learning Vector Quantization , 1995, NIPS.

[48]  Teuvo Kohonen,et al.  Self-Organization and Associative Memory , 1988 .

[49]  João Gama,et al.  A survey on concept drift adaptation , 2014, ACM Comput. Surv..

[50]  Gyan Bhanot,et al.  Inter-species prediction of protein phosphorylation in the sbv IMPROVER species translation challenge , 2015, Bioinform..

[51]  Heiko Wersing,et al.  Efficient rejection strategies for prototype-based classification , 2015, Neurocomputing.

[52]  Pablo A. Estévez,et al.  A review of learning vector quantization classifiers , 2013, Neural Computing and Applications.