Bayesian transfer learning between Student-t filters

Abstract The problem of sequentially transferring a data-predictive probability distribution from a source to a target Bayesian filter is addressed in this paper. In many practical settings, this transfer is incompletely modelled, since the stochastic dependence structure between the filters typically cannot be fully specified. We therefore adopt fully probabilistic design to select the optimal transfer mechanism. We relax the target observation model via a scale-mixing parameter, which proves vital in successfully transferring the first and second moments of the source data predictor. This sensitivity to the transferred second moment ensures that imprecise predictors are rejected, achieving robust transfer. Indeed, Student-t state and observation models are adopted for both learning processes, in order to handle outliers in all hidden and observed variables. A recursive outlier-robust Bayesian transfer learning algorithm is recovered via a local variational Bayes approximation. The outlier rejection and positive transfer properties of the resulting algorithm are clearly demonstrated in a simulated planar position-velocity system, as is the key property of imprecise knowledge rejection (robust transfer), unavailable in current Bayesian transfer algorithms. Performance comparison with particle filter variants demonstrates the successful convergence of our robust variational Bayes transfer learning algorithm in sequential processing.

[1]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[2]  R. E. Kalman,et al.  A New Approach to Linear Filtering and Prediction Problems , 2002 .

[3]  Monish D. Tandale,et al.  Particle Filter for Ballistic Target Tracking with Glint Noise , 2010 .

[4]  Anthony Quinn,et al.  Dynamic Bayesian Knowledge Transfer Between a Pair of Kalman Filters , 2018, 2018 IEEE 28th International Workshop on Machine Learning for Signal Processing (MLSP).

[5]  Nasser M. Nasrabadi,et al.  Pattern Recognition and Machine Learning , 2006, Technometrics.

[6]  David M. Blei,et al.  Frequentist Consistency of Variational Bayes , 2017, Journal of the American Statistical Association.

[7]  Anthony Quinn,et al.  Fully Probabilistic Design for Knowledge Transfer in a Pair of Kalman Filters , 2018, IEEE Signal Processing Letters.

[8]  A. Doucet,et al.  A Tutorial on Particle Filtering and Smoothing: Fifteen years later , 2008 .

[9]  Weidong Zhou,et al.  Robust Linear Filter with Parameter Estimation Under Student-t Measurement Distribution , 2018, Circuits Syst. Signal Process..

[10]  Yonggang Zhang,et al.  A New Process Uncertainty Robust Student’s t Based Kalman Filter for SINS/GPS Integration , 2017, IEEE Access.

[11]  Simo Särkkä,et al.  Recursive outlier-robust filtering and smoothing for nonlinear systems using the multivariate student-t distribution , 2012, 2012 IEEE International Workshop on Machine Learning for Signal Processing.

[12]  Miroslav Kárný,et al.  Towards fully probabilistic control design , 1996, Autom..

[13]  Lubomir M. Hadjiiski,et al.  Multi-task transfer learning deep convolutional neural network: application to computer-aided diagnosis of breast cancer on mammograms , 2017, Physics in medicine and biology.

[14]  V. Šmídl,et al.  The Variational Bayes Method in Signal Processing , 2005 .

[15]  Xiao-Li Hu,et al.  A General Convergence Result for Particle Filtering , 2011, IEEE Transactions on Signal Processing.

[16]  Edward R. Dougherty,et al.  Optimal Bayesian Transfer Learning , 2018, IEEE Transactions on Signal Processing.

[17]  X. R. Li,et al.  Survey of maneuvering target tracking. Part I. Dynamic models , 2003 .

[18]  Fredrik Gustafsson,et al.  Robust Inference for State-Space Models with Skewed Measurement Noise , 2015, IEEE Signal Processing Letters.

[19]  David M. Blei,et al.  Variational Inference: A Review for Statisticians , 2016, ArXiv.

[20]  Michael Roth,et al.  On the Multivariate t Distribution , 2012 .

[21]  Miroslav Kárný,et al.  Axiomatisation of fully probabilistic design , 2012, Inf. Sci..

[22]  Yonggang Zhang,et al.  A robust Gaussian approximate filter for nonlinear systems with heavy tailed measurement noises , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[23]  Simo Srkk,et al.  Bayesian Filtering and Smoothing , 2013 .

[24]  Henry Leung,et al.  A variational Bayesian approach to robust sensor fusion based on Student-t distribution , 2013, Inf. Sci..

[25]  Peter Stone,et al.  Transfer Learning for Reinforcement Learning Domains: A Survey , 2009, J. Mach. Learn. Res..

[26]  Yonggang Zhang,et al.  A robust and efficient system identification method for a state-space model with heavy-tailed process and measurement noises , 2016, Fusion.

[27]  Richard J. Meinhold,et al.  Robustification of Kalman Filter Models , 1989 .

[28]  Rodney W. Johnson,et al.  Axiomatic derivation of the principle of maximum entropy and the principle of minimum cross-entropy , 1980, IEEE Trans. Inf. Theory.

[29]  Kevin P. Murphy,et al.  Machine learning - a probabilistic perspective , 2012, Adaptive computation and machine learning series.

[30]  Taghi M. Khoshgoftaar,et al.  A survey of transfer learning , 2016, Journal of Big Data.

[31]  Václav Smídl,et al.  Variational Bayesian Filtering , 2008, IEEE Transactions on Signal Processing.

[32]  C. Chang,et al.  Kalman filter algorithms for a multi-sensor system , 1976, 1976 IEEE Conference on Decision and Control including the 15th Symposium on Adaptive Processes.

[33]  Lamine Mili,et al.  Robust Kalman Filter Based on a Generalized Maximum-Likelihood-Type Estimator , 2010, IEEE Transactions on Signal Processing.

[34]  Thia Kirubarajan,et al.  Estimation with Applications to Tracking and Navigation: Theory, Algorithms and Software , 2001 .

[35]  Peng Shi,et al.  Robust Kalman Filters Based on Gaussian Scale Mixture Distributions With Application to Target Tracking , 2019, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[36]  Eduardo Mario Nebot,et al.  An outlier-robust Kalman filter , 2011, 2011 IEEE International Conference on Robotics and Automation.

[37]  Tatiana V. Guy,et al.  Optimal design of priors constrained by external predictors , 2017, Int. J. Approx. Reason..

[38]  Stefan Schaal,et al.  Learning an Outlier-Robust Kalman Filter , 2007, ECML.

[39]  Fredrik Gustafsson,et al.  A Student's t filter for heavy tailed process and measurement noise , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[40]  Yonggang Zhang,et al.  A Novel Robust Student's t-Based Kalman Filter , 2017, IEEE Transactions on Aerospace and Electronic Systems.

[41]  Henry Leung,et al.  Student-t mixture labeled multi-Bernoulli filter for multi-target tracking with heavy-tailed noise , 2018, Signal Process..

[42]  Rui Yan,et al.  How Transferable are Neural Networks in NLP Applications? , 2016, EMNLP.

[43]  Milan Papež,et al.  Robust Bayesian Transfer Learning Between Kalman Filters , 2019, 2019 IEEE 29th International Workshop on Machine Learning for Signal Processing (MLSP).