Online Bayesian Transfer Learning for Sequential Data Modeling

We consider the problem of inferring a sequence of hidden states associated with a sequence of observations produced by an individual within a population. Instead of learning a single sequence model for the population (which does not account for variations within the population), we learn a set of basis sequence models based on different individuals. The sequence of hidden states for a new individual is inferred in an online fashion by estimating a distribution over the basis models that best explain the sequence of observations of this new individual. We explain how to do this in the context of hidden Markov models with Gaussian mixture models that are learned based on streaming data by online Bayesian moment matching. The resulting transfer learning technique is demonstrated with three real-word applications: activity recognition based on smartphone sensors, sleep classification based on electroencephalography data and the prediction of the direction of future packet flows between a pair of servers in telecommunication networks.

[1]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[2]  Peter Stone,et al.  Transfer Learning for Reinforcement Learning Domains: A Survey , 2009, J. Mach. Learn. Res..

[3]  Ling Shao,et al.  Transfer Learning for Visual Categorization: A Survey , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[4]  Pascal Poupart,et al.  Online and Distributed learning of Gaussian mixture models by Bayesian Moment Matching , 2016, ArXiv.

[5]  Han Zhao,et al.  Online and Distributed Bayesian Moment Matching for Parameter Learning in Sum-Product Networks , 2016, AISTATS.

[6]  Georg Dorffner,et al.  Computer-Assisted Automated Scoring of Polysomnograms Using the Somnolyzer System. , 2015, Sleep.

[7]  Sethuraman Panchanathan,et al.  Topology Preserving Domain Adaptation for Addressing Subject Based Variability in SEMG Signal , 2011, AAAI Spring Symposium: Computational Physiology.

[8]  Yiqiang Chen,et al.  Cross-People Mobile-Phone Based Activity Recognition , 2011, IJCAI.

[9]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[10]  Xin Liu,et al.  PyEEG: An Open Source Python Module for EEG/MEG Feature Extraction , 2011, Comput. Intell. Neurosci..

[11]  Pascal Poupart,et al.  Online Bayesian Moment Matching for Topic Modeling with Unknown Number of Topics , 2016, NIPS.

[12]  Masa-aki Sato,et al.  Online Model Selection Based on the Variational Bayes , 2001, Neural Computation.

[13]  F. Wilcoxon SOME RAPID APPROXIMATE STATISTICAL PROCEDURES , 1950 .

[14]  Leslie Pack Kaelbling,et al.  Activity Recognition from Physiological Data using Conditional Random Fields , 2006 .

[15]  Mohamed Moshrefi-Torbati,et al.  Signal processing techniques applied to human sleep EEG signals - A review , 2014, Biomed. Signal Process. Control..

[16]  P. Jennum,et al.  Automatic Sleep Scoring in Normals and in Individuals With Neurodegenerative Disorders According to New International Sleep Scoring Criteria , 2010, Journal of clinical neurophysiology : official publication of the American Electroencephalographic Society.

[17]  Qiang Yang,et al.  Boosting for transfer learning , 2007, ICML '07.

[18]  Chandan K. Reddy,et al.  Adaptive Boosting for Transfer Learning Using Dynamic Updates , 2011, ECML/PKDD.

[19]  Farheen Omar,et al.  Online Bayesian Learning in Probabilistic Graphical Models using Moment Matching with Applications , 2016 .

[20]  Zhitang Chen,et al.  Online Algorithms for Sum-Product Networks with Continuous Variables , 2016, Probabilistic Graphical Models.

[21]  Chong Wang,et al.  Online Variational Inference for the Hierarchical Dirichlet Process , 2011, AISTATS.

[22]  Yi Yao,et al.  Boosting for transfer learning with multiple sources , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[23]  Sudhansu Chokroverty,et al.  Sleep and Parkinson's disease: A review of case‐control polysomnography studies , 2012, Movement disorders : official journal of the Movement Disorder Society.

[24]  Diane J. Cook,et al.  Transfer learning for activity recognition: a survey , 2013, Knowledge and Information Systems.

[25]  Geoffrey E. Hinton,et al.  Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[26]  R. Rosenberg,et al.  The American Academy of Sleep Medicine inter-scorer reliability program: sleep stage scoring. , 2013, Journal of clinical sleep medicine : JCSM : official publication of the American Academy of Sleep Medicine.

[27]  M. Degroot Optimal Statistical Decisions , 1970 .

[28]  S. Chokroverty,et al.  The visual scoring of sleep in adults. , 2007, Journal of clinical sleep medicine : JCSM : official publication of the American Academy of Sleep Medicine.

[29]  A. Pack,et al.  Performance of an automated polysomnography scoring system versus computer-assisted manual scoring. , 2013, Sleep.

[30]  Diane J. Cook,et al.  Transferring Learned Activities in Smart Environments , 2009, Intelligent Environments.

[31]  Y. Nesterov A method for solving the convex programming problem with convergence rate O(1/k^2) , 1983 .

[32]  Jing Peng,et al.  An Efficient Gradient-Based Algorithm for On-Line Training of Recurrent Network Trajectories , 1990, Neural Computation.

[33]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[34]  Geoffrey E. Hinton,et al.  On the importance of initialization and momentum in deep learning , 2013, ICML.