Robust Online Learning Method Based on Dynamical Linear Quadratic Regulator

In this paper, a novel algorithm is proposed for inferring online learning tasks efficiently. By a carefully designed scheme, the online learning problem is first formulated as a state feedback control problem for a series of finite-dimensional systems. Then, the online linear quadratic regulator (OLQR) learning algorithm is developed to obtain the optimal parameter updating. Solid mathematical analysis on the convergence and rationality of our method is also provided. Compared with the conventional learning methods, our learning framework represents a completely different approach with optimal control techniques, but does not introduce any assumption on the characteristics of noise or learning rate. The proposed method not only guarantees the fast and robust convergence but also achieves better performance in learning efficiency and accuracy, especially for the data streams with complex noise disturbances. In addition, under the proposed framework, new robust algorithms can be potentially developed for various machine learning tasks by using the powerful optimal control techniques. Numerical results on benchmark datasets and practical applications confirm the advantages of our new method.

[1]  Paul Honeine,et al.  Online Prediction of Time Series Data With Kernels , 2009, IEEE Transactions on Signal Processing.

[2]  Okyay Kaynak,et al.  Sliding Mode Control Approach for Online Learning as Applied to Type-2 Fuzzy Neural Networks and Its Experimental Evaluation , 2012, IEEE Transactions on Industrial Electronics.

[3]  Steven C. H. Hoi,et al.  Online Sparse Passive Aggressive Learning with Kernels , 2016, SDM.

[4]  Xingjian Jing An H∞ control approach to robust learning of feedforward neural networks , 2011, Neural Networks.

[5]  Li Cheng,et al.  Online Identification of Nonlinear Spatiotemporal Systems Using Kernel Learning Approach , 2011, IEEE Transactions on Neural Networks.

[6]  Koby Crammer,et al.  Online Passive-Aggressive Algorithms , 2003, J. Mach. Learn. Res..

[7]  Li Cheng,et al.  An Optimal PID Control Algorithm for Training Feedforward Neural Networks , 2013, IEEE Transactions on Industrial Electronics.

[8]  Tom Fleischer,et al.  Applied Functional Analysis , 2016 .

[9]  Steven C. H. Hoi,et al.  Large Scale Online Kernel Learning , 2016, J. Mach. Learn. Res..

[10]  Johan A. K. Suykens,et al.  LS-SVM REGRESSION WITH AUTOCORRELATED ERRORS , 2006 .

[11]  Xing Jian Jing,et al.  Identification of Nonlinear Spatiotemporal Dynamical Systems With Nonuniform Observations Using Reproducing-Kernel-Based Integral Least Square Regulation , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[12]  Yong-Ping Zhao,et al.  Online independent reduced least squares support vector regression , 2012, Inf. Sci..

[13]  Xingjian Jing,et al.  Online Identification of Nonlinear Stochastic Spatiotemporal System With Multiplicative Noise by Robust Optimal Control-Based Kernel Learning Method , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[14]  Alexander J. Smola,et al.  Online learning with kernels , 2001, IEEE Transactions on Signal Processing.

[15]  Ping Li,et al.  Selective recursive kernel learning for online identification of nonlinear systems with NARX form , 2010 .

[16]  S. Haykin,et al.  Kernel Least‐Mean‐Square Algorithm , 2010 .

[17]  Steven C. H. Hoi,et al.  Sparse Passive-Aggressive Learning for Bounded Online Kernel Methods , 2018, ACM Trans. Intell. Syst. Technol..

[18]  Javier Jorge,et al.  Passive-Aggressive online learning with nonlinear embeddings , 2018, Pattern Recognit..

[19]  Shie Mannor,et al.  The kernel recursive least-squares algorithm , 2004, IEEE Transactions on Signal Processing.

[20]  Gregory Ditzler,et al.  A Novelty Detector and Extreme Verification Latency Model for Nonstationary Environments , 2019, IEEE Transactions on Industrial Electronics.

[21]  Stephen A. Billings,et al.  Model structure selection using an integrated forward orthogonal search algorithm assisted by squared correlation and mutual information , 2008, Int. J. Model. Identif. Control..

[22]  B. Anderson,et al.  Optimal control: linear quadratic methods , 1990 .

[23]  Feng Zou,et al.  Time series prediction with improved neuro-endocrine model , 2013, Neural Computing and Applications.

[24]  Xingjian Jing,et al.  Robust adaptive learning of feedforward neural networks via LMI optimizations , 2012, Neural Networks.

[25]  Theo J. A. de Vries,et al.  Pruning error minimization in least squares support vector machines , 2003, IEEE Trans. Neural Networks.

[26]  Yuan Yao,et al.  Online Learning Algorithms , 2006, Found. Comput. Math..

[27]  Kelvin K. W. Yau,et al.  Predicting electricity energy consumption: A comparison of regression analysis, decision tree and neural networks , 2007 .

[28]  Peng Shi,et al.  Observer and Command-Filter-Based Adaptive Fuzzy Output Feedback Control of Uncertain Nonlinear Systems , 2015, IEEE Transactions on Industrial Electronics.

[29]  Johan A. K. Suykens,et al.  Kernel Regression in the Presence of Correlated Errors , 2011, J. Mach. Learn. Res..

[30]  Jianzhou Wang,et al.  An adaptive fuzzy combination model based on self-organizing map and support vector regression for electric load forecasting , 2012 .

[31]  Shai Shalev-Shwartz,et al.  Online learning: theory, algorithms and applications (למידה מקוונת.) , 2007 .

[32]  Bernard Widrow,et al.  Least-mean-square adaptive filters , 2003 .

[33]  Rong Chen,et al.  Online weighted LS-SVM for hysteretic structural system identification , 2006 .

[34]  James B. Rawlings,et al.  Constrained linear quadratic regulation , 1998, IEEE Trans. Autom. Control..

[35]  Yiming Ying,et al.  Learning Rates of Least-Square Regularized Regression , 2006, Found. Comput. Math..

[36]  Donald E. Kirk,et al.  Optimal control theory : an introduction , 1970 .

[37]  Slobodan Vucetic,et al.  Online Passive-Aggressive Algorithms on a Budget , 2010, AISTATS.

[38]  Badong Chen,et al.  Quantized Kernel Recursive Least Squares Algorithm , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[39]  Steven C. H. Hoi,et al.  Online Adaptive Passive-Aggressive Methods for Non-Negative Matrix Factorization and Its Applications , 2016, CIKM.

[40]  Huijun Gao,et al.  Data-Based Techniques Focused on Modern Industry: An Overview , 2015, IEEE Transactions on Industrial Electronics.

[41]  Pınar Tüfekci,et al.  Prediction of full load electrical power output of a base load operated combined cycle power plant using machine learning methods , 2014 .