论文信息 - An Improved Data Augmentation Scheme for Model Predictive Control Policy Approximation

An Improved Data Augmentation Scheme for Model Predictive Control Policy Approximation

This paper considers the problem of data generation for MPC policy approximation. Learning an approximate MPC policy from expert demonstrations requires a large data set consisting of optimal state-action pairs, sampled across the feasible state space. Yet, the key challenge of efficiently generating the training samples has not been studied widely. Recently, a sensitivity-based data augmentation framework for MPC policy approximation was proposed, where the parametric sensitivities are exploited to cheaply generate several additional samples from a single offline MPC computation. The error due to augmenting the training data set with inexact samples was shown to increase with the size of the neighborhood around each sample used for data augmentation. Building upon this work, this letter paper presents an improved data augmentation scheme based on predictor-corrector steps that enforces a user-defined level of accuracy, and shows that the error bound of the augmented samples are independent of the size of the neighborhood used for data augmentation.

D. Krishnamoorthy

[1] D. Krishnamoorthy. A Sensitivity-Based Data Augmentation Framework for Model Predictive Control Policy Approximation , 2020, IEEE Transactions on Automatic Control.

[2] Steven W. Chen,et al. Large Scale Model Predictive Control with Neural Networks and Primal Active Sets , 2019, Autom..

[3] Stephen J. Wright,et al. Industrial, large-scale model predictive control with structured neural networks , 2021, Comput. Chem. Eng..

[4] Benjamin Karg,et al. Approximate moving horizon estimation and robust nonlinear model predictive control via deep learning , 2021, Comput. Chem. Eng..

[5] Monimoy Bujarbaruah,et al. Near-Optimal Rapid MPC Using Neural Networks: A Primal-Dual Policy Learning Framework , 2019, IEEE Transactions on Control Systems Technology.

[6] Teodoro Alamo,et al. Probabilistic performance validation of deep learning‐based robust NMPC controllers , 2019, International Journal of Robust and Nonlinear Control.

[7] A. Mesbah,et al. An Adaptive Correction Scheme for Offset-Free Asymptotic Performance in Deep Learning-based Economic MPC , 2021, IFAC-PapersOnLine.

[8] J. Kober,et al. Interactive Imitation Learning in State-Space , 2020, CoRL.

[9] Joel A. Paulson,et al. Approximate Closed-Loop Robust Model Predictive Control With Guaranteed Stability and Constraint Satisfaction , 2020, IEEE Control Systems Letters.

[10] Stephen P. Boyd,et al. Fitting a Linear Control Policy to Demonstrations with a Kalman Constraint , 2020, L4DC.

[11] F. Allgöwer,et al. Safe and Fast Tracking on a Robot Manipulator: Robust MPC and Neural Network Control , 2019, IEEE Robotics and Automation Letters.

[12] Benjamin Karg,et al. Efficient Representation and Approximation of Model Predictive Control Laws via Deep Learning , 2018, IEEE Transactions on Cybernetics.

[13] David B. Graves,et al. Toward Safe Dose Delivery in Plasma Medicine using Projected Neural Network-based Fast Approximate NMPC , 2020 .

[14] R. B. Gopaluni,et al. Deep Neural Network Approximation of Nonlinear Model Predictive Control , 2020 .

[15] Chung Choo Chung,et al. Approximate Model Predictive Control with Recurrent Neural Network for Autonomous Driving Vehicles , 2019, 2019 58th Annual Conference of the Society of Instrument and Control Engineers of Japan (SICE).

[16] T. Khoshgoftaar,et al. A survey on Image Data Augmentation for Deep Learning , 2019, Journal of Big Data.

[17] Francesco Borrelli,et al. Safe and Near-Optimal Policy Learning for Model Predictive Control using Primal-Dual Neural Networks , 2019, 2019 American Control Conference (ACC).

[18] M. Diehl,et al. CasADi: a software framework for nonlinear optimization and optimal control , 2018, Math. Program. Comput..

[19] Julian Nubert,et al. Learning-based Approximate Model Predictive Control with Guarantees: Joining Neural Networks with Recent Robust MPC , 2019 .

[20] Frank Allgöwer,et al. Learning an Approximate Model Predictive Controller With Guarantees , 2018, IEEE Control Systems Letters.

[21] Vijay Kumar,et al. Approximating Explicit Model Predictive Control Using Constrained Neural Networks , 2018, 2018 Annual American Control Conference (ACC).

[22] Damien Picard,et al. Approximate model predictive building control via machine learning , 2018 .

[23] Geoff S. Nitschke,et al. Improving Deep Learning with Generic Data Augmentation , 2018, 2018 IEEE Symposium Series on Computational Intelligence (SSCI).

[24] Gustavo Carneiro,et al. A Bayesian Data Augmentation Approach for Learning Deep Models , 2017, NIPS.

[25] Dario Izzo,et al. Learning the optimal state-feedback using deep networks , 2016, 2016 IEEE Symposium Series on Computational Intelligence (SSCI).

[26] L. Biegler,et al. Optimal sensitivity based on IPOPT , 2012, Mathematical Programming Computation.

[27] Stephen P. Boyd,et al. Imputing a convex objective function , 2011, 2011 IEEE International Symposium on Intelligent Control.

[28] Geoffrey J. Gordon,et al. A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning , 2010, AISTATS.

[29] Manuela M. Veloso,et al. Interactive Policy Learning through Confidence-Based Autonomy , 2014, J. Artif. Intell. Res..

[30] Brett Browning,et al. Learning robot motion control with demonstration and advice-operators , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[31] Hannu T. Toivonen,et al. A neural network model predictive controller , 2006 .

[32] Lorenz T. Biegler,et al. On the implementation of an interior-point filter line-search algorithm for large-scale nonlinear programming , 2006, Math. Program..

[33] Alberto Bemporad,et al. The explicit linear quadratic regulator for constrained systems , 2003, Autom..

[34] Alexander Shapiro,et al. Optimization Problems with Perturbations: A Guided Tour , 1998, SIAM Rev..

[35] Thomas Parisini,et al. A receding-horizon regulator for nonlinear systems and a neural approximation , 1995, Autom..

[36] Anthony V. Fiacco,et al. Sensitivity analysis for nonlinear programming using penalty methods , 1976, Math. Program..