论文信息 - MPC policy learning using DNN for human following control without collision

MPC policy learning using DNN for human following control without collision

Abstract Model predictive control has recently been applied to a wide variety of motion control systems. Model predictive control can be used to generate optimized control inputs with excellent performance considering inequality constraints to the control inputs, control outputs, and state variables. However, the computational load for this method is too heavy for implementation in most actual systems because the quadratic programming problem must be solved within the sampling period. As the number of inequality constraints, control variables, and state variables in the control system increases, more calculation time is required. In this study, a deep neural network designed to learn the model predictive control policy was developed to reduce the computational load. It is expected that a relatively small neural network can be used to learn the model predictive control policy. In the proposed system, the motion controller calculates the learned neural network in real time instead of solving the quadratic programming problem, realizing almost the same control performance as the original model predictive control approach. The effectiveness of the proposed approach was verified by applying it to the control of a personal robot designed to follow the user, which can provide daily support to the elderly. In Matlab simulations, the calculation time for the proposed approach was approximately times faster than that of the conventional method of solving the quadratic programming problem. In addition, an experiment using an actual personal robot was conducted to confirm the control performance.

[1] Manfred Morari,et al. A Multiresolution Approximation Method for Fast Explicit Model Predictive Control , 2011, IEEE Transactions on Automatic Control.

[2] Noriaki Hirose,et al. Modeling of rolling friction by recurrent neural network using LSTM , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[3] Noriaki Hirose,et al. Personal Robot Assisting Transportation to Support Active Human Life - Following Control based on Model Predictive Control with Multiple Future Predictions - , 2015 .

[4] Minoru Tanaka,et al. Personal robot assisting transportation to support active human life — Reference generation based on model predictive control for robust quick turning , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[5] Chun-Yi Su,et al. Vision-Based Model Predictive Control for Steering of a Nonholonomic Mobile Robot , 2016, IEEE Transactions on Control Systems Technology.

[6] Silvio Savarese,et al. To Go or Not To Go? A Near Unsupervised Learning Approach For Robot Navigation , 2017, ArXiv.

[7] Silvio Savarese,et al. Tracking the Untrackable: Learning to Track Multiple Cues with Long-Term Dependencies , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[8] Usdhhs. Physical Activity and Health: A Report of the Surgeon General , 1996 .

[9] Jan Peters,et al. Model learning for robot control: a survey , 2011, Cognitive Processing.

[10] Noriaki Hirose,et al. Personal robot assisting transportation to support active human life — Human-following method based on model predictive control for adjacency without collision , 2015, 2015 IEEE International Conference on Mechatronics (ICM).

[11] Alberto Bemporad,et al. The explicit linear quadratic regulator for constrained systems , 2003, Autom..

[12] Noriaki Hirose,et al. Personal robot assisting transportation to support active human life — Posture stabilization based on feedback compensation of lateral acceleration , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[13] Kazuhito Yokoi,et al. Biped walking pattern generation by using preview control of zero-moment point , 2003, 2003 IEEE International Conference on Robotics and Automation (Cat. No.03CH37422).

[14] Saad Mekhilef,et al. Modified Model Predictive Control of a Bidirectional AC–DC Converter Based on Lyapunov Function for Energy Storage Systems , 2016, IEEE Transactions on Industrial Electronics.

[15] L. Biegler,et al. Quadratic programming methods for reduced Hessian SQP , 1994 .

[16] Minoru Tanaka,et al. Following control approach based on model predictive control for wheeled inverted pendulum robot , 2016, Adv. Robotics.

[17] Il Hong Suh,et al. Marathoner tracking algorithms for a high speed mobile robot , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[18] Xin Li,et al. Model Predictive Current Control of Switched Reluctance Motors With Inductance Auto-Calibration , 2016, IEEE Transactions on Industrial Electronics.

[19] Samir Kouro,et al. Model Predictive Control: MPC's Role in the Evolution of Power Electronics , 2015, IEEE Industrial Electronics Magazine.

[20] Sergey Levine,et al. Learning deep control policies for autonomous aerial vehicles with MPC-guided policy search , 2015, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[21] Stephen P. Boyd,et al. Fast Model Predictive Control Using Online Optimization , 2010, IEEE Transactions on Control Systems Technology.

[22] Jan M. Maciejowski,et al. Predictive control : with constraints , 2002 .