Model predictive ship collision avoidance based on Q-learning beetle swarm antenna search and neural networks

Abstract Real-time collision avoidance with full consideration of ship maneuverability, collision risks and International Regulations for Preventing Collisions at Sea (COLREGs) is difficult in multi-ship encounters. To deal with this problem, a novel method is proposed based on model predictive control (MPC), an improved Q-learning beetle swarm antenna search (I-Q-BSAS) algorithm and neural networks. The main idea of this method is to use a neural network to approximate an inverse model based on decisions made with MPC for collision avoidance. Firstly, the predictive collision avoidance strategy is established following the MPC concept incorporating an I-Q-BSAS algorithm to solve the optimization problem. Meanwhile, the relative collision motion states in typical encounters are collected for training an inverse neural network model, which is used as an approximated optimal policy of MPC. Moreover, to deal with uncertain dynamics, the obtained policy is reinforced by long-term retraining based on an aggregation of on-policy and off-policy data. Ship collision avoidance in multi-ship encounters can be achieved by weighting the outputs of the neural network model with respect to different target ships. Simulation experiments under several typical and multi-ship encounters are carried out using the KVLCC2 ship model to verify the effectiveness of the proposed method.

[1]  Xianku Zhang,et al.  Practical Robust Neural Path Following Control for Underactuated Marine Vessels with Actuators Uncertainties , 2017 .

[2]  Rafal Szlapczynski,et al.  Review of ship safety domains: Models and applications , 2017 .

[3]  Visakan Kadirkamanathan,et al.  Stability analysis of the particle dynamics in particle swarm optimizer , 2006, IEEE Transactions on Evolutionary Computation.

[4]  David Clelland,et al.  Automatic simulation of ship navigation , 2011 .

[5]  Rudy R. Negenborn,et al.  Path Planning for Autonomous Inland Vessels Using A*BG , 2016, ICCL.

[6]  Liang Hu,et al.  A Reactive COLREGs-Compliant Navigation Strategy for Autonomous Maritime Navigation , 2016 .

[7]  Reza Safabakhsh,et al.  A novel stability-based adaptive inertia weight for particle swarm optimization , 2016, Appl. Soft Comput..

[8]  Axel Hahn,et al.  Nonlinear Model Predictive Control for trajectory tracking and collision avoidance of underactuated vessels with disturbances , 2018, Ocean Engineering.

[9]  Xiangyu Wang,et al.  Model predictive control strategy for energy optimization of series-parallel hybrid electric vehicle , 2018, Journal of Cleaner Production.

[10]  Bart De Schutter,et al.  Learning-based model predictive control for Markov decision processes , 2005 .

[11]  Yaoguang Wei,et al.  Three-Dimensional Route Planning Based on the Beetle Swarm Optimization Algorithm , 2019, IEEE Access.

[12]  Xin-She Yang,et al.  A literature survey of benchmark functions for global optimisation problems , 2013, Int. J. Math. Model. Numer. Optimisation.

[13]  Hamid Reza Karimi,et al.  An ant colony optimization-based fuzzy predictive control approach for nonlinear processes , 2015, Inf. Sci..

[14]  Weidong Zhang,et al.  Robust Neural Control for Dynamic Positioning Ships With the Optimum-Seeking Guidance , 2017, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[15]  Nguyen Ngoc Son,et al.  A novel adaptive feed-forward-PID controller of a SCARA parallel robot using pneumatic artificial muscle actuator based on neural network and modified differential evolution algorithm , 2017, Robotics Auton. Syst..

[16]  Yunfei Liu,et al.  Design and Research of DC Motor Speed Control System Based on Improved BAS , 2018, 2018 Chinese Automation Congress (CAC).

[17]  Roger J.-B. Wets,et al.  Minimization by Random Search Techniques , 1981, Math. Oper. Res..

[18]  Michel Cabassud,et al.  Design of a neural controller by inverse modelling , 1995 .

[19]  Xinping Yan,et al.  Multi-objective path planning for unmanned surface vehicle with currents effects. , 2018, ISA transactions.

[20]  Wei He,et al.  Adaptive Neural Network Control of an Uncertain Robot With Full-State Constraints , 2016, IEEE Transactions on Cybernetics.

[21]  A. Lazarowska Ship's Trajectory Planning for Collision Avoidance at Sea Based on Ant Colony Optimisation , 2015 .

[22]  Key-Pyo Rhee,et al.  A study on the collision avoidance of a ship using neural networks and fuzzy logic , 2012 .

[23]  C. Guedes Soares,et al.  Fuzzy logic based decision making system for collision avoidance of ocean navigation under critical collision conditions , 2011 .

[24]  Yue Qiao,et al.  The Research on Ship Intelligence Navigation in Confined Waters , 2012 .

[25]  Hans Hopman,et al.  Distributed model predictive control for vessel train formations of cooperative multi-vessel systems , 2018, Transportation Research Part C: Emerging Technologies.

[26]  Xinyu Li,et al.  Measures to diminish the parameter drift in the modeling of ship manoeuvring using system identification , 2017 .

[27]  C. Guedes Soares,et al.  Intelligent Ocean Navigation and Fuzzy-Bayesian Decision/Action Formulation , 2012, IEEE Journal of Oceanic Engineering.

[28]  Yuanlong Wei,et al.  A path planning algorithm using artificial potential field based on probability map , 2011, 2011 8th International Conference on Ubiquitous Robots and Ambient Intelligence (URAI).

[29]  Guifeng Zheng,et al.  HAS: Hierarchical A-Star Algorithm for Big Map Navigation in Special Areas , 2014, 2014 5th International Conference on Digital Home.

[30]  H. Yasukawa,et al.  Introduction of MMG standard method for ship maneuvering predictions , 2015 .

[31]  Xiumin Chu,et al.  Predictive path following based on adaptive line-of-sight for underactuated autonomous surface vessels , 2018 .

[32]  Marcello Farina,et al.  Formation control and collision avoidance of unicycle robots with distributed predictive control , 2015 .

[33]  Yuanqing Xia,et al.  Distributed MPC for formation of multi-agent systems with collision avoidance and obstacle avoidance , 2017, J. Frankl. Inst..

[34]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[35]  Axel Hahn,et al.  NMPC-based Trajectory Tracking and Collison Avoidance of Underactuated Vessels with Elliptical Ship Domain , 2016 .

[36]  Huanxin Chen,et al.  BSAS: Beetle Swarm Antennae Search Algorithm for Optimization Problems , 2018, ArXiv.

[37]  Qiang Liu,et al.  Beetle Swarm Optimization Algorithm: Theory and Application , 2018, Filomat.

[38]  J. M. Maestre,et al.  Distributed Model Predictive Control: An Overview and Roadmap of Future Research Opportunities , 2014, IEEE Control Systems.

[39]  Gabriel Lodewijks,et al.  Closed-loop scheduling and control of waterborne AGVs for energy-efficient Inter Terminal Transport , 2017 .

[40]  A Hammer,et al.  A SAFE WAY OF COLLISION AVOIDANCE MANOEUVRE BASED ON MANOEUVRING STANDARD USING FUZZY REASONING MODEL , 1993 .

[41]  Huarong Zheng,et al.  Predictive path following with arrival time awareness for waterborne AGVs , 2016 .

[42]  Shuai Li,et al.  Beetle Antennae Search without Parameter Tuning (BAS-WPT) for Multi-objective Optimization , 2017, Filomat.

[43]  Huang Liwen,et al.  Ship Collision Avoidance Path Planning by PSO Based on Maneuvering Equation , 2012 .

[44]  Mehmet Fatih Amasyali,et al.  Decision support system for collision avoidance of vessels , 2014, Appl. Soft Comput..

[45]  Xu Dan,et al.  Robust model predictive control for greenhouse temperature based on particle swarm optimization , 2018, Information Processing in Agriculture.

[46]  Rudy R. Negenborn,et al.  Robust Distributed Predictive Control of Waterborne AGVs—A Cooperative and Cost-Effective Approach , 2018, IEEE Transactions on Cybernetics.

[47]  Ming-Cheng Tsou,et al.  THE STUDY OF SHIP COLLISION AVOIDANCE ROUTE PLANNING BY ANT COLONY ALGORITHM , 2010 .

[48]  Jun Teng,et al.  Beetle swarm optimisation for solving investment portfolio problems , 2018 .

[49]  Lokukaluge P. Perera,et al.  Experimental Evaluations on Ship Autonomous Navigation and Collision Avoidance by Intelligent Guidance , 2015, IEEE Journal of Oceanic Engineering.

[50]  Chee Peng Lim,et al.  A new Reinforcement Learning-based Memetic Particle Swarm Optimizer , 2016, Appl. Soft Comput..

[51]  Jialun Liu,et al.  Distributed coordination for collision avoidance of multiple ships considering ship maneuverability , 2019, Ocean Engineering.

[52]  Yong Yin,et al.  Fast Path Planning for Autonomous Ships in Restricted Waters , 2018, Applied Sciences.

[53]  Swagatam Das,et al.  A closed loop stability analysis and parameter selection of the Particle Swarm Optimization dynamics for faster convergence , 2007, 2007 IEEE Congress on Evolutionary Computation.

[54]  George W. Irwin,et al.  COLREGs-based collision avoidance strategies for unmanned surface vehicles , 2012 .

[55]  Agnieszka Lazarowska A New Potential Field Inspired Path Planning Algorithm for Ships , 2018, 2018 23rd International Conference on Methods & Models in Automation & Robotics (MMAR).

[56]  Zengqiang Chen,et al.  New Chaotic PSO-Based Neural Network Predictive Control for Nonlinear Process , 2007, IEEE Transactions on Neural Networks.

[57]  Weidong Zhang,et al.  Concise deep reinforcement learning obstacle avoidance for underactuated unmanned marine vessels , 2018, Neurocomputing.

[58]  Bo Yan,et al.  Improved hierarchical A-star algorithm for optimal parking path planning of the large parking lot , 2014, 2014 IEEE International Conference on Information and Automation (ICIA).

[59]  Li Yu,et al.  Low-Order Stabilization of LTI Systems With Time Delay , 2009, IEEE Transactions on Automatic Control.

[60]  Xiaoming Xu,et al.  Quantitative Performance Design of a Modified Smith Predictor for Unstable Processes with Time Delay , 2004 .

[61]  Jing Zhang,et al.  Autonomous Ship Safe Navigation using Smoothing A* Algorithm , 2015 .

[62]  C. T. Stockel,et al.  A Computer Simulation of Multi-Ship Encounters , 1982, Journal of Navigation.

[63]  Shuai Li,et al.  BAS: Beetle Antennae Search Algorithm for Optimization Problems , 2017, ArXiv.

[64]  Guichen Li,et al.  Optimized neural network using beetle antennae search for predicting the unconfined compressive strength of jet grouting coalcretes , 2019, International Journal for Numerical and Analytical Methods in Geomechanics.

[65]  Masaaki Inaishi,et al.  BASIC RESEARCH ON A COLLISION AVOIDANCE SYSTEM USING NEURAL NETWORKS , 1992 .

[66]  C. Su,et al.  Decision Support from Genetic Algorithms for Ship Collision Avoidance Route Planning and Alerts , 2010 .

[67]  Guimin Chen,et al.  A Particle Swarm Optimizer with Multi-stage Linearly-Decreasing Inertia Weight , 2009, 2009 International Joint Conference on Computational Sciences and Optimization.