Q-learning trajectory planning based on Takagi–Sugeno fuzzy parallel distributed compensation structure of humanoid manipulator

NAO is the first robot created by SoftBank Robotics. Famous around the world, NAO is a tremendous programming tool and he has especially become a standard in education and research. Aiming at the large error and poor stability of the humanoid robot NAO manipulator during trajectory tracking, a novel framework based on fuzzy controller reinforcement learning trajectory planning strategy is proposed. Firstly, the Takagi–Sugeno fuzzy model based on the dynamic equation of the NAO right arm is established. Secondly, the design and the gain solution of the state feedback controller based on the parallel feedback compensation strategy are studied. Finally, the ideal trajectory of the motion is planned by reinforcement learning algorithm so that the end of the manipulator can track the desired trajectory and realize the valid obstacle avoidance. Simulation and experiment shows that the end of the manipulator based on this scheme has good controllability and stability and can meet the accuracy requirements of trajectory tracking accuracy, which verifies the effectiveness of the proposed framework.

[1]  Hui Yu,et al.  Physics Inspired Methods for Crowd Video Surveillance and Analysis: A Survey , 2018, IEEE Access.

[2]  Ankita Ranjan,et al.  Identification and control of NAO humanoid robot to grasp an object using monocular vision , 2017, 2017 Second International Conference on Electrical, Computer and Communication Technologies (ICECCT).

[3]  Shinichi Hirai,et al.  Soft Gripper Dynamics Using a Line-Segment Model With an Optimization-Based Parameter Identification Method , 2017, IEEE Robotics and Automation Letters.

[4]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[5]  Carlos E. Castañeda,et al.  Real-Time Decentralized Neural Control via Backstepping for a Robotic Arm Powered by Industrial Servomotors , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[6]  Gora Chand Nandi,et al.  NAO humanoid robot: Analysis of calibration techniques for robot sketch drawing , 2016, Robotics Auton. Syst..

[7]  Gora Chand Nandi,et al.  Development of a self reliant humanoid robot for sketch drawing , 2017, Multimedia Tools and Applications.

[8]  Zhijing Zhang,et al.  Study on the Accuracy of a 6-DOF Flexure Hinge-Based Robot , 2011 .

[9]  K. M. Goher,et al.  Assessment of personal care and medical robots from older adults’ perspective , 2017, Robotics and biomimetics.

[10]  Genshe Chen,et al.  Eliminating drift of the head gesture reference to enhance Google Glass-based control of an NAO humanoid robot , 2017 .

[11]  Hak-Keung Lam,et al.  LMI-Based Stability Analysis for Fuzzy-Model-Based Control Systems Using Artificial T–S Fuzzy Model , 2011, IEEE Transactions on Fuzzy Systems.

[12]  SinghAvinash Kumar NAO humanoid robot , 2016 .

[13]  Kazuo Tanaka,et al.  An approach to fuzzy control of nonlinear systems: stability and design issues , 1996, IEEE Trans. Fuzzy Syst..

[14]  Gholamreza Anbarjafari,et al.  Real-time ensemble based face recognition system for NAO humanoids using local binary pattern , 2017, Analog Integrated Circuits and Signal Processing.

[15]  Dirk Helbing,et al.  Simulating dynamical features of escape panic , 2000, Nature.

[16]  Toshio Fukuda,et al.  Reinforcement Learning of Manipulation and Grasping Using Dynamical Movement Primitives for a Humanoidlike Mobile Manipulator , 2017, IEEE/ASME Transactions on Mechatronics.

[17]  Philip Webb,et al.  The Development of a Scale to Evaluate Trust in Industrial Human-robot Collaboration , 2015, International Journal of Social Robotics.

[18]  Ary Setijadi Prihatmanto,et al.  Design and implementation of kinematics model and trajectory planning for NAO humanoid robot in a tic-tac-toe board game , 2014, 2014 IEEE 4th International Conference on System Engineering and Technology (ICSET).

[19]  S. O. Fadlallah,et al.  Bacterial foraging-optimized PID control of a two-wheeled machine with a two-directional handling mechanism , 2017, Robotics and biomimetics.

[20]  Masayuki Shimizu,et al.  Analytical inverse kinematics for 5-DOF humanoid manipulator under arbitrarily specified unconstrained orientation of end-effector , 2014, Robotica.

[21]  Eun Ja Lee Teaching Korean Language and Culture Using the Myth of Dangun , 2017 .

[22]  Daniel R. Parisi,et al.  A modification of the Social Force Model can reproduce experimental data of pedestrian flows in normal conditions , 2009 .

[23]  Philippe Martinet,et al.  The Kinematics, Dynamics and Control of a Flying Parallel Robot With Three Quadrotors , 2018, IEEE Robotics and Automation Letters.

[24]  Naoji Shiroma,et al.  Collision-Free Trajectory Planning for a 3-DoF Robot with a Passive Joint , 2000, Int. J. Robotics Res..

[25]  Michio Sugeno,et al.  Fuzzy identification of systems and its applications to modeling and control , 1985, IEEE Transactions on Systems, Man, and Cybernetics.