Development of Deep Learning Algorithm for Humanoid Robots to Walk to the Target Using Semantic Segmentation and Deep Q Network

In this article, a new algorithm incorporating deep semantic segmentation algorithm and deep reinforcement algorithm is proposed to avoid the obstacle. This work was generated from two parts. The first part included semantic segmentation by using mini-Unet. The objects were detected and recognized. In the second part, Deep Q Network (DQN) was used for humanoid robots to learn to walk to target. The obtained results showed that the performance of proposed algorithm confirmed.

[1]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Tsuyoshi Murata,et al.  {m , 1934, ACML.

[3]  Jesper Smith,et al.  Team IHMC's Lessons Learned from the DARPA Robotics Challenge: Finding Data in the Rubble , 2017, J. Field Robotics.

[4]  Zenghui Wang,et al.  Deep Convolutional Neural Networks for Image Classification: A Comprehensive Review , 2017, Neural Computation.

[5]  Nathan Lay,et al.  Automatic magnetic resonance prostate segmentation by deep learning with holistically nested networks , 2017, Journal of medical imaging.

[6]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Frank Nicolas Dittmann,et al.  Robots: The 500-year Quest to Make Machines Human , 2018 .

[8]  Cüneyt Güzeliş,et al.  Fast Object Recognition for Humanoid Robots by Using Deep Learning Models with Small Structure , 2020, 2020 International Conference on INnovations in Intelligent SysTems and Applications (INISTA).

[9]  García Vázquez,et al.  Vision-Based Locomotion for Humanoid Robots , 2013 .

[10]  Bo Chen,et al.  MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications , 2017, ArXiv.

[11]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[12]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[13]  Çagri Kaymak,et al.  A Brief Survey and an Application of Semantic Image Segmentation for Autonomous Driving , 2018, Handbook of Deep Learning Applications.

[14]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[16]  Ayşegül Uçar,et al.  Gesture imitation and recognition using Kinect sensor and extreme learning machines , 2016 .

[17]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[18]  Demis Hassabis,et al.  Mastering the game of Go with deep neural networks and tree search , 2016, Nature.

[19]  David Silver,et al.  Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.

[20]  Cüneyt Güzelis,et al.  An Implementation of Vision Based Deep Reinforcement Learning for Humanoid Robot Locomotion , 2019, 2019 IEEE International Symposium on INnovations in Intelligent SysTems and Applications (INISTA).

[21]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[22]  Tom Schaul,et al.  Dueling Network Architectures for Deep Reinforcement Learning , 2015, ICML.

[23]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[24]  Matthew D. Zeiler ADADELTA: An Adaptive Learning Rate Method , 2012, ArXiv.

[25]  Simge Nur Aslan,et al.  End-To-End Learning from Demonstation for Object Manipulation of Robotis-Op3 Humanoid Robot , 2020, 2020 International Conference on INnovations in Intelligent SysTems and Applications (INISTA).

[26]  Alexandre Benoit,et al.  Assistance via IoT networking cameras and evidence theory for 3D object instance recognition: Application for the NAO humanoid robot , 2020, Internet Things.

[27]  Aysegül Uçar,et al.  Skin lesion segmentation using fully convolutional networks: A comparative experimental study , 2020, Expert Syst. Appl..