Real-time Active Vision for a Humanoid Soccer Robot Using Deep Reinforcement Learning

In this paper, we present an active vision method using a deep reinforcement learning approach for a humanoid soccer-playing robot. The proposed method adaptively optimises the viewpoint of the robot to acquire the most useful landmarks for self-localisation while keeping the ball into its viewpoint. Active vision is critical for humanoid decision-maker robots with a limited field of view. To deal with an active vision problem, several probabilistic entropy-based approaches have previously been proposed which are highly dependent on the accuracy of the self-localisation model. However, in this research, we formulate the problem as an episodic reinforcement learning problem and employ a Deep Q-learning method to solve it. The proposed network only requires the raw images of the camera to move the robot's head toward the best viewpoint. The model shows a very competitive rate of 80% success rate in achieving the best viewpoint. We implemented the proposed method on a humanoid robot simulated in Webots simulator. Our evaluations and experimental results show that the proposed method outperforms the entropy-based methods in the RoboCup context, in cases with high self-localisation errors.

[1]  Martín Abadi,et al.  TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems , 2016, ArXiv.

[2]  Mohammad Reza Meybodi,et al.  A hybrid localization method for a soccer playing robot , 2016, 2016 Artificial Intelligence and Robotics (IRANOPEN).

[3]  Michael J. Swain,et al.  Promising directions in active vision , 1993, International Journal of Computer Vision.

[4]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[5]  Thomas Röfer,et al.  Entropy-Based Active Vision for a Humanoid Soccer Robot , 2010, RoboCup.

[6]  Meisam Teimouri,et al.  MRL Team Description Paper for Humanoid KidSize League of RoboCup 2016 , 2012 .

[7]  Wolfram Burgard,et al.  Active mobile robot localization by entropy minimization , 1997, Proceedings Second EUROMICRO Workshop on Advanced Mobile Robots.

[8]  Martin Rolfs,et al.  Attention in Active Vision: A Perspective on Perceptual Continuity Across Saccades , 2015, Perception.

[9]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[10]  Reinhard Klette,et al.  Computer Vision for Driver Assistance , 2017, Computational Imaging and Vision.

[11]  Alec Radford,et al.  Proximal Policy Optimization Algorithms , 2017, ArXiv.

[12]  Michael Kruse,et al.  Real-Time Active Vision by Entropy Minimization Applied to Localization , 2010, RoboCup.

[13]  Shengyong Chen,et al.  Active vision in robotic systems: A survey of recent developments , 2011, Int. J. Robotics Res..

[14]  Fuchun Sun,et al.  Active Object Detection With Multistep Action Prediction Using Deep Q-Network , 2019, IEEE Transactions on Industrial Informatics.

[15]  R. Sabzevari,et al.  OBJECT DETECTION AND LOCALIZATION SYSTEM BASED ON NEURAL NETWORKS FOR ROBO-PONG , 2008, 2008 5th International Symposium on Mechatronics and Its Applications.

[16]  Arpit Agarwal,et al.  Reinforcement Learning of Active Vision for Manipulating Objects under Occlusions , 2018, CoRL.

[17]  David Silver,et al.  Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.

[18]  Nicholas J. Butko,et al.  Active perception , 2010 .

[19]  John H. Reynolds,et al.  Active Vision in Marmosets: A Model System for Visual Neuroscience , 2014, The Journal of Neuroscience.

[20]  Davide Scaramuzza,et al.  Aggressive quadrotor flight through narrow gaps with onboard sensing and computing using active vision , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[21]  Jana Kosecka,et al.  A dataset for developing and benchmarking active vision , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[22]  Olivier Michel,et al.  Cyberbotics Ltd. Webots™: Professional Mobile Robot Simulation , 2004 .

[23]  Anthony J. Hornof,et al.  Towards accurate and practical predictive models of active-vision-based visual search , 2014, CHI.

[24]  Tom Schaul,et al.  Prioritized Experience Replay , 2015, ICLR.

[25]  Meisam Teimouri,et al.  A Real-Time Ball Detection Approach Using Convolutional Neural Networks , 2019, RoboCup.

[26]  Javier Ruiz-del-Solar,et al.  A Dynamic and Efficient Active Vision System for Humanoid Soccer Robots , 2015, RoboCup.