Humanoid Robot Detection Using Deep Learning: A Speed-Accuracy Tradeoff

Recent advances in computer vision have made the detection of landmarks on the soccer field easier for teams. However, the detection of other robots is also a critical capability that has not garnered much attention in the RoboCup community so far. This problem is well represented in different RoboCup Soccer and Rescue Robot Leagues. In this paper, we compare several two-stage detection systems based on various Convolutional Neural Networks (CNN) and highlight their speed-accuracy trade off. The approach performs edge based image segmentation in order to reduce the search space and then a CNN validates the detection in the second stage. We use images of different humanoid robots to train and test three different CNN architectures. A part of these images was gathered by our team and will be publicly available. Our experiments demonstrate the strong adaptability of deeper CNNs. These models, trained on a limited set of robots, are able to successfully distinguish an unseen kind of humanoid robot from non-robot regions.

[1]  Forrest N. Iandola,et al.  SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <1MB model size , 2016, ArXiv.

[2]  Sven Behnke,et al.  Humanoid Robots in Soccer: Robots Versus Humans in RoboCup 2050 , 2015, IEEE Robotics & Automation Magazine.

[3]  B. S. Manjunath,et al.  EdgeFlow: a technique for boundary detection and image segmentation , 2000, IEEE Trans. Image Process..

[4]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[5]  Javier Ruiz-del-Solar,et al.  Detection of AIBO and Humanoid Robots Using Cascades of Boosted Classifiers , 2008, RoboCup.

[6]  Sven Behnke,et al.  RoboCup Humanoid League Rule Developments 2002-2014 and Future Perspectives , 2014, RoboCup.

[7]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Takeshi Ohashi,et al.  Obstacle avoidance and path planning for humanoid robots using stereo vision , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[9]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[10]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[11]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[12]  Daniel P. Huttenlocher,et al.  Efficient Graph-Based Image Segmentation , 2004, International Journal of Computer Vision.

[13]  Sven Behnke,et al.  A Monocular Vision System for Playing Soccer in Low Color Information Environments , 2018, ArXiv.

[14]  Horst Bunke,et al.  Edge Detection in Range Images Based on Scan Line Approximation , 1999, Comput. Vis. Image Underst..

[15]  Daniele Nardi,et al.  A Deep Learning Approach for Object Recognition with NAO Soccer Robots , 2016, RoboCup.

[16]  Sven Behnke,et al.  Real-Time Visual Tracking and Identification for a Team of Homogeneous Humanoid Robots , 2016, RoboCup.

[17]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  Jacky Baltes,et al.  Inter-humanoid robot interaction with emphasis on detection: a comparison study – ADDENDUM , 2017, The Knowledge Engineering Review.

[19]  Forrest N. Iandola,et al.  SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <1MB model size , 2016, ArXiv.

[20]  Jianping Fan,et al.  Automatic image segmentation by integrating color-edge extraction and seeded region growing , 2001, IEEE Trans. Image Process..

[21]  Jacky Baltes,et al.  AUT-UofM Humanoid TeenSize Joint Team; A New Step Toward 2050's Humanoid League Long Term RoadMap , 2014, RiTA.