Joint localization and target tracking with a monocular camera

Abstract Localization capabilities are necessary for autonomous robots that need to keep track of their position with respect to a surrounding environment. A pursuit robot is an autonomous robot that tracks and pursues a moving target, requiring accurate localization relative to the target’s position and obstacles in the local environment. Small unmanned ground vehicles (SUGVs) equipped with a monocular camera and wheel encoders could act as effective pursuit robots, but the noisy 2D target position and size estimates from the monocular camera will in turn lead to overly noisy 3D target pose estimates. One possible approach to relative localization for pursuit robots is, rather than simply tracking and estimating a relative robot–target position in each frame, joint localization, in which the purser and target are both localized with respect to a common reference frame. In this paper, we propose a novel method for joint localization of a pursuit robot and arbitrary target. The proposed method fuses the pursuit robot’s kinematics and the target’s dynamics in a joint state space model. We show that predicting and correcting pursuer and target trajectories simultaneously produces improved results compared to standard filters for estimating relative target trajectories in a 3D coordinate system. For visual tracking, we also introduce an adaptive histogram matching threshold for suspending tracking when the target is lost in a cluttered environment. When tracking is suspended, rather than traversing the entire image to search for a reappearance of the target, we only search the part of the image segmented by histogram backprojection and correctly reinitialize the tracker. The experimental results show that the joint localization method outperforms standard localization methods and that the visual tracker for pursuit robot can deal effectively with target occlusions.

[1]  Shin'ichi Yuta,et al.  Tracking and Following People and Robots in Crowded Environment by a Mobile Robot with SOKUIKI Sensor , 2008, DARS.

[2]  Armin B. Cremers,et al.  Adaptive real-time video-tracking for arbitrary objects , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[3]  Zhichao Chen,et al.  Person following with a mobile robot using binocular feature-based tracking , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[4]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[5]  Qiang Huang,et al.  Rapid and precise object detection based on color histograms and adaptive bandwidth mean shift , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[6]  Sebastian Thrun,et al.  FastSLAM 2.0: An Improved Particle Filtering Algorithm for Simultaneous Localization and Mapping that Provably Converges , 2003, IJCAI.

[7]  Atsushi Yamashita,et al.  HOG-Based Person Following and Autonomous Returning Using Generated Map by Mobile Robot Equipped with Camera and Laser Range Finder , 2012, IAS.

[8]  Gary R. Bradski,et al.  Real time face and object tracking as a component of a perceptual user interface , 1998, Proceedings Fourth IEEE Workshop on Applications of Computer Vision. WACV'98 (Cat. No.98EX201).

[9]  T. Poggio,et al.  A Contour-Based Moving Object Detection and Tracking , 2005, 2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance.

[10]  Dorin Comaniciu,et al.  Kernel-Based Object Tracking , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Hiroshi Mizoguchi,et al.  Development of a Person Following Robot with Vision Based Target Detection , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[12]  Gary R. Bradski,et al.  ORB: An efficient alternative to SIFT or SURF , 2011, 2011 International Conference on Computer Vision.

[13]  Silvio Savarese,et al.  Detecting and tracking people using an RGB-D camera via multiple detector fusion , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[14]  Gary R. Bradski,et al.  Learning OpenCV 3: Computer Vision in C++ with the OpenCV Library , 2016 .

[15]  Aristidis Likas,et al.  Visual Tracking by Adaptive Kalman Filtering and Mean Shift , 2010, SETN.

[16]  Woojin Chung,et al.  Human tracking of a mobile robot with an onboard LRF (Laser Range Finder) using human walking motion analysis , 2011, 2011 8th International Conference on Ubiquitous Robots and Ambient Intelligence (URAI).

[17]  Brian Yamauchi,et al.  PackBot: a versatile platform for military robotics , 2004, SPIE Defense + Commercial Sensing.

[18]  Luc Van Gool,et al.  SURF: Speeded Up Robust Features , 2006, ECCV.

[19]  Steven M. LaValle,et al.  Planning algorithms , 2006 .

[20]  Natasha Gelfand,et al.  SURFTrac: Efficient tracking and continuous object recognition using local feature descriptors , 2009, CVPR.

[21]  Dorin Comaniciu,et al.  Mean shift and optimal prediction for efficient object tracking , 2000, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101).

[22]  Olivier Stasse,et al.  MonoSLAM: Real-Time Single Camera SLAM , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Jesse S. Jin,et al.  Tracking Using CamShift Algorithm and Multiple Quantized Feature Spaces , 2004, VIP.

[24]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[25]  Dorin Comaniciu,et al.  Real-time tracking of non-rigid objects using mean shift , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[26]  Tieniu Tan,et al.  Visual Vehicle Tracking Using An Improved EKF , 2002 .

[27]  Gary R. Bradski,et al.  Learning OpenCV - computer vision with the OpenCV library: software that sees , 2008 .

[28]  Andrew Calway,et al.  Real-Time Camera Tracking Using a Particle Filter , 2005, BMVC.

[29]  Paul Schiopu,et al.  Remote control of a small unmanned ground vehicle (SUGV) , 2010, Advanced Topics in Optoelectronics, Microelectronics, and Nanotechnologies.

[30]  Shimin Feng,et al.  Human Tracking Based on Mean Shift and Kalman Filter , 2009, 2009 International Conference on Artificial Intelligence and Computational Intelligence.

[31]  Matthew N. Dailey,et al.  Fast target redetection for CAMSHIFT using back-projection and histogram matching , 2014, 2014 International Conference on Computer Vision Theory and Applications (VISAPP).

[32]  Xiaopeng Chen,et al.  Adaptive Bandwidth Mean Shift Object Detection , 2008, 2008 IEEE Conference on Robotics, Automation and Mechatronics.

[33]  Matthew N. Dailey,et al.  Model driven state estimation for target pursuit , 2012, 2012 12th International Conference on Control Automation Robotics & Vision (ICARCV).

[34]  Sridha Sridharan,et al.  An adaptive optical flow technique for person tracking systems , 2007, Pattern Recognit. Lett..

[35]  Huiyu Zhou,et al.  Object tracking using SIFT features and mean shift , 2009, Comput. Vis. Image Underst..

[36]  Vincent Lepetit,et al.  BRIEF: Binary Robust Independent Elementary Features , 2010, ECCV.

[37]  Fatih Murat Porikli,et al.  Integral histogram: a fast way to extract histograms in Cartesian spaces , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[38]  Gaurav S. Sukhatme,et al.  People tracking and following with mobile robot using an omnidirectional camera and a laser , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[39]  Andrew Hogue,et al.  Histogram-based search: A comparative study , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[40]  Matthew N. Dailey,et al.  Joint Localization of Pursuit Quadcopters and Target Using Monocular Cues , 2015, J. Intell. Robotic Syst..

[41]  Ben J. A. Kröse,et al.  Keeping Track of Humans: Have I Seen This Person Before? , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.

[42]  Oliver Bimber,et al.  Fast and robust CAMShift tracking , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[43]  Matteo Munaro,et al.  Tracking people within groups with RGB-D data , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[44]  M. Nitulescu,et al.  Theoretical aspects in wheeled mobile robot control , 2008, 2008 IEEE International Conference on Automation, Quality and Testing, Robotics.

[45]  Patrick Hébert,et al.  Median Filtering in Constant Time , 2007, IEEE Transactions on Image Processing.