The UBC Semantic Robot Vision System

This abstract outlines the algorithms and robot hardware used in the UBC robot competing in the Semantic Robot Vision Challenge (SRVC), held at the AAAI’07 conference in Vancouver, Canada. Successfully completing the SRVC involves smooth integration of tasks such as data acquisition, training, obstacle avoidance, visual search, and object recognition. Given that these tasks span several research disciplines, successful integration is a formidable task. The value of working on these problems jointly is that assumptions built into an isolated method will be exposed when it is integrated, thus highlighting where further research is required. In addition, this will focus research on robots that can navigate safely and identify objects in their environment. Our approach is decomposed into five primary modules, each of which relies on the success of other modules, avoiding some of the unrealistic assumptions that are sometimes made when the tasks are tackled independently. The five primary modules are:

[1]  Tal Arbel,et al.  Efficient Discriminant Viewpoint Selection for Active Bayesian Recognition , 2006, International Journal of Computer Vision.

[2]  Ronald A. Rensink The Dynamic Representation of Scenes , 2000 .

[3]  Brian Yamauchi,et al.  A frontier-based approach for autonomous exploration , 1997, Proceedings 1997 IEEE International Symposium on Computational Intelligence in Robotics and Automation CIRA'97. 'Towards New Computational Principles for Robotics and Automation'.

[4]  Wolfram Burgard,et al.  Improved Techniques for Grid Mapping With Rao-Blackwellized Particle Filters , 2007, IEEE Transactions on Robotics.

[5]  Christof Koch,et al.  Modeling attention to salient proto-objects , 2006, Neural Networks.

[6]  James J. Little,et al.  Autonomous vision-based exploration and mapping using hybrid maps and Rao-Blackwellised particle filters , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[7]  Danica Kragic,et al.  Strategies for Object Manipulation using Foveal and Peripheral Vision , 2006, Fourth IEEE International Conference on Computer Vision Systems (ICVS'06).

[8]  James J. Little,et al.  Curious George: An attentive semantic robot , 2008, Robotics Auton. Syst..

[9]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[10]  T. Southey,et al.  Object Discovery through Motion, Appearance and Shape , 2006 .

[11]  Cordelia Schmid,et al.  Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study , 2006, 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06).

[12]  Per-Erik Forssén Learning Saccadic Gaze Control via Motion Prediciton , 2007, Fourth Canadian Conference on Computer and Robot Vision (CRV '07).

[13]  Silvio Savarese,et al.  3D generic object categorization, localization and pose estimation , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[14]  Hans P. Moravec,et al.  High resolution maps from wide angle sonar , 1985, Proceedings. 1985 IEEE International Conference on Robotics and Automation.

[15]  Liqing Zhang,et al.  Saliency Detection: A Spectral Residual Approach , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Andrew Zisserman,et al.  An Exemplar Model for Learning Object Classes , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  James J. Little,et al.  Informed visual search: Combining attention and object recognition , 2008, 2008 IEEE International Conference on Robotics and Automation.

[18]  David A. McAllester,et al.  A discriminatively trained, multiscale, deformable part model , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Daniel P. Huttenlocher,et al.  Efficient Graph-Based Image Segmentation , 2004, International Journal of Computer Vision.

[20]  Jiri Matas,et al.  Robust wide-baseline stereo from maximally stable extremal regions , 2004, Image Vis. Comput..

[21]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[22]  Matthew Toews,et al.  Detecting and Localizing 3D Object Classes using Viewpoint Invariant Reference Frames , 2007, 2007 IEEE 11th International Conference on Computer Vision.