Toward Robot Perception through Omnidirectional Vision

Vision is an extraordinarily powerful sense. The ability to perceive the environment allows for movement to be regulated by the world. Humans do this effortlessly but still lack the understanding of how perception works. In the case of visual perception, many researchers, from psychologists to engineers, are working on this complex problem. Our approach is to build artificial visual systems to examine how a robot can use images, which convey only 2D information, in a robust manner to drive its actions in 3D space. The perceptual capabilities we developed allowed our robot to undertake everyday navigation tasks, such as “go to the fourth office in the second corridor”. A critical component of any perceptual system, human or artificial, is the sensing modality used to obtain information about the environment. In the biological world, for example, one striking observation is the diversity of “ocular” geometries. The majority of insects and arthropods benefit from a wide field of view and their eyes have a spacevariant resolution. To some extent, the perceptual capabilities of these animals can be explained by their specially adapted eye geometries. Similarly, in this work, we explore the advantages of having large fields of view by using an omnidirectional camera with a 360◦ azimuthal field of view. Once images have been acquired by the omnidirectional camera, a question arises as to what to do with them. Should they form an internal representation of the world? Over time, can they provide intrinsic information about the world so as no representation is required? These fundamental questions have long been addressed by the computer vision community and go to the heart of our current understanding of visual perception. Before going on to detail our approach, a brief overview of this understanding will be provided.

[1]  J. Okamoto,et al.  Constant resolution omnidirectional cameras , 2002, Proceedings of the IEEE Workshop on Omnidirectional Vision 2002. Held in conjunction with ECCV'02.

[2]  Hugh F. Durrant-Whyte,et al.  Mobile robot localization by tracking geometric beacons , 1991, IEEE Trans. Robotics Autom..

[3]  Erkki Oja,et al.  Subspace methods of pattern recognition , 1983 .

[4]  O. Cuisenaire Distance transformations: fast algorithms and applications to medical image processing , 1999 .

[5]  M. Hebert,et al.  Omni-directional structure from motion , 2000, Proceedings IEEE Workshop on Omnidirectional Vision (Cat. No.PR00704).

[6]  S. Nayar,et al.  Catadioptric Image Formation , 1997 .

[7]  José Gaspar,et al.  Visual Path Following with a Catadioptric Panoramic Camera , 1999 .

[8]  Don Kimber,et al.  FlyCam: practical panoramic video and automatic camera control , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[9]  J. Gaspar,et al.  Omni-directional vision for robot navigation , 2000, Proceedings IEEE Workshop on Omnidirectional Vision (Cat. No.PR00704).

[10]  Saburo Tsuji,et al.  Panoramic representation for route recognition by a mobile robot , 1992, International Journal of Computer Vision.

[11]  Gunilla Borgefors,et al.  Hierarchical Chamfer Matching: A Parametric Edge Matching Algorithm , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Yasushi Yagi,et al.  Omnidirectional Sensing and Its Applications , 1999 .

[13]  Vishvjit S. Nalwa,et al.  A True Omni-Directional Viewer † , 1996 .

[14]  Kenro Miyamoto,et al.  Fish Eye Lens , 1964 .

[15]  Wolfgang Straßer,et al.  Omnidirectional 3D Modeling on a Mobile Robot using Graph Cuts , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.

[16]  Kostas Daniilidis,et al.  A Unifying Theory for Central Panoramic Systems and Practical Applications , 2000, ECCV.

[17]  W. Brent Seales,et al.  Immersive teleconferencing: a new algorithm to generate seamless panoramic video imagery , 1999, MULTIMEDIA '99.

[18]  José Santos-Victor,et al.  Vision-based navigation and environmental representations with an omnidirectional camera , 2000, IEEE Trans. Robotics Autom..

[19]  Richard Szeliski,et al.  3-D Scene Data Recovery Using Omnidirectional Multibaseline Stereo , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[20]  José Santos-Victor,et al.  Algebraic Aspects of Reconstruction of Structured Scenes from One or More Views , 2001 .

[21]  Gaurav S. Sukhatme,et al.  Putting the 'I' in 'team': an ego-centric approach to cooperative localization , 2003, 2003 IEEE International Conference on Robotics and Automation (Cat. No.03CH37422).

[22]  S. P. Mudur,et al.  Three-dimensional computer vision: a geometric viewpoint , 1993 .

[23]  Andrew J. Davison,et al.  Real-time simultaneous localisation and mapping with a single camera , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[24]  Mohan M. Trivedi,et al.  Real-time target localization and tracking by N-ocular stereo , 2000, Proceedings IEEE Workshop on Omnidirectional Vision (Cat. No.PR00704).

[25]  Sebastian Thrun,et al.  Integrating Grid-Based and Topological Maps for Mobile Robot Navigation , 1996, AAAI/IAAI, Vol. 2.

[26]  Tomás Svoboda,et al.  Epipolar Geometry of Panoramic Cameras , 1998, ECCV.

[27]  Alfred M. Bruckstein,et al.  Omniview cameras with curved surface mirrors , 2000, Proceedings IEEE Workshop on Omnidirectional Vision (Cat. No.PR00704).

[28]  Richard Szeliski,et al.  3-D Scene Data Recovery Using Omnidirectional Multibaseline Stereo , 2004, International Journal of Computer Vision.

[29]  Darius Burschka,et al.  Optimal landmark configuration for vision-based control of mobile robots , 2003, 2003 IEEE International Conference on Robotics and Automation (Cat. No.03CH37422).

[30]  John B. Moore,et al.  Resolution invariant surfaces for panoramic vision systems , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[31]  Niall Winters,et al.  A holistic approach to mobile robot navigation using omnidirectional vision , 2002 .

[32]  Philip David,et al.  Simultaneous pose and correspondence determination using line features , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[33]  Rodney A. Brooks,et al.  Visual map making for a mobile robot , 1985, Proceedings. 1985 IEEE International Conference on Robotics and Automation.

[34]  Václav Hlavác,et al.  Zero Phase Representation of Panoramic Images for Image Vased Localization , 1999, CAIP.

[35]  J. Gaspar,et al.  Interactive Reconstruction from an Omnidirectional Image , 2001 .

[36]  Jitendra Malik,et al.  Modeling and Rendering Architecture from Photographs: A hybrid geometry- and image-based approach , 1996, SIGGRAPH.

[37]  Robert T. Collins,et al.  Vanishing point calculation as a statistical inference on the unit sphere , 1990, [1990] Proceedings Third International Conference on Computer Vision.

[38]  José Santos-Victor,et al.  Omni-directional Visual Navigation , 1999 .

[39]  Dariu Gavrila,et al.  Real-time object detection for "smart" vehicles , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[40]  Minas E. Spetsakis,et al.  Structure from motion using line correspondences , 1990, International Journal of Computer Vision.

[41]  Avinash C. Kak,et al.  Fast Vision-guided Mobile Robot Navigation Using Model-based Reasoning And Prediction Of Uncertainties , 1992, Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems.

[42]  José Santos-Victor,et al.  Maximum Likelihood 3D Reconstruction from One or More Images under Geometric Constraints , 2002, BMVC.

[43]  Avinash C. Kak,et al.  Fast vision-guided mobile robot navigation using model-based reasoning and prediction of uncertainties , 1992, CVGIP Image Underst..

[44]  Avinash C. Kak,et al.  Vision for Mobile Robot Navigation: A Survey , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[45]  Robert Pless,et al.  Using many cameras as one , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[46]  Daniel P. Huttenlocher,et al.  Comparing Images Using the Hausdorff Distance , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[47]  Tomás Pajdla,et al.  Structure from motion with wide circular field of view cameras , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[48]  Gerd Hirzinger,et al.  Real-time visual tracking of 3D objects with dynamic handling of occlusion , 1997, Proceedings of International Conference on Robotics and Automation.

[49]  Hanspeter A. Mallot,et al.  The Quality of Catadioptric Imaging ? Application to Omnidirectional Stereo , 2004, ECCV.

[50]  Yasushi Yagi,et al.  Map-based navigation for a mobile robot with omnidirectional image sensor COPIS , 1995, IEEE Trans. Robotics Autom..

[51]  Carlo Tomasi,et al.  Good features to track , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[52]  Geb Thomas Real-time panospheric image dewarping and presentation for remote mobile robot control , 2003, Adv. Robotics.

[53]  José Santos-Victor,et al.  Algebraic Aspects of Reconstruction of 3D Scenes from One or More Views , 2001, BMVC.

[54]  Mark Fiala Structure from motion using SIFT features and the PH transform with panoramic imagery , 2005, The 2nd Canadian Conference on Computer and Robot Vision (CRV'05).

[55]  Paolo Pirjanian,et al.  The vSLAM Algorithm for Robust Localization and Mapping , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.

[56]  Shree K. Nayar,et al.  Catadioptric omnidirectional camera , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[57]  S. Wehner,et al.  Insect navigation: use of maps or Ariadne's thread ? , 1990 .

[58]  Jake K. Aggarwal,et al.  Mobile robot self-location using model-image feature correspondence , 1996, IEEE Trans. Robotics Autom..

[59]  P. Sturm A method for 3D reconstruction of piecewise planar objects from single panoramic images , 2000, Proceedings IEEE Workshop on Omnidirectional Vision (Cat. No.PR00704).

[60]  Clark F. Olson,et al.  View-Based Recognition Using an Eigenspace Approximation to the Hausdorff Measure , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[61]  Kostas Daniilidis,et al.  Catadioptric Projective Geometry , 2001, International Journal of Computer Vision.

[62]  Rodney A. Brooks,et al.  A Robust Layered Control Syste For A Mobile Robot , 2022 .

[63]  Shree K. Nayar,et al.  Generation of Perspective and Panoramic Video from Omnidirectional Video , 1997 .

[64]  Shree K. Nayar,et al.  A Theory of Single-Viewpoint Catadioptric Image Formation , 1999, International Journal of Computer Vision.

[65]  Shree K. Nayar,et al.  Folded catadioptric cameras , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[66]  S. Derrien,et al.  Approximating a single viewpoint in panoramic imaging devices , 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065).

[67]  Peter F. Sturm,et al.  A Generic Concept for Camera Calibration , 2004, ECCV.

[68]  Ehud Rivlin,et al.  Landmark Selection for Task-Oriented Navigation , 2006, IEEE Transactions on Robotics.

[69]  M. Srinivasan,et al.  Reflective surfaces for panoramic imaging. , 1997, Applied optics.

[70]  Marc Pollefeys,et al.  Towards calibrating a pan-tilt-zoom camera network , 2004 .

[71]  José Santos-Victor,et al.  Cooperative localization by fusing vision-based bearing measurements and motion , 2005, 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[72]  Ruzena Bajcsy,et al.  Catadioptric sensors that approximate wide-angle perspective projections , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[73]  Margrit Betke,et al.  Mobile robot localization using landmarks , 1997, IEEE Trans. Robotics Autom..

[74]  Hiroshi Murase,et al.  Visual learning and recognition of 3-d objects from appearance , 2005, International Journal of Computer Vision.

[75]  Shree K. Nayar,et al.  A theory of catadioptric image formation , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[76]  Yasushi Yagi,et al.  Obstacle detection with omnidirectional image sensor HyperOmni Vision , 1995, Proceedings of 1995 IEEE International Conference on Robotics and Automation.

[77]  William Rucklidge,et al.  Efficient Visual Recognition Using the Hausdorff Distance , 1996, Lecture Notes in Computer Science.

[78]  Ernest L. Hall,et al.  Dynamic Omnidirectional Vision For Mobile Robots , 1985, Other Conferences.

[79]  Marc Pollefeys,et al.  Synchronization and calibration of camera networks from silhouettes , 2004, ICPR 2004.