Maps , Places , and Worlds for Robots

Vision is a powerful sense that permits a robot to look around itself and gather information both about the immediate present and the near future. The future arrives through the more distant physical space through which the robot can move, and the possible actions and events that may arise. To know the future a robot needs to parse the world with the aid of its models and experience. Lasers and other active sensors have proven their ability to provide accurate geometric information. But context and the meaning of the space surrounding the robot, the objects and the actions they permit are only accessible with the more complete sensory input of vision. Vision as a sensor is computationally demanding, but resources have improved to make it practical. Moreover there has been a convergence of the interests of roboticists and vision scientists – both want to explore and act in the world. Many of us have accepted that we must learn the patterns of data using machine learning, but we must also integrate categorical descriptions of our world, prototypical information that no individual robot is yet capable of learning. Vision provides the anchoring for concepts. I will discuss recent advances and trends linking vision and robotics through spatial descriptions and the connections with objects, actions, and meaning.

[1]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[2]  Danica Kragic,et al.  Integrating Active Mobile Robot Object Recognition and SLAM in Natural Environments , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[3]  Henry A. Kautz,et al.  Sensor-Based Understanding of Daily Life via Large-Scale Use of Common Sense , 2006, AAAI.

[4]  Michael Beetz,et al.  Designing and Implementing a Plan Library for a Simulated Household Robot , 2006, AAAI 2006.

[5]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[6]  Christopher M. Brown,et al.  Control of selective perception using bayes nets and decision theory , 1994, International Journal of Computer Vision.

[7]  James H. Elder,et al.  Pre-Attentive and Attentive Detection of Humans in Wide-Field Scenes , 2007, International Journal of Computer Vision.

[8]  John K. Tsotsos,et al.  Modeling Visual Attention via Selective Tuning , 1995, Artif. Intell..

[9]  Benjamin Kuipers,et al.  The Spatial Semantic Hierarchy , 2000, Artif. Intell..

[10]  Alberto Elfes,et al.  Using occupancy grids for mobile robot perception and navigation , 1989, Computer.

[11]  Pierre Baldi,et al.  A principled approach to detecting surprising events in video , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[12]  Jesse Hoey,et al.  Waiting with Jose, a vision-based mobile robot , 2002, Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292).

[13]  Andrew Zisserman,et al.  Object Level Grouping for Video Shots , 2004, International Journal of Computer Vision.

[14]  Aaron Sloman,et al.  Long Term Requirements for Cognitive Robotics , 2006 .

[15]  David M. Mark,et al.  Ontology and Geographic Objects: An Empirical Study of Cognitive Categorization , 1999, COSIT.

[16]  Jamie Shotton,et al.  The Layout Consistent Random Field for Recognizing and Segmenting Partially Occluded Objects , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[17]  Matthai Philipose,et al.  Building Reliable Activity Models Using Hierarchical Shrinkage and Mined Ontology , 2006, Pervasive.

[18]  Jesse Hoey,et al.  Value-Directed Human Behavior Analysis from Video Using Partially Observable Markov Decision Processes , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Ben J. A. Kröse,et al.  From sensors to human spatial concepts , 2007, Robotics Auton. Syst..

[20]  David Poole,et al.  Qualitative Probabilistic Matching with Hierarchical Descriptions , 2004, KR.

[21]  Roland Siegwart,et al.  Cognitive maps for mobile robots - an object based approach , 2007, Robotics Auton. Syst..

[22]  Jesse Hoey,et al.  Visual Capabilities in an Interactive Autonomous Robot , 2006, Cognitive Vision Systems.

[23]  Jiri Matas,et al.  Robust wide-baseline stereo from maximally stable extremal regions , 2004, Image Vis. Comput..

[24]  James J. Little,et al.  Autonomous vision-based exploration and mapping using hybrid maps and Rao-Blackwellised particle filters , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[25]  Pietro Perona,et al.  Object class recognition by unsupervised scale-invariant learning , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[26]  Jesse Hoey,et al.  A planning system based on Markov decision processes to guide people with dementia through activities of daily living , 2006, IEEE Transactions on Information Technology in Biomedicine.

[27]  B. Gates A robot in every home. , 2007, Scientific American.

[28]  Antonio Torralba,et al.  Contextual Models for Object Detection Using Boosted Random Fields , 2004, NIPS.

[29]  KochChristof,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 1998 .

[30]  Gregory Dudek,et al.  Simultaneous planning, localization, and mapping in a camera sensor network , 2006, Robotics Auton. Syst..

[31]  Luc Van Gool,et al.  Simultaneous Object Recognition and Segmentation from Single or Multiple Model Views , 2006, International Journal of Computer Vision.

[32]  Hans-Peter Seidel,et al.  Image-based reconstruction of spatial appearance and geometric detail , 2003, TOGS.

[33]  Danica Kragic,et al.  Vision for robotic object manipulation in domestic settings , 2005, Robotics Auton. Syst..

[34]  Cordelia Schmid,et al.  A performance evaluation of local descriptors , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  Wolfram Burgard,et al.  Monte Carlo Localization with Mixture Proposal Distribution , 2000, AAAI/IAAI.

[36]  Antonio Torralba,et al.  Context-based vision system for place and object recognition , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[37]  Andrew J. Davison,et al.  Real-time simultaneous localisation and mapping with a single camera , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.