论文信息 - October 2018 PRISM : Pose Registration for Integrated Semantic Mapping

October 2018 PRISM : Pose Registration for Integrated Semantic Mapping

Many robotics applications involve navigating to positions specified in terms of their semantic significance. A robot operating in a hotel may need to deliver room service to a named room. In a hospital, it may need to deliver medication to a patient’s room. The Building-Wide Intelligence Project at UT Austin has been developing a fleet of autonomous mobile robots, called BWIBots, which perform tasks in the computer science department. Tasks include guiding a person, delivering a message, or bringing an object to a location such as an office, lecture hall, or classroom. The process of constructing a map that a robot can use for navigation has been simplified by modern SLAM algorithms. The attachment of semantics to map data, however, remains a tedious manual process of labeling locations in otherwise automatically generated maps. This paper introduces a system called PRISM to automate a step in this process by enabling a robot to localize door signs – a semantic markup intended to aid the human occupants of a building – and to annotate these locations in its map.

[1] Rahul Sukthankar,et al. Cognitive Mapping and Planning for Visual Navigation , 2017, International Journal of Computer Vision.

[2] Jake K. Aggarwal,et al. BWIBots: A platform for bridging the gap between AI and human–robot interaction research , 2017, Int. J. Robotics Res..

[3] Silvio Savarese,et al. 3D Semantic Parsing of Large-Scale Indoor Spaces , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4] Wenzhe Li,et al. Room segmentation: Survey, implementation, and analysis , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[5] Chang-Hun Kim,et al. RViz: a toolkit for real domain data visualization , 2015, Telecommunication Systems.

[6] John J. Leonard,et al. Monocular SLAM Supported Object Recognition , 2015, Robotics: Science and Systems.

[7] Roberto Cipolla,et al. PoseNet: A Convolutional Network for Real-Time 6-DOF Camera Relocalization , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[8] Oskar von Stryk,et al. Hector Open Source Modules for Autonomous Mapping and Navigation with Rescue Robots , 2013, RoboCup.

[9] Nico Blodow,et al. Autonomous semantic mapping for robots performing everyday manipulation tasks in kitchen environments , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[10] Andrew Y. Ng,et al. Autonomous sign reading for semantic mapping , 2011, 2011 IEEE International Conference on Robotics and Automation.

[11] R. Smith,et al. An Overview of the Tesseract OCR Engine , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[12] Jean-Arcady Meyer,et al. Map-based navigation in mobile robots: I. A review of localization strategies , 2003, Cognitive Systems Research.

[13] Bernhard P. Wrobel,et al. Multiple View Geometry in Computer Vision , 2001 .

[14] Zhengyou Zhang,et al. A Flexible New Technique for Camera Calibration , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[15] A. Fitzgibbon,et al. Bundle Adjustment - A Modern Synthesis , 1999, Workshop on Vision Algorithms.

[16] Keiichi Abe,et al. Topological structural analysis of digitized binary images by border following , 1985, Comput. Vis. Graph. Image Process..

[17] Urs Ramer,et al. An iterative procedure for the polygonal approximation of plane curves , 1972, Comput. Graph. Image Process..

[18] Y. Oshman,et al. Averaging Quaternions , 2007 .

[19] Christopher G. Harris,et al. A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[20] Vladimir I. Levenshtein,et al. Binary codes capable of correcting deletions, insertions, and reversals , 1965 .