Context-aware robot navigation using interactively built semantic maps

Abstract We discuss the process of building semantic maps, how to interactively label entities in them, and how to use them to enable context-aware navigation behaviors in human environments. We utilize planar surfaces, such as walls and tables, and static objects, such as door signs, as features for our semantic mapping approach. Users can interactively annotate these features by having the robot follow him/her, entering the label through a mobile app, and performing a pointing gesture toward the landmark of interest. Our gesture-based approach can reliably estimate which object is being pointed at, and detect ambiguous gestures with probabilistic modeling. Our person following method attempts to maximize future utility by search for future actions assuming constant velocity model for the human. We describe a method to extract metric goals from a semantic map landmark and to plan a human aware path that takes into account the personal spaces of people. Finally, we demonstrate context awareness for person following in two scenarios: interactive labeling and door passing.We believe that future navigation approaches and service robotics applications can be made more effective by further exploiting the structure of human environments.

[1]  Danica Kragic,et al.  Object detection and mapping for service robot tasks , 2007, Robotica.

[2]  David Lee,et al.  The influence of subjects' personality traits on personal spatial zones in a human-robot interaction experiment , 2005, ROMAN 2005. IEEE International Workshop on Robot and Human Interactive Communication, 2005..

[3]  R. Simmons,et al.  COMPANION: A Constraint-Optimizing Method for Person-Acceptable Navigation , 2009, RO-MAN 2009 - The 18th IEEE International Symposium on Robot and Human Interactive Communication.

[4]  Huosheng Hu,et al.  Multisensor-Based Human Detection and Tracking for Mobile Service Robots , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[5]  Daniele Nardi,et al.  Interactive Semantic Mapping: Experimental Evaluation , 2014, ISER.

[6]  T. Kanda,et al.  Social force model with explicit collision prediction , 2011 .

[7]  Henrik I. Christensen,et al.  Tracking for following and passing persons , 2005, 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[8]  H. Christensen,et al.  Did you Mean this Object?: Detecting Ambiguity in Pointing Gesture Targets , 2015 .

[9]  Wolfram Burgard,et al.  The dynamic window approach to collision avoidance , 1997, IEEE Robotics Autom. Mag..

[10]  Henrik I. Christensen,et al.  Topological Modelling for Human Augmented Mapping , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[11]  Patric Jensfelt,et al.  Human- and Situation-Aware People Following , 2007, RO-MAN 2007 - The 16th IEEE International Symposium on Robot and Human Interactive Communication.

[12]  Adrian Hilton,et al.  A survey of advances in vision-based human motion capture and analysis , 2006, Comput. Vis. Image Underst..

[13]  Rachid Alami,et al.  A Human Aware Mobile Robot Motion Planner , 2007, IEEE Transactions on Robotics.

[14]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[15]  Alejandro Bordallo,et al.  Counterfactual reasoning about intent for interactive navigation in dynamic environments , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[16]  Wolfram Burgard,et al.  Tracking multiple moving targets with a mobile robot using particle filters and statistical data association , 2001, Proceedings 2001 ICRA. IEEE International Conference on Robotics and Automation (Cat. No.01CH37164).

[17]  Takayuki Kanda,et al.  Destination Unknown: Walking Side-by-Side without Knowing the Goal , 2014, 2014 9th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[18]  Benjamin Kuipers,et al.  Towards a general theory of topological maps , 2004, Artif. Intell..

[19]  Siddhartha S. Srinivasa,et al.  Legibility and predictability of robot motion , 2013, 2013 8th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[20]  James W. Davis,et al.  The Recognition of Human Movement Using Temporal Templates , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  Luc Van Gool,et al.  Real-time 3D hand gesture interaction with a robot for understanding directions from humans , 2011, 2011 RO-MAN.

[22]  Matthew R. Walter,et al.  Understanding Natural Language Commands for Robotic Navigation and Mobile Manipulation , 2011, AAAI.

[23]  Henrik I. Christensen,et al.  Anticipatory robot path planning in human environments , 2016, 2016 25th IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN).

[24]  Herbert H. Clark,et al.  Grounding in communication , 1991, Perspectives on socially shared cognition.

[25]  Benjamin Kuipers,et al.  Autonomous person pacing and following with Model Predictive Equilibrium Point Control , 2013, 2013 IEEE International Conference on Robotics and Automation.

[26]  Akihisa Ohya,et al.  Intelligent Escort Robot Moving together with Human -Interaction in Accompanying Behavior- , 2002 .

[27]  Peter Cheeseman,et al.  On the Representation and Estimation of Spatial Uncertainty , 1986 .

[28]  Patric Jensfelt,et al.  Large-scale semantic mapping and reasoning with heterogeneous modalities , 2012, 2012 IEEE International Conference on Robotics and Automation.

[29]  Juan D. Tardós,et al.  Data association in stochastic mapping using the joint compatibility test , 2001, IEEE Trans. Robotics Autom..

[30]  Sonia Chernova,et al.  Mobile human-robot teaming with environmental tolerance , 2009, 2009 4th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[31]  Henrik I. Christensen,et al.  Interactive object modeling & labeling for service robots , 2013, 2013 8th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[32]  Roland Siegwart,et al.  Smooth and efficient obstacle avoidance for a tour guide robot , 2003, 2003 IEEE International Conference on Robotics and Automation (Cat. No.03CH37422).

[33]  Andreas Krause,et al.  Robot navigation in dense human crowds: Statistical models and experimental studies of human–robot cooperation , 2015, Int. J. Robotics Res..

[34]  Henrik I. Christensen,et al.  Clarification dialogues in human-augmented mapping , 2006, HRI '06.

[35]  Ian D. Reid,et al.  Article in Press Robotics and Autonomous Systems ( ) – Robotics and Autonomous Systems a Comparison of Loop Closing Techniques in Monocular Slam , 2022 .

[36]  Afshin Dehghan,et al.  Part-based multiple-person tracking with partial occlusion handling , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[37]  Wolfram Burgard,et al.  Supervised Learning of Places from Range Data using AdaBoost , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.

[38]  Cynthia Breazeal,et al.  Working with robots and objects: revisiting deictic reference for achieving spatial common ground , 2006, HRI '06.

[39]  Michael Milford,et al.  Meaningful maps with object-oriented semantic mapping , 2016, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[40]  Reid G. Simmons,et al.  Natural person-following behavior for social robots , 2007, 2007 2nd ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[41]  Joachim Hertzberg,et al.  Towards semantic maps for mobile robots , 2008, Robotics Auton. Syst..

[42]  Benjamin Kuipers,et al.  The Spatial Semantic Hierarchy , 2000, Artif. Intell..

[43]  HiltonAdrian,et al.  A survey of advances in vision-based human motion capture and analysis , 2006 .

[44]  Alessandro Saffiotti,et al.  Inferring robot goals from violations of semantic knowledge , 2013, Robotics Auton. Syst..

[45]  Andrew W. Fitzgibbon,et al.  Real-time human pose recognition in parts from single depth images , 2011, CVPR 2011.

[46]  Maren Bennewitz,et al.  Foresighted navigation through cluttered environments , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[47]  Takayuki Kanda,et al.  How do people walk side-by-side? — Using a computational model of human behavior for a social robot , 2012, 2012 7th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[48]  Dieter Fox,et al.  RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments , 2010, ISER.

[49]  Henrik I. Christensen,et al.  Autonomous person following for telepresence robots , 2013, 2013 IEEE International Conference on Robotics and Automation.

[50]  Philippe Bidaud,et al.  A framework for the design of person following behaviors for social mobile robots , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[51]  Alberto Elfes,et al.  Using occupancy grids for mobile robot perception and navigation , 1989, Computer.

[52]  Rachid Alami,et al.  Human-aware robot navigation: A survey , 2013, Robotics Auton. Syst..

[53]  Wolfram Burgard,et al.  Improved Techniques for Grid Mapping With Rao-Blackwellized Particle Filters , 2007, IEEE Transactions on Robotics.

[54]  Joelle Pineau,et al.  Person tracking and following with 2D laser scanners , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[55]  Takayuki Kanda,et al.  Simulation-Based Behavior Planning to Prevent Congestion of Pedestrians Around a Robot , 2015, IEEE Transactions on Robotics.

[56]  Hugh F. Durrant-Whyte,et al.  Simultaneous map building and localization for an autonomous mobile robot , 1991, Proceedings IROS '91:IEEE/RSJ International Workshop on Intelligent Robots and Systems '91.

[57]  Álvaro Sánchez Miralles,et al.  Topological simultaneous localization and mapping: a survey , 2013, Robotica.

[58]  Silvio Savarese,et al.  Social LSTM: Human Trajectory Prediction in Crowded Spaces , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[59]  Geoffrey A. Hollinger,et al.  Search and pursuit-evasion in mobile robotics , 2011, Auton. Robots.

[60]  Verena Rieser,et al.  A review of spatial reasoning and interaction for real-world robotics , 2017, Adv. Robotics.

[61]  Trevor Darrell,et al.  Integrated Person Tracking Using Stereo, Color, and Pattern Detection , 2000, International Journal of Computer Vision.

[62]  Jannik Fritsch,et al.  Interacting with a mobile robot: Evaluating gestural object references , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[63]  Yoshua Bengio,et al.  Convolutional networks for images, speech, and time series , 1998 .

[64]  Henrik I. Christensen,et al.  Human-robot embodied interaction in hallway settings: a pilot user study , 2005, ROMAN 2005. IEEE International Workshop on Robot and Human Interactive Communication, 2005..

[65]  Marc Hanheide,et al.  Real-time multisensor people tracking for human-robot spatial interaction , 2015 .

[66]  Alexandra Kirsch,et al.  Legible robot navigation in the proximity of moving humans , 2012, 2012 IEEE Workshop on Advanced Robotics and its Social Impacts (ARSO).

[67]  Alessio Del Bue,et al.  Social interaction discovery by statistical analysis of F-formations , 2011, BMVC.

[68]  Maja J. Mataric,et al.  Using semantic fields to model dynamic spatial relations in a robot architecture for natural language instruction of service robots , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[69]  Xiaogang Wang,et al.  Deep Learning Strong Parts for Pedestrian Detection , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[70]  Wolfram Burgard,et al.  Using Boosted Features for the Detection of People in 2D Range Data , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[71]  Wolfram Burgard,et al.  An Integrated Robotic System for Spatial Understanding and Situated Interaction in Indoor Environments , 2007, AAAI.

[72]  Lars Karlsson,et al.  Inferring Context and Goals for Online Human-Aware Planning , 2015, 2015 IEEE 27th International Conference on Tools with Artificial Intelligence (ICTAI).

[73]  William D. Smart,et al.  Towards more efficient navigation for robots and humans , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[74]  A. Kendon Conducting Interaction: Patterns of Behavior in Focused Encounters , 1990 .

[75]  António E. Ruano,et al.  Fast Line, Arc/Circle and Leg Detection from Laser Scan Data in a Player Driver , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.

[76]  Antonios Gasteratos,et al.  Semantic mapping for mobile robotics tasks: A survey , 2015, Robotics Auton. Syst..

[77]  Zhen Liu,et al.  Multi Robot Object-Based SLAM , 2016, ISER.

[78]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[79]  Dana Kulic,et al.  Learning User Preferences in Robot Motion Planning Through Interaction , 2018, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[80]  Henrik I. Christensen,et al.  The M-Space Feature Representation for SLAM , 2007, IEEE Transactions on Robotics.

[81]  Kai Oliver Arras,et al.  Socially-aware robot navigation: A learning approach , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[82]  Christian Vollmer,et al.  Learning to navigate through crowded environments , 2010, 2010 IEEE International Conference on Robotics and Automation.

[83]  Cordelia Schmid,et al.  Human Detection Based on a Probabilistic Assembly of Robust Part Detectors , 2004, ECCV.

[84]  Helbing,et al.  Social force model for pedestrian dynamics. , 1995, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[85]  Frank Dellaert,et al.  Square Root SAM: Simultaneous Localization and Mapping via Square Root Information Smoothing , 2006, Int. J. Robotics Res..

[86]  Antonios Gasteratos,et al.  Recent trends in social aware robot navigation: A survey , 2017, Robotics Auton. Syst..

[87]  Qi Wu,et al.  Vision-and-Language Navigation: Interpreting Visually-Grounded Navigation Instructions in Real Environments , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.