论文信息 - Crowd-Robot Interaction: Crowd-Aware Robot Navigation With Attention-Based Deep Reinforcement Learning

Crowd-Robot Interaction: Crowd-Aware Robot Navigation With Attention-Based Deep Reinforcement Learning

Mobility in an effective and socially-compliant manner is an essential yet challenging task for robots operating in crowded spaces. Recent works have shown the power of deep reinforcement learning techniques to learn socially cooperative policies. However, their cooperation ability deteriorates as the crowd grows since they typically relax the problem as a one-way Human-Robot interaction problem. In this work, we want to go beyond first-order Human-Robot interaction and more explicitly model Crowd-Robot Interaction (CRI). We propose to (i) rethink pairwise interactions with a self-attention mechanism, and (ii) jointly model Human-Robot as well as Human-Human interactions in the deep reinforcement learning framework. Our model captures the Human-Human interactions occurring in dense crowds that indirectly affects the robot’s anticipation capability. Our proposed attentive pooling mechanism learns the collective importance of neighboring humans with respect to their future states. Various experiments demonstrate that our model can anticipate human dynamics and navigate in crowds with time efficiency, outperforming state-of-the-art methods.

[1] Silvio Savarese,et al. CAR-Net: Clairvoyant Attentive Recurrent Network , 2017, ECCV.

[2] Silvio Savarese,et al. Learning to Predict Human Behavior in Crowded Scenes , 2017, Group and Crowd Behavior for Computer Vision.

[3] Silvio Savarese,et al. Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[4] Han Zhang,et al. Self-Attention Generative Adversarial Networks , 2018, ICML.

[5] Pete Trautman,et al. Sparse interacting Gaussian processes: Efficiency and optimality theorems of autonomous crowd navigation , 2017, 2017 IEEE 56th Annual Conference on Decision and Control (CDC).

[6] Yang Liu,et al. Learning Natural Language Inference using Bidirectional LSTM model and Inner-Attention , 2016, ArXiv.

[7] Wolfram Burgard,et al. The dynamic window approach to collision avoidance , 1997, IEEE Robotics Autom. Mag..

[8] Yoram Koren,et al. Real-time obstacle avoidance for fast mobile robots in cluttered environments , 1990, Proceedings., IEEE International Conference on Robotics and Automation.

[9] Bowen Zhou,et al. A Structured Self-attentive Sentence Embedding , 2017, ICLR.

[10] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.

[11] Jonathan P. How,et al. Probabilistically safe motion planning to avoid dynamic obstacles with uncertain motion patterns , 2013, Auton. Robots.

[12] Jonathan P. How,et al. Decentralized non-communicating multiagent collision avoidance with deep reinforcement learning , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[13] Jean Oh,et al. Social Attention: Modeling Attention in Human Crowds , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[14] Rachid Alami,et al. Human-aware robot navigation: A survey , 2013, Robotics Auton. Syst..

[15] Holger Schwenk,et al. Supervised Learning of Universal Sentence Representations from Natural Language Inference Data , 2017, EMNLP.

[16] Dinesh Manocha,et al. Reciprocal Velocity Obstacles for real-time multi-agent navigation , 2008, 2008 IEEE International Conference on Robotics and Automation.

[17] Silvio Savarese,et al. SoPhie: An Attentive GAN for Predicting Paths Compliant to Social and Physical Constraints , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[18] Jonathan P. How,et al. Motion Planning Among Dynamic, Decision-Making Agents with Deep Reinforcement Learning , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[19] Andreas Krause,et al. Unfreezing the robot: Navigation in dense, interacting crowds , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[20] Jia Pan,et al. Deep-Learned Collision Avoidance Policy for Distributed Multiagent Navigation , 2016, IEEE Robotics and Automation Letters.

[21] Dinesh Manocha,et al. The Hybrid Reciprocal Velocity Obstacle , 2011, IEEE Transactions on Robotics.

[22] Gonzalo Ferrer,et al. Robot companion: A social-force based approach with human awareness-navigation in crowded environments , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[23] Helbing,et al. Social force model for pedestrian dynamics. , 1995, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[24] Wolfram Burgard,et al. Socially compliant mobile robot navigation via inverse reinforcement learning , 2016, Int. J. Robotics Res..

[25] Koren,et al. Real-Time Obstacle Avoidance for Fast Mobile Robots , 2022 .

[26] Hannes Sommer,et al. Predicting actions to act predictably: Cooperative partial motion planning with maximum entropy models , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[27] Illah R. Nourbakhsh,et al. A survey of socially interactive robots , 2003, Robotics Auton. Syst..

[28] Dinesh Manocha,et al. Reciprocal n-Body Collision Avoidance , 2011, ISRR.

[29] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.

[30] Luca Antiga,et al. Automatic differentiation in PyTorch , 2017 .

[31] Sridha Sridharan,et al. Soft + Hardwired Attention: An LSTM Framework for Human Trajectory Prediction and Abnormal Event Detection , 2017, Neural Networks.

[32] Virginie Lurkin,et al. Let Me Not Lie: Learning MultiNomial Logit , 2018, ArXiv.

[33] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.

[34] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[35] Yoram Koren,et al. The vector field histogram-fast obstacle avoidance for mobile robots , 1991, IEEE Trans. Robotics Autom..

[36] Silvio Savarese,et al. Learning Social Etiquette: Human Trajectory Understanding In Crowded Scenes , 2016, ECCV.

[37] Gonzalo Ferrer,et al. Robot social-aware navigation framework to accompany people walking side-by-side , 2017, Auton. Robots.

[38] Jonathan P. How,et al. Socially aware motion planning with deep reinforcement learning , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[39] An Xu,et al. Map-based Deep Imitation Learning for Obstacle Avoidance , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[40] Tobias Kretz,et al. Some Indications on How to Calibrate the Social Force Model of Pedestrian Dynamics , 2017, Transportation Research Record: Journal of the Transportation Research Board.

[41] Yedid Hoshen,et al. VAIN: Attentional Multi-agent Predictive Modeling , 2017, NIPS.

[42] Alexandre Alahi,et al. Collaborative Sampling in Generative Adversarial Networks , 2019, AAAI.

[43] Ming Liu,et al. Virtual-to-real deep reinforcement learning: Continuous control of mobile robots for mapless navigation , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[44] Andreas Krause,et al. Robot navigation in dense human crowds: the case for cooperation , 2013, 2013 IEEE International Conference on Robotics and Automation.

[45] Wolfram Burgard,et al. Learning Motion Patterns of People for Compliant Robot Motion , 2005, Int. J. Robotics Res..

[46] Stefan Becker,et al. An Evaluation of Trajectory Prediction Approaches and Notes on the TrajNet Benchmark , 2018, ArXiv.

[47] Dinesh Manocha,et al. Real-time navigation of independent agents using adaptive roadmaps , 2008, SIGGRAPH '08.

[48] Michel Bierlaire,et al. Discrete Choice Models for Pedestrian Walking Behavior , 2006 .

[49] Wolfram Burgard,et al. Socially Compliant Navigation Through Raw Depth Inputs with Generative Adversarial Imitation Learning , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[50] Wolfram Burgard,et al. Feature-Based Prediction of Trajectories for Socially Compliant Navigation , 2012, Robotics: Science and Systems.

[51] Silvio Savarese,et al. Social LSTM: Human Trajectory Prediction in Crowded Spaces , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[52] Hao Zhang,et al. Towards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[53] Virginie Lurkin,et al. Enhancing discrete choice models with representation learning , 2020 .