Learning Human–Robot Interaction for Robot-Assisted Pedestrian Flow Optimization

Due to the fast-is-slower phenomenon in emergency escape, it is desirable to regulate pedestrian flows at the exit or a bottleneck. We propose a new robot-assisted pedestrian regulation and study passive human–robot interaction (HRI). A learning-based motion control approach is presented for a robot to efficiently interact with pedestrians for desirable collective motion. We first formulate the problem into an optimal control framework using the pedestrian dynamics description based on existing social force models with embedded HRI forces. To solve the defined optimal control problem, we propose an adaptive dynamic programming (ADP) approach to provide adjustable motion parameters of the robot to efficiently interact with pedestrians so that the regulated pedestrian flow tracks a desired velocity. The ADP control process only uses observed flow information rather than the models of pedestrians, and the ADP method provides feedback control with online learning and control capability. Simulation results demonstrate that the proposed approach can regulate pedestrian flows to desirable speeds by online learning.

[1]  Roger L. Hughes,et al.  A continuum theory for the flow of pedestrians , 2002 .

[2]  Apoorva Shende,et al.  Optimal Control of Pedestrian Evacuation in a Corridor , 2007, 2007 IEEE Intelligent Transportation Systems Conference.

[3]  Apoorva Shende,et al.  Optimization-Based Feedback Control for Pedestrian Evacuation From an Exit Corridor , 2011, IEEE Transactions on Intelligent Transportation Systems.

[4]  R. Hughes The flow of human crowds , 2003 .

[5]  Kerner,et al.  Cluster effect in initially homogeneous traffic flow. , 1993, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[6]  Andreas Krause,et al.  Robot navigation in dense human crowds: the case for cooperation , 2013, 2013 IEEE International Conference on Robotics and Automation.

[7]  Ko Yamamoto,et al.  Continuum model of crossing pedestrian flows and swarm control based on temporal/spatial frequency , 2011, 2011 IEEE International Conference on Robotics and Automation.

[8]  Dirk Helbing,et al.  Experimental study of the behavioural mechanisms underlying self-organization in human crowds , 2009, Proceedings of the Royal Society B: Biological Sciences.

[9]  Sabiha Amin Wadoo,et al.  Sliding Mode Control of Crowd Dynamics , 2013, IEEE Transactions on Control Systems Technology.

[10]  Dirk Helbing,et al.  Dynamics of crowd disasters: an empirical study. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[11]  Shuai Li,et al.  Cooperative Distributed Source Seeking by Multiple Robots: Algorithms and Experiments , 2014, IEEE/ASME Transactions on Mechatronics.

[12]  Andreas Krause,et al.  Unfreezing the robot: Navigation in dense, interacting crowds , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[13]  Rachid Alami,et al.  Human-aware robot navigation: A survey , 2013, Robotics Auton. Syst..

[14]  Ko Yamamoto,et al.  Control of swarm behavior in crossing pedestrians based on temporal/spatial frequencies , 2013, Robotics Auton. Syst..

[15]  Gonzalo Ferrer,et al.  Robot companion: A social-force based approach with human awareness-navigation in crowded environments , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[16]  Helbing,et al.  Social force model for pedestrian dynamics. , 1995, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[17]  Warren B. Powell,et al.  GUIDANCE IN THE USE OF ADAPTIVE CRITICS FOR CONTROL , 2007 .

[18]  Seiji Yamada,et al.  Active teaching for an interactive learning robot , 2003, The 12th IEEE International Workshop on Robot and Human Interactive Communication, 2003. Proceedings. ROMAN 2003..

[19]  Dirk Helbing,et al.  Pedestrian flow optimization with a genetic algorithm based on Boolean grids , 2007 .

[20]  Jagannathan Sarangapani,et al.  Neural Network Control of Nonlinear Discrete-Time Systems , 2018 .

[21]  Ko Yamamoto,et al.  Human swarm modeling in exhibition space and space design , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[22]  Jennie Si,et al.  Online learning control by association and reinforcement. , 2001, IEEE transactions on neural networks.

[23]  Ding Wang,et al.  Feature selection and feature learning for high-dimensional batch reinforcement learning: A survey , 2015, Int. J. Autom. Comput..

[24]  Wenjian Yu,et al.  Modeling crowd turbulence by many-particle simulations. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[25]  Christian Vollmer,et al.  Learning to navigate through crowded environments , 2010, 2010 IEEE International Conference on Robotics and Automation.

[26]  Apoorva Shende,et al.  Optimal Feedback Flow Rates for Pedestrian Evacuation in a Network of Corridors , 2013, IEEE Transactions on Intelligent Transportation Systems.

[27]  Maja J. Mataric,et al.  Insights toward robot-assisted evacuation , 2005, Adv. Robotics.

[28]  Masafumi Okada,et al.  Optimization of personal distribution for evacuation guidance based on vector field , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[29]  Agathoniki Trigoni,et al.  Robot-assisted discovery of evacuation routes in emergency scenarios , 2008, 2008 IEEE International Conference on Robotics and Automation.

[30]  L. F. Henderson,et al.  The Statistics of Crowd Fluids , 1971, Nature.

[31]  Douglas Guimarães Macharet,et al.  Learning how to increase the chance of human-robot engagement , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[32]  Kai Oliver Arras,et al.  Socially-aware robot navigation: A learning approach , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[33]  F.L. Lewis,et al.  Reinforcement learning and adaptive dynamic programming for feedback control , 2009, IEEE Circuits and Systems Magazine.

[34]  Warren B. Powell,et al.  Handbook of Learning and Approximate Dynamic Programming , 2006, IEEE Transactions on Automatic Control.

[35]  Thomas Bak,et al.  Trajectory planning for robots in dynamic human environments , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[36]  Feng Liu,et al.  A boundedness result for the direct heuristic dynamic programming , 2012, Neural Networks.

[37]  Haibo He,et al.  Kernel-Based Approximate Dynamic Programming for Real-Time Online Learning Control: An Experimental Study , 2014, IEEE Transactions on Control Systems Technology.

[38]  Alberto Sanfeliu,et al.  Local optimization of cooperative robot movements for guiding and regrouping people in a guiding mission , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[39]  Takayuki Kanda,et al.  Simulation-Based Behavior Planning to Prevent Congestion of Pedestrians Around a Robot , 2015, IEEE Transactions on Robotics.

[40]  Yi Guo,et al.  Distributed multi-robot evacuation incorporating human behavior , 2013, 2013 10th IEEE International Conference on Control and Automation (ICCA).

[41]  Jinyu Wen,et al.  Adaptive Learning in Tracking Control Based on the Dual Critic Network Design , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[42]  Matteo Munaro,et al.  OpenPTrack: Open source multi-camera calibration and people tracking for RGB-D camera networks , 2016, Robotics Auton. Syst..

[43]  Sridhar Mahadevan,et al.  Proto-value functions: developmental reinforcement learning , 2005, ICML.

[44]  Patricio A. Vela,et al.  Information propagation applied to robot-assisted evacuation , 2012, 2012 IEEE International Conference on Robotics and Automation.

[45]  Kerner,et al.  Structure and parameters of clusters in traffic flow. , 1994, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[46]  Haibo He,et al.  Robot-assisted pedestrian regulation in an exit corridor , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[47]  Georgios Ch. Sirakoulis,et al.  Robot Guided Crowd Evacuation , 2015, IEEE Transactions on Automation Science and Engineering.

[48]  Lubos Buzna,et al.  Self-Organized Pedestrian Crowd Dynamics: Experiments, Simulations, and Design Solutions , 2005, Transp. Sci..

[49]  Sabiha Amin Wadoo,et al.  Feedback Control of Crowd Evacuation in One Dimension , 2010, IEEE Transactions on Intelligent Transportation Systems.

[50]  Dirk Helbing,et al.  Simulating dynamical features of escape panic , 2000, Nature.

[51]  Andrew Y. Ng,et al.  Pharmacokinetics of a novel formulation of ivermectin after administration to goats , 2000, ICML.

[52]  Anthony A. Maciejewski,et al.  Using genetic algorithms to optimize social robot behavior for improved pedestrian flow , 2005, 2005 IEEE International Conference on Systems, Man and Cybernetics.