Planning Under Non-Rational Perception of Uncertain Spatial Costs

This work investigates the design of risk-perception-aware motion-planning strategies that incorporate non-rational risk associated with uncertain spatial costs. Our proposed method employs the Cumulative Prospect Theory (CPT) to generate a perceived risk map over a given environment. CPT-like perceived risks and path-length metrics are then combined to define a cost function that is compliant with the requirements of asymptotic optimality of sampling-based motion planners (RRT*). The modeling power of CPT is illustrated in theory and in simulation, along with a comparison to other risk perception models like Conditional Value at Risk (CVaR). Theoretically, we define a notion of expressiveness for a risk perception model and show that CPT's is higher than that of CVaR and expected risk. We then show that this expressiveness translates to our path planning setting, where we observe that a planner equipped with CPT together with a simultaneous perturbation stochastic approximation (SPSA) method can better approximate arbitrary paths in an environment. Additionally, we show in simulation that our planner captures a rich set of meaningful paths, representative of different risk perceptions in a custom environment. We then compare the performance of our planner with T-RRT* (a planner for continuous cost spaces) and Risk-RRT* (a risk-aware planner for dynamic human obstacles) through simulations in cluttered and dynamic environments respectively, showing the advantage of our proposed planner.

[1]  Insoon Yang,et al.  Risk-Aware Motion Planning and Control Using CVaR-Constrained Optimization , 2019, IEEE Robotics and Automation Letters.

[2]  Jonathan P. How,et al.  Robust Sampling-based Motion Planning with Asymptotic Optimality Guarantees , 2013 .

[3]  Emilio Frazzoli,et al.  Sampling-based algorithms for optimal motion planning , 2011, Int. J. Robotics Res..

[4]  Matthew L. Bolton,et al.  Modeling human perception Could Stevens' Power Law be an emergent feature? , 2008, 2008 IEEE International Conference on Systems, Man and Cybernetics.

[5]  Max Q.-H. Meng,et al.  A human-friendly robot navigation algorithm using the risk-RRT approach , 2016, 2016 IEEE International Conference on Real-time Computing and Robotics (RCAR).

[6]  Yue Wang,et al.  A Human-Computer Interface Design for Quantitative Measure of Regret Theory , 2018, IFAC-PapersOnLine.

[7]  Masahiro Ono,et al.  Chance-Constrained Optimal Path Planning With Obstacles , 2011, IEEE Transactions on Robotics.

[8]  J. Spall Multivariate stochastic approximation using a simultaneous perturbation gradient approximation , 1992 .

[9]  Han-Pang Huang,et al.  Robot Motion Planning in Dynamic Uncertain Environments , 2011, Adv. Robotics.

[10]  Richard M. Stern,et al.  Feature extraction for robust speech recognition using a power-law nonlinearity and power-bias subtraction , 2009, INTERSPEECH.

[11]  Marco Pavone,et al.  Multimodal Probabilistic Model-Based Planning for Human-Robot Interaction , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[12]  Sonia Martínez,et al.  Limited range spatial load balancing in non-convex environments using sampling-based motion planners , 2018, Auton. Robots.

[13]  S. S. Stevens Neural events and the psychophysical law. , 1970, Science.

[14]  Alexandre M. Bayen,et al.  Stabilizing Traffic with Autonomous Vehicles , 2018, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[15]  Shalabh Gupta,et al.  T$^\star$: Time-Optimal Risk-Aware Motion Planning for Curvature-Constrained Vehicles , 2019, IEEE Robotics and Automation Letters.

[16]  Qing Liu,et al.  Decision-making of investment in navigation safety improving schemes with application of cumulative prospect theory , 2018 .

[17]  James C. Spall,et al.  Introduction to stochastic search and optimization - estimation, simulation, and control , 2003, Wiley-Interscience series in discrete mathematics and optimization.

[18]  Max Q.-H. Meng,et al.  Risk-RRT∗: A robot motion planning algorithm for the human robot coexisting environment , 2017, 2017 18th International Conference on Advanced Robotics (ICAR).

[19]  E. Wagenmakers,et al.  Hierarchical Bayesian parameter estimation for cumulative prospect theory , 2011, Journal of Mathematical Psychology.

[20]  Sanjit Dhami,et al.  The Foundations of Behavioral Economic Analysis , 2017 .

[21]  Brendan Englot,et al.  Sampling-based Minimum Risk path planning in multiobjective configuration spaces , 2015, 2015 54th IEEE Conference on Decision and Control (CDC).

[22]  Oliver Brock,et al.  Sampling-Based Motion Planning With Sensing Uncertainty , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[23]  Michael C. Fu,et al.  Cumulative Prospect Theory Meets Reinforcement Learning: Prediction and Control , 2015, ICML.

[24]  Didier Devaurs,et al.  Optimal Path Planning in Complex Cost Spaces With Sampling-Based Algorithms , 2016, IEEE Transactions on Automation Science and Engineering.

[25]  A. Tversky,et al.  Advances in prospect theory: Cumulative representation of uncertainty , 1992 .

[26]  Nicholas M. Patrikalakis,et al.  Global motion planning under uncertain motion, sensing, and environment map , 2011, Auton. Robots.

[27]  Alberto Speranzon,et al.  Sampling-based min-max uncertainty path planning , 2016, 2016 IEEE 55th Conference on Decision and Control (CDC).

[28]  Philippe Artzner,et al.  Coherent Measures of Risk , 1999 .

[29]  Shreyas Sundaram,et al.  Game-Theoretic Protection Against Networked SIS Epidemics by Human Decision-Makers , 2017, IFAC-PapersOnLine.

[30]  Moshe Ben-Akiva,et al.  Adaptive route choices in risky traffic networks: A prospect theory approach , 2010 .

[31]  S. Shankar Sastry,et al.  Learning prospect theory value function and reference point of a sequential decision maker , 2017, 2017 IEEE 56th Annual Conference on Decision and Control (CDC).

[32]  S. S. Stevens On the psychophysical law. , 1957, Psychological review.

[33]  Sonia Martinez,et al.  Human-swarm Interactions for Formation Control Using Interpreters , 2020 .

[34]  Csaba Szepesvári,et al.  Stochastic Optimization in a Cumulative Prospect Theory Framework , 2018, IEEE Transactions on Automatic Control.

[35]  Andreas Glöckner,et al.  Cognitive models of risky choice: Parameter stability and predictive accuracy of prospect theory , 2012, Cognition.

[36]  Sonia Martínez,et al.  Gesture based Human-Swarm Interactions for Formation Control using interpreters , 2018, ArXiv.

[37]  Marco Pavone,et al.  A Framework for Time-Consistent, Risk-Sensitive Model Predictive Control: Theory and Algorithms , 2019, IEEE Transactions on Automatic Control.

[38]  Simon Parsons,et al.  Principles of Robot Motion: Theory, Algorithms and Implementations by Howie Choset, Kevin M. Lynch, Seth Hutchinson, George Kantor, Wolfram Burgard, Lydia E. Kavraki and Sebastian Thrun, 603 pp., $60.00, ISBN 0-262-033275 , 2007, The Knowledge Engineering Review.