Deep Probabilistic Accelerated Evaluation: A Certifiable Rare-Event Simulation Methodology for Black-Box Autonomy

Evaluating the reliability of intelligent physical systems against rare catastrophic events poses a huge testing burden for real-world applications. Simulation provides a useful, if not unique, platform to evaluate the extremal risks of these AI-enabled systems before their deployments. Importance Sampling (IS), while proven to be powerful for rare-event simulation, faces challenges in handling these systems due to their black-box nature that fundamentally undermines its efficiency guarantee. To overcome this challenge, we propose a framework called Deep Probabilistic Accelerated Evaluation (D-PrAE) to design IS, which leverages rare-event-set learning and a new notion of efficiency certificate. D-PrAE combines the dominating point method with deep neural network classifiers to achieve superior estimation efficiency. We present theoretical guarantees and demonstrate the empirical effectiveness of D-PrAE via examples on the safety-testing of self-driving algorithms that are beyond the reach of classical variance reduction techniques.

[1]  Ding Zhao,et al.  Accelerated Evaluation of Automated Vehicles in Car-Following Maneuvers , 2016, IEEE Transactions on Intelligent Transportation Systems.

[2]  Donald L. Iglehart,et al.  Importance sampling for stochastic simulations , 1989 .

[3]  F. Cérou,et al.  Adaptive Multilevel Splitting for Rare Event Analysis , 2007 .

[4]  Nidhi Kalra,et al.  Driving to Safety , 2016 .

[5]  Michel Mandjes,et al.  Fast simulation of overflow probabilities in a queue with Gaussian input , 2006, TOMC.

[6]  Yuan Cao,et al.  Tight Sample Complexity of Learning One-hidden-layer Convolutional Neural Networks , 2019, NeurIPS.

[7]  J. Rosenthal,et al.  General state space Markov chains and MCMC algorithms , 2004, math/0404033.

[8]  Russ Tedrake,et al.  Scalable End-to-End Autonomous Vehicle Testing via Rare-event Simulation , 2018, NeurIPS.

[9]  Ding Zhao,et al.  An Accelerated Approach to Safely and Efficiently Test Pre-Production Autonomous Vehicles on Public Streets , 2018, 2018 21st International Conference on Intelligent Transportation Systems (ITSC).

[10]  Russ Tedrake,et al.  Evaluating Robustness of Neural Networks with Mixed Integer Programming , 2017, ICLR.

[11]  Paul Glasserman,et al.  Multilevel Splitting for Estimating Rare Event Probabilities , 1999, Oper. Res..

[12]  Amir Dembo,et al.  Large Deviations Techniques and Applications , 1998 .

[13]  A. Skoogh,et al.  DESIGNING IMPORTANCE SAMPLERS TO SIMULATE MACHINE LEARNING PREDICTORS VIA OPTIMIZATION , 2018 .

[14]  Gerardo Rubino,et al.  Introduction to Rare Event Simulation , 2009, Rare Event Simulation using Monte Carlo Methods.

[15]  Ding Zhao,et al.  Accelerated Evaluation of Automated Vehicles. , 2016 .

[16]  Jon A. Wellner,et al.  Weak Convergence and Empirical Processes: With Applications to Statistics , 1996 .

[17]  P. Shahabuddin,et al.  Chapter 11 Rare-Event Simulation Techniques: An Introduction and Recent Advances , 2006, Simulation.

[18]  Henry Lam,et al.  State-dependent importance sampling for rare-event simulation: An overview and recent advances , 2012 .

[19]  J.S. Sadowsky,et al.  On large deviations theory and asymptotically efficient Monte Carlo estimation , 1990, IEEE Trans. Inf. Theory.

[20]  J. Claybrook,et al.  Autonomous vehicles: No driver…no regulation? , 2018, Science.

[21]  Philip Koopman,et al.  Autonomous Vehicle Safety: An Interdisciplinary Challenge , 2017, IEEE Intelligent Transportation Systems Magazine.

[22]  Cem Anil,et al.  Sorting out Lipschitz function approximation , 2018, ICML.

[23]  Mykel J. Kochenderfer,et al.  A Survey of Algorithms for Black-Box Safety Validation , 2020, J. Artif. Intell. Res..

[24]  Philip Koopman,et al.  Toward a Framework for Highly Automated Vehicle Safety Validation , 2018 .

[25]  J. Lynch,et al.  A weak convergence approach to the theory of large deviations , 1997 .

[26]  Shie Mannor,et al.  A Tutorial on the Cross-Entropy Method , 2005, Ann. Oper. Res..

[27]  Ding Zhao,et al.  Accelerated Evaluation of Automated Vehicles Using Piecewise Mixture Models , 2017, IEEE Transactions on Intelligent Transportation Systems.

[28]  J. Beck,et al.  Estimation of Small Failure Probabilities in High Dimensions by Subset Simulation , 2001 .

[29]  J. Gärtner On Large Deviations from the Invariant Measure , 1977 .

[30]  Mykel J. Kochenderfer,et al.  Adaptive Stress Testing for Autonomous Vehicles , 2018, 2018 IEEE Intelligent Vehicles Symposium (IV).

[31]  R. Ellis,et al.  LARGE DEVIATIONS FOR A GENERAL-CLASS OF RANDOM VECTORS , 1984 .

[32]  Thomas A. Henzinger,et al.  Handbook of Model Checking , 2018, Springer International Publishing.

[33]  Joachim Wegener,et al.  Evaluation of Different Fitness Functions for the Evolutionary Testing of an Autonomous Parking System , 2004, GECCO.

[34]  Peter W. Glynn,et al.  Stochastic Simulation: Algorithms and Analysis , 2007 .

[35]  Henry Lam,et al.  Rare event simulation techniques , 2011, Proceedings of the 2011 Winter Simulation Conference (WSC).

[36]  Fei-Yue Wang,et al.  Capturing Car-Following Behaviors by Deep Learning , 2018, IEEE Transactions on Intelligent Transportation Systems.

[37]  Bouhari Arouna,et al.  Adaptative Monte Carlo Method, A Variance Reduction Technique , 2004, Monte Carlo Methods Appl..

[38]  Helbing,et al.  Congested traffic states in empirical observations and microscopic simulations , 2000, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[39]  Russ Tedrake,et al.  Verifying Neural Networks with Mixed Integer Programming , 2017, ArXiv.

[40]  F. Cérou,et al.  Fluctuation Analysis of Adaptive Multilevel Splitting , 2014, 1408.6366.

[41]  José Villén-Altamirano,et al.  RESTART: a straightforward method for fast simulation of rare events , 1994, Proceedings of Winter Simulation Conference.

[42]  Soumendu Sundar Mukherjee,et al.  Weak convergence and empirical processes , 2019 .

[43]  S. Juneja,et al.  Rare-event Simulation Techniques : An Introduction and Recent Advances , 2006 .

[44]  Pushmeet Kohli,et al.  Rigorous Agent Evaluation: An Adversarial Approach to Uncover Catastrophic Failures , 2018, ICLR.

[45]  Lih-Yuan Deng,et al.  The Cross-Entropy Method: A Unified Approach to Combinatorial Optimization, Monte-Carlo Simulation, and Machine Learning , 2006, Technometrics.

[46]  P. Dupuis,et al.  Splitting for rare event simulation : A large deviation approach to design and analysis , 2007, 0711.2037.

[47]  Abbas Mehrabian,et al.  Nearly-tight VC-dimension bounds for piecewise linear neural networks , 2017, COLT.

[48]  Martin Lauer,et al.  Towards Responsibility-Sensitive Safety of Automated Vehicles with Reachable Set Analysis , 2019, 2019 IEEE International Conference on Connected Vehicles and Expo (ICCVE).

[49]  Liwei Wang,et al.  The Expressive Power of Neural Networks: A View from the Width , 2017, NIPS.