论文信息 - Deep Probabilistic Accelerated Evaluation: A Certifiable Rare-Event Simulation Methodology for Black-Box Autonomy

Deep Probabilistic Accelerated Evaluation: A Certifiable Rare-Event Simulation Methodology for Black-Box Autonomy

Evaluating the reliability of intelligent physical systems against rare catastrophic events poses a huge testing burden for real-world applications. Simulation provides a useful, if not unique, platform to evaluate the extremal risks of these AI-enabled systems before their deployments. Importance Sampling (IS), while proven to be powerful for rare-event simulation, faces challenges in handling these systems due to their black-box nature that fundamentally undermines its efficiency guarantee. To overcome this challenge, we propose a framework called Deep Probabilistic Accelerated Evaluation (D-PrAE) to design IS, which leverages rare-event-set learning and a new notion of efficiency certificate. D-PrAE combines the dominating point method with deep neural network classifiers to achieve superior estimation efficiency. We present theoretical guarantees and demonstrate the empirical effectiveness of D-PrAE via examples on the safety-testing of self-driving algorithms that are beyond the reach of classical variance reduction techniques.

[1] Ding Zhao,et al. Accelerated Evaluation of Automated Vehicles in Car-Following Maneuvers , 2016, IEEE Transactions on Intelligent Transportation Systems.

[2] Donald L. Iglehart,et al. Importance sampling for stochastic simulations , 1989 .

[3] F. Cérou,et al. Adaptive Multilevel Splitting for Rare Event Analysis , 2007 .

[4] Nidhi Kalra,et al. Driving to Safety , 2016 .

[5] Michel Mandjes,et al. Fast simulation of overflow probabilities in a queue with Gaussian input , 2006, TOMC.

[6] Yuan Cao,et al. Tight Sample Complexity of Learning One-hidden-layer Convolutional Neural Networks , 2019, NeurIPS.

[7] J. Rosenthal,et al. General state space Markov chains and MCMC algorithms , 2004, math/0404033.

[8] Russ Tedrake,et al. Scalable End-to-End Autonomous Vehicle Testing via Rare-event Simulation , 2018, NeurIPS.

[9] Ding Zhao,et al. An Accelerated Approach to Safely and Efficiently Test Pre-Production Autonomous Vehicles on Public Streets , 2018, 2018 21st International Conference on Intelligent Transportation Systems (ITSC).

[10] Russ Tedrake,et al. Evaluating Robustness of Neural Networks with Mixed Integer Programming , 2017, ICLR.

[11] Paul Glasserman,et al. Multilevel Splitting for Estimating Rare Event Probabilities , 1999, Oper. Res..

[12] Amir Dembo,et al. Large Deviations Techniques and Applications , 1998 .

[13] A. Skoogh,et al. DESIGNING IMPORTANCE SAMPLERS TO SIMULATE MACHINE LEARNING PREDICTORS VIA OPTIMIZATION , 2018 .

[14] Gerardo Rubino,et al. Introduction to Rare Event Simulation , 2009, Rare Event Simulation using Monte Carlo Methods.

[15] Ding Zhao,et al. Accelerated Evaluation of Automated Vehicles. , 2016 .

[16] Jon A. Wellner,et al. Weak Convergence and Empirical Processes: With Applications to Statistics , 1996 .

[17] P. Shahabuddin,et al. Chapter 11 Rare-Event Simulation Techniques: An Introduction and Recent Advances , 2006, Simulation.

[18] Henry Lam,et al. State-dependent importance sampling for rare-event simulation: An overview and recent advances , 2012 .

[19] J.S. Sadowsky,et al. On large deviations theory and asymptotically efficient Monte Carlo estimation , 1990, IEEE Trans. Inf. Theory.

[20] J. Claybrook,et al. Autonomous vehicles: No driver…no regulation? , 2018, Science.

[21] Philip Koopman,et al. Autonomous Vehicle Safety: An Interdisciplinary Challenge , 2017, IEEE Intelligent Transportation Systems Magazine.

[22] Cem Anil,et al. Sorting out Lipschitz function approximation , 2018, ICML.

[23] Mykel J. Kochenderfer,et al. A Survey of Algorithms for Black-Box Safety Validation , 2020, J. Artif. Intell. Res..

[24] Philip Koopman,et al. Toward a Framework for Highly Automated Vehicle Safety Validation , 2018 .

[25] J. Lynch,et al. A weak convergence approach to the theory of large deviations , 1997 .

[26] Shie Mannor,et al. A Tutorial on the Cross-Entropy Method , 2005, Ann. Oper. Res..

[27] Ding Zhao,et al. Accelerated Evaluation of Automated Vehicles Using Piecewise Mixture Models , 2017, IEEE Transactions on Intelligent Transportation Systems.

[28] J. Beck,et al. Estimation of Small Failure Probabilities in High Dimensions by Subset Simulation , 2001 .

[29] J. Gärtner. On Large Deviations from the Invariant Measure , 1977 .

[30] Mykel J. Kochenderfer,et al. Adaptive Stress Testing for Autonomous Vehicles , 2018, 2018 IEEE Intelligent Vehicles Symposium (IV).

[31] R. Ellis,et al. LARGE DEVIATIONS FOR A GENERAL-CLASS OF RANDOM VECTORS , 1984 .

[32] Thomas A. Henzinger,et al. Handbook of Model Checking , 2018, Springer International Publishing.

[33] Joachim Wegener,et al. Evaluation of Different Fitness Functions for the Evolutionary Testing of an Autonomous Parking System , 2004, GECCO.

[34] Peter W. Glynn,et al. Stochastic Simulation: Algorithms and Analysis , 2007 .

[35] Henry Lam,et al. Rare event simulation techniques , 2011, Proceedings of the 2011 Winter Simulation Conference (WSC).

[36] Fei-Yue Wang,et al. Capturing Car-Following Behaviors by Deep Learning , 2018, IEEE Transactions on Intelligent Transportation Systems.

[37] Bouhari Arouna,et al. Adaptative Monte Carlo Method, A Variance Reduction Technique , 2004, Monte Carlo Methods Appl..

[38] Helbing,et al. Congested traffic states in empirical observations and microscopic simulations , 2000, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[39] Russ Tedrake,et al. Verifying Neural Networks with Mixed Integer Programming , 2017, ArXiv.

[40] F. Cérou,et al. Fluctuation Analysis of Adaptive Multilevel Splitting , 2014, 1408.6366.

[41] José Villén-Altamirano,et al. RESTART: a straightforward method for fast simulation of rare events , 1994, Proceedings of Winter Simulation Conference.

[42] Soumendu Sundar Mukherjee,et al. Weak convergence and empirical processes , 2019 .

[43] S. Juneja,et al. Rare-event Simulation Techniques : An Introduction and Recent Advances , 2006 .

[44] Pushmeet Kohli,et al. Rigorous Agent Evaluation: An Adversarial Approach to Uncover Catastrophic Failures , 2018, ICLR.

[45] Lih-Yuan Deng,et al. The Cross-Entropy Method: A Unified Approach to Combinatorial Optimization, Monte-Carlo Simulation, and Machine Learning , 2006, Technometrics.

[46] P. Dupuis,et al. Splitting for rare event simulation : A large deviation approach to design and analysis , 2007, 0711.2037.

[47] Abbas Mehrabian,et al. Nearly-tight VC-dimension bounds for piecewise linear neural networks , 2017, COLT.

[48] Martin Lauer,et al. Towards Responsibility-Sensitive Safety of Automated Vehicles with Reachable Set Analysis , 2019, 2019 IEEE International Conference on Connected Vehicles and Expo (ICCVE).

[49] Liwei Wang,et al. The Expressive Power of Neural Networks: A View from the Width , 2017, NIPS.