Automated generation of diverse and challenging scenarios for test and evaluation of autonomous vehicles

We propose a novel method for generating test scenarios for a black box autonomous system that demonstrate critical transitions in its performance modes. In complex environments it is possible for an autonomous system to fail at its assigned mission even if it complies with requirements for all subsystems and throws no faults. This is particularly true when the autonomous system may have to choose between multiple exclusive objectives. The standard approach of testing robustness through fault detection is directly stimulating the system and detecting violations of the system requirements. Our approach differs by instead running the autonomous system through full missions in a simulated environment and measuring performance based on high-level mission criteria. The result is a method of searching for challenging scenarios for an autonomous system under test that exercise a variety of performance modes. We utilize adaptive sampling to intelligently search the state space for test scenarios which exist on the boundary between distinct performance modes. Additionally, using unsupervised clustering techniques we can group scenarios by their performance modes and sort them by those which are most effective at diagnosing changes in the autonomous system's behavior.

[1]  Myra B. Cohen,et al.  An orchestrated survey of methodologies for automated software test case generation , 2013, J. Syst. Softw..

[2]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[3]  Damian M. Lyons,et al.  Automatic Verification of Autonomous Robot Missions , 2014, SIMPAR.

[4]  Burr Settles,et al.  Active Learning Literature Survey , 2009 .

[5]  Mauro Birattari,et al.  Fault detection in autonomous robots based on fault injection and learning , 2008, Auton. Robots.

[6]  Jérémie Guiochet,et al.  Virtual Worlds for Testing Robot Navigation: A Study on the Difficulty Level , 2016, 2016 12th European Dependable Computing Conference (EDCC).

[7]  Dorin Comaniciu,et al.  Mean Shift: A Robust Approach Toward Feature Space Analysis , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Dirk Gorissen,et al.  A novel sequential design strategy for global surrogate modeling , 2009, Proceedings of the 2009 Winter Simulation Conference (WSC).

[9]  Yi Qin,et al.  SIT: Sampling-based interactive testing for self-adaptive apps , 2016, J. Syst. Softw..

[10]  Jason R. Stack,et al.  Long duration autonomy for maritime systems: challenges and opportunities , 2016, Auton. Robots.

[11]  Matthias Scheutz,et al.  Development environments for autonomous mobile robots: A survey , 2007, Auton. Robots.

[12]  John McDermid,et al.  Testing Method for Multi-UAV Conflict Resolution Using Agent-Based Simulation and Multi-Objective Search , 2016, J. Aerosp. Inf. Syst..

[13]  H. A. Abbass,et al.  Computational Red Teaming to investigate Failure Patterns in Medium Term Conflict Detection , 2009 .

[14]  Piet Demeester,et al.  A Surrogate Modeling and Adaptive Sampling Toolbox for Computer Based Design , 2010, J. Mach. Learn. Res..

[15]  James Arnold,et al.  Testing Autonomous Robot Control Software Using Procedural Content Generation , 2013, SAFECOMP.

[16]  Karl Meinke,et al.  Learning-Based Testing of Distributed Microservice Architectures: Correctness and Fault Injection , 2015, SEFM Workshops.

[17]  Roger King,et al.  A framework for predicting the mission-specific performance of autonomous unmanned systems , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[18]  J. Choi,et al.  Model checking for decision making behaviour of heterogeneous multi-agent autonomous system , 2013 .

[19]  Bestoun S. Ahmed,et al.  An efficient strategy for covering array construction with fuzzy logic-based adaptive swarm optimization for software testing use , 2015, Expert Syst. Appl..

[20]  Andrew Rae,et al.  Situation coverage – a coverage criterion for testing autonomous robots , 2015 .