Reachable Sets of Classifiers & Regression Models: (Non-)Robustness Analysis and Robust Training

Neural networks achieve outstanding accuracy in classification and regression tasks. However, understanding their behavior remains an open challenge, raising questions about the robustness, explainability, and reliability of their predictions. We address these questions by computing reachable sets of neural networks, i.e., the sets of outputs resulting from continuous sets of inputs. We provide two efficient approaches that yield over- and under-approximations of the reachable set, and we show that this principle is highly versatile. First, we analyze and enhance the robustness properties of both classifiers and regression models, in contrast to existing works, which handle only classification. Specifically, we verify (non-)robustness, propose a robust training procedure, and show that our approach outperforms adversarial attacks as well as state-of-the-art classifier verification methods under non-norm-bounded perturbations. We also provide a technique for distinguishing reliable from unreliable predictions on unlabeled inputs, quantify the influence of each feature on a prediction, and compute a feature ranking.
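To make the idea of over-approximating a reachable set concrete, here is a minimal sketch using interval bound propagation, a simple and well-known technique that is not the paper's own (zonotope-based) method. It pushes an axis-aligned input box through a small ReLU network layer by layer; the function name and the toy network below are assumptions for this illustration.

```python
import numpy as np

def interval_bound_propagation(weights, biases, x_lo, x_hi):
    """Propagate an input box [x_lo, x_hi] through a ReLU network.

    Returns element-wise output bounds [lo, hi] that over-approximate
    the reachable set: every output for an input in the box lies inside.
    """
    lo, hi = np.asarray(x_lo, float), np.asarray(x_hi, float)
    for i, (W, b) in enumerate(zip(weights, biases)):
        center = (lo + hi) / 2          # midpoint of the current box
        radius = (hi - lo) / 2          # half-width of the current box
        c = W @ center + b              # image of the box center
        r = np.abs(W) @ radius          # worst-case spread under the affine map
        lo, hi = c - r, c + r
        if i < len(weights) - 1:        # ReLU on hidden layers (monotone, exact on boxes)
            lo, hi = np.maximum(lo, 0.0), np.maximum(hi, 0.0)
    return lo, hi

# Usage on a random two-layer network (hypothetical example data):
rng = np.random.default_rng(0)
Ws = [rng.standard_normal((4, 2)), rng.standard_normal((2, 4))]
bs = [rng.standard_normal(4), rng.standard_normal(2)]
lo, hi = interval_bound_propagation(Ws, bs, [-0.1, -0.1], [0.1, 0.1])
print(lo, hi)
```

Boxes are the coarsest common abstraction; they are cheap to propagate but can grow loose across layers, which is exactly what tighter set representations such as the zonotopes used in this work aim to avoid.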
