NICE: An Algorithm for Nearest Instance Counterfactual Explanations

In this paper we propose NICE, a new algorithm for generating counterfactual explanations for heterogeneous tabular data. The design of our algorithm specifically accounts for requirements that often arise in real-life deployments: the ability to provide an explanation for every prediction, run-time efficiency, and the ability to handle any classification model (including nondifferentiable ones). More specifically, our approach exploits information from a nearest instance to speed up the search process. We propose four versions of NICE, three of which optimize the explanations for one of the following properties: sparsity, proximity, or plausibility. An extensive empirical comparison on 10 datasets shows that our algorithm outperforms the current state of the art on all these properties. The analyses also reveal a trade-off between plausibility on the one hand and proximity or sparsity on the other, with our different optimization strategies letting users select their preferred trade-off. An open-source implementation of NICE can be found at https://github.com/ADMAntwerp/NICE.
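
To make the nearest-instance idea concrete, below is a minimal, hypothetical Python sketch, not the paper's actual implementation (see the linked repository for that). It assumes numerically encoded features and illustrative interfaces `predict_proba` (model class probabilities) and `distance` (a distance between two instances); the reward used to rank substitutions here is one plausible choice, whereas NICE's variants score candidates for sparsity, proximity, or plausibility instead.

```python
import numpy as np

def nice_sketch(x, X_train, predict_proba, distance):
    """Greedy nearest-instance counterfactual search (illustrative only).

    x             : 1-D array, the instance to explain
    X_train       : 2-D array of training instances
    predict_proba : callable mapping a 2-D array to class probabilities
    distance      : callable giving the distance between two instances
    """
    # Class the model assigns to the instance we want to explain.
    c = predict_proba(x[None, :]).argmax()

    # Step 1: find the nearest unlike neighbour -- the closest training
    # instance that the model assigns to a different class.
    train_classes = predict_proba(X_train).argmax(axis=1)
    unlike = X_train[train_classes != c]
    nun = min(unlike, key=lambda z: distance(x, z))

    # Step 2: greedily copy feature values from the nearest unlike
    # neighbour into a candidate counterfactual until the class flips.
    # Because the neighbour itself is classified differently, the loop
    # terminates after at most len(x) substitutions.
    cf = x.copy()
    while predict_proba(cf[None, :]).argmax() == c:
        differing = [i for i in range(len(x)) if cf[i] != nun[i]]

        def reward(i):
            trial = cf.copy()
            trial[i] = nun[i]
            # Lower probability of the original class = better move
            # (one possible reward; NICE's variants use others).
            return predict_proba(trial[None, :])[0, c]

        best = min(differing, key=reward)
        cf[best] = nun[best]
    return cf
```

As a usage example, with a scikit-learn classifier one could pass `clf.predict_proba` and a Euclidean `distance`. Note that the published implementation handles heterogeneous (mixed numerical and categorical) data, which this simplified sketch does not.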
