Slice Finder: Automated Data Slicing for Model Interpretability

As machine learning (ML) systems become democratized, helping users easily debug their models becomes increasingly important. Yet current data tools remain primitive when it comes to helping users trace model performance problems all the way back to the data. We focus on the problem of slicing data to identify subsets of the training data where the model performs poorly. Unlike general techniques (e.g., clustering) that can find arbitrary slices, our goal is to find interpretable slices (which are easier to act on than arbitrary subsets) that are both problematic and large. We propose Slice Finder, an interactive framework for identifying such slices using statistical techniques. The discovered slices can be used in applications such as diagnosing model fairness and fraud detection, where describing slices in human-interpretable terms is necessary.
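To make the notion of an "interpretable slice" concrete, the sketch below enumerates slices defined by a single feature predicate (feature = value) and flags those that are large and have noticeably higher average loss than the dataset overall. The data, feature names, and the simple gap threshold are illustrative assumptions; the actual Slice Finder uses statistical tests with false-discovery control rather than a fixed threshold.

```python
from statistics import mean

# Toy dataset (hypothetical): each row has categorical features
# and a per-example model loss.
rows = [
    {"country": "US", "plan": "free", "loss": 0.2},
    {"country": "US", "plan": "paid", "loss": 0.1},
    {"country": "DE", "plan": "free", "loss": 0.9},
    {"country": "DE", "plan": "free", "loss": 0.8},
    {"country": "DE", "plan": "paid", "loss": 0.3},
    {"country": "US", "plan": "free", "loss": 0.25},
]

def single_feature_slices(rows, min_size=2, min_gap=0.2):
    """Enumerate one-predicate slices (feature = value) and keep those
    that contain at least min_size examples and whose average loss
    exceeds the overall average loss by at least min_gap.  The fixed
    min_gap threshold is a stand-in for the statistical significance
    and effect-size tests used by the real system."""
    overall = mean(r["loss"] for r in rows)
    features = [k for k in rows[0] if k != "loss"]
    problematic = []
    for f in features:
        for v in {r[f] for r in rows}:
            losses = [r["loss"] for r in rows if r[f] == v]
            if len(losses) >= min_size and mean(losses) - overall >= min_gap:
                problematic.append((f, v, round(mean(losses), 3), len(losses)))
    return problematic

# The slice "country = DE" is flagged: it is interpretable (a single
# predicate a human can act on), large, and underperforming.
print(single_feature_slices(rows))
```

Because each flagged slice is just a conjunction of feature predicates, a user can immediately inspect or augment the corresponding training data, which is the key advantage over arbitrary clusters.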