PALM: Machine Learning Explanations For Iterative Debugging

When a Deep Neural Network makes a misprediction, it can be challenging for a developer to understand why. While there are many models that interpret predictions in terms of predictive features, it may be more natural to isolate the small set of training examples that most influence a prediction. However, every training example often contributes to a prediction in some way, with varying degrees of responsibility. We present the Partition Aware Local Model (PALM), a tool that learns and summarizes this responsibility structure to aid machine learning debugging. PALM approximates a complex model (e.g., a deep neural network) with a two-part surrogate: a meta-model that partitions the training data, and a set of sub-models that approximate the patterns within each partition. The sub-models can be arbitrarily complex to capture intricate local patterns, but the meta-model is constrained to be a decision tree. This way, the user can examine the structure of the meta-model, determine whether its rules match intuition, and efficiently link problematic test examples to the responsible training data. Queries to PALM are nearly 30x faster than nearest-neighbor queries for identifying relevant data, a key property for interactive applications.
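To make the two-part surrogate concrete, below is a minimal sketch of the idea using scikit-learn. The `PALMSurrogate` class name, the choice of a logistic-regression sub-model, and the leaf-based routing are illustrative assumptions, not the authors' exact implementation: a shallow decision tree serves as the interpretable meta-model, one sub-model is fit per leaf, and responsibility queries return the training examples sharing a test point's partition.

```python
# Sketch of a PALM-style two-part surrogate (assumes scikit-learn).
# Names and sub-model choice are illustrative, not the paper's code.
import numpy as np
from sklearn.tree import DecisionTreeClassifier
from sklearn.linear_model import LogisticRegression

class PALMSurrogate:
    def __init__(self, max_depth=3):
        # Interpretable meta-model: a shallow decision tree whose
        # rules partition the training data into readable regions.
        self.meta = DecisionTreeClassifier(max_depth=max_depth)
        self.sub_models = {}    # leaf id -> sub-model (or constant)
        self.leaf_members = {}  # leaf id -> training-example indices

    def fit(self, X, black_box_predict):
        # Fit the surrogate to the black box's *predictions*, not the
        # ground truth, so it mimics the model being debugged.
        y_bb = black_box_predict(X)
        self.meta.fit(X, y_bb)
        leaves = self.meta.apply(X)  # leaf assignment per example
        for leaf in np.unique(leaves):
            idx = np.where(leaves == leaf)[0]
            self.leaf_members[leaf] = idx
            if len(np.unique(y_bb[idx])) > 1:
                # Sub-models may be arbitrarily complex; a linear
                # model keeps this sketch small.
                sub = LogisticRegression(max_iter=1000)
                sub.fit(X[idx], y_bb[idx])
                self.sub_models[leaf] = sub
            else:
                self.sub_models[leaf] = y_bb[idx][0]  # constant leaf
        return self

    def predict(self, X):
        leaves = self.meta.apply(X)
        out = np.empty(len(X), dtype=int)
        for i, leaf in enumerate(leaves):
            sub = self.sub_models[leaf]
            out[i] = sub.predict(X[i:i + 1])[0] if hasattr(sub, "predict") else sub
        return out

    def responsible_training_data(self, x):
        # Link a problematic test example to the training examples in
        # its partition: one tree traversal, no nearest-neighbor scan.
        leaf = self.meta.apply(x.reshape(1, -1))[0]
        return self.leaf_members[leaf]
```

Routing through the decision tree is what makes responsibility queries fast in this sketch: locating a test example's partition costs one root-to-leaf traversal rather than a distance computation against every training example, which is consistent with the speedup over nearest-neighbor queries reported above.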