Training Set Debugging Using Trusted Items

Training set bugs are flaws in the data that adversely affect machine learning. The training set is usually too large for manual inspection, but one may have the resources to verify a few trusted items. The set of trusted items may not by itself be adequate for learning, so we propose an algorithm that uses these items to identify bugs in the training set and thus improve learning. Specifically, our approach seeks the smallest set of changes to the training set labels such that the model learned from the corrected training set predicts the labels of the trusted items correctly. We flag the items whose labels are changed as potential bugs, which human experts can then check for veracity. Finding bugs in this way is a challenging combinatorial bilevel optimization problem, but it can be relaxed into a continuous optimization problem. Experiments on toy and real data demonstrate that our approach can identify training set bugs effectively and suggest appropriate label changes. Our algorithm is a step toward trustworthy machine learning.
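The relaxed objective described above can be illustrated with a minimal sketch. The paper's exact formulation is a bilevel problem; the code below substitutes a simplified one-level variant that jointly optimizes logistic-regression weights and continuous per-item label-flip variables `delta` in [0, 1], with an L1 penalty encouraging few flips. The synthetic data, hyperparameters, and all variable names here are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative synthetic task: 2-D points, a linear labeling rule,
# and a few deliberately flipped ("buggy") training labels.
n, d = 200, 2
X = rng.normal(size=(n, d))
w_true = np.array([1.0, -1.0])
y = (X @ w_true > 0).astype(float)
bugs = rng.choice(n, size=10, replace=False)
y[bugs] = 1 - y[bugs]                       # inject label bugs

# A small set of trusted items with verified labels.
Xt = rng.normal(size=(20, d))
yt = (Xt @ w_true > 0).astype(float)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# One-level relaxation (a simplification of the bilevel problem):
# jointly minimize training loss on soft labels y + delta*(1 - 2y),
# trusted-item loss, and lam * sum(delta), with delta kept in [0, 1].
w = np.zeros(d)
delta = np.zeros(n)
lam, lr = 0.001, 0.5
for _ in range(3000):
    y_soft = y + delta * (1 - 2 * y)        # soft-corrected labels
    p = sigmoid(X @ w)                      # training predictions
    pt = sigmoid(Xt @ w)                    # trusted-item predictions
    # Gradient of (mean training loss + mean trusted loss) w.r.t. w.
    grad_w = X.T @ (p - y_soft) / n + Xt.T @ (pt - yt) / len(yt)
    # Gradient w.r.t. delta, plus the L1 subgradient (delta >= 0).
    grad_delta = (1 - 2 * y) * (-(X @ w)) / n + lam
    w -= lr * grad_w
    delta = np.clip(delta - lr * grad_delta, 0.0, 1.0)

flagged = np.where(delta > 0.5)[0]          # suspected label bugs
```

With these ad hoc settings, the flagged items should be dominated by the injected bugs; the point of the sketch is only that relaxing the discrete flip decisions to continuous `delta` makes the search amenable to gradient descent.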
