Advances in Knowledge Discovery and Data Mining

Evaluating a trained system is an important component of machine learning, but labeling test data for large-scale evaluation of a trained model can be extremely time-consuming and expensive. In this paper we propose strategies for estimating the performance of a classifier using as few labeled samples as possible. Specifically, we assume a fixed labeling budget is given, and the goal is to obtain a precise estimate of classifier accuracy using only that budget. We show that our strategies can reduce the variance of the accuracy estimate by a significant amount compared to simple random sampling (over 65% in several cases). In terms of labeling resources, the reduction in the number of samples required (compared to random sampling) to estimate the classifier accuracy to within 1% error is as high as 60% in some cases.
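The paper's specific strategies are not reproduced here; as a minimal sketch of the general idea, the Python snippet below contrasts simple random sampling with stratified sampling on classifier confidence scores (an assumed auxiliary variable) under a fixed labeling budget. The function names, the equal-width binning rule, and the simulated data are all illustrative assumptions, not taken from the paper.

```python
import numpy as np


def srs_accuracy_estimate(correct, budget, rng):
    """Estimate accuracy by labeling `budget` test points chosen uniformly at random."""
    idx = rng.choice(len(correct), size=budget, replace=False)
    return correct[idx].mean()


def stratified_accuracy_estimate(correct, scores, budget, n_strata, rng):
    """Estimate accuracy by stratifying the test set on a confidence score.

    The test set is partitioned into equal-width score bins, the labeling
    budget is allocated proportionally to stratum size, and the per-stratum
    sample accuracies are combined weighted by stratum size.
    """
    edges = np.linspace(scores.min(), scores.max(), n_strata + 1)
    strata = np.digitize(scores, edges[1:-1])  # stratum index in 0..n_strata-1
    estimate = 0.0
    for s in range(n_strata):
        members = np.flatnonzero(strata == s)
        if members.size == 0:
            continue
        weight = members.size / correct.size
        # Proportional allocation (rounded); a real implementation would
        # enforce that the rounded allocations sum exactly to the budget.
        n_s = min(members.size, max(1, round(budget * weight)))
        idx = rng.choice(members, size=n_s, replace=False)
        estimate += weight * correct[idx].mean()
    return estimate


# Simulated classifier: confidence scores in (0, 1), with correctness
# correlated with confidence, so the score is an informative auxiliary variable.
rng = np.random.default_rng(0)
n = 100_000
scores = rng.beta(5, 2, size=n)
correct = rng.random(n) < scores

budget, trials = 500, 2000
srs = [srs_accuracy_estimate(correct, budget, rng) for _ in range(trials)]
strat = [stratified_accuracy_estimate(correct, scores, budget, 10, rng) for _ in range(trials)]
print(f"true accuracy       : {correct.mean():.4f}")
print(f"SRS estimate std    : {np.std(srs):.4f}")
print(f"stratified est. std : {np.std(strat):.4f}")
```

Because correctness is correlated with the score in this simulation, proportional stratification removes the between-stratum component of the estimator's variance, which is the kind of variance reduction the abstract quantifies.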
