Instance-level accuracy versus bag-level accuracy in multi-instance learning

In multi-instance learning, instances are organized into bags, and a bag is labeled positive if it contains at least one positive instance, and negative otherwise; the labels of the individual instances are not given. The task is to learn a classifier from this limited information. While the original task description involved learning an instance classifier, in the literature the task is often interpreted as learning a bag classifier. Depending on which of these two interpretations is used, it is more natural to evaluate classifiers according to how well they predict, respectively, instance labels or bag labels. In the literature, however, the two interpretations are often mixed, or the intended interpretation is left implicit. In this paper, we investigate the difference between bag-level and instance-level accuracy, both analytically and empirically. We show that there is a substantial difference between these two, and better performance on one does not necessarily imply better performance on the other. It is therefore useful to clearly distinguish the two settings, and always use the evaluation criterion most relevant for the task at hand. We show experimentally that the same conclusions hold for area under the ROC curve.

[1]  Richard S. Johannes,et al.  Using the ADAP Learning Algorithm to Forecast the Onset of Diabetes Mellitus , 1988 .

[2]  David W. Aha,et al.  Incremental Constructive Induction: An Instance-Based Approach , 1991, ML.

[3]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[4]  S. Cessie,et al.  Ridge Estimators in Logistic Regression , 1992 .

[5]  金田 重郎,et al.  C4.5: Programs for Machine Learning (書評) , 1995 .

[6]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[7]  Ron Kohavi,et al.  Scaling Up the Accuracy of Naive-Bayes Classifiers: A Decision-Tree Hybrid , 1996, KDD.

[8]  Philip M. Long,et al.  PAC Learning Axis-Aligned Rectangles with Respect to Product Distributions from Multiple-Instance Examples , 1996, COLT.

[9]  Philip M. Long,et al.  PAC Learning Axis-aligned Rectangles with Respect to Product Distributions from Multiple-Instance Examples , 1996, COLT '96.

[10]  Thomas G. Dietterich,et al.  Solving the Multiple Instance Problem with Axis-Parallel Rectangles , 1997, Artif. Intell..

[11]  Aravind Srinivasan,et al.  Approximating hyper-rectangles: learning and pseudo-random sets , 1997, STOC '97.

[12]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[13]  Tomás Lozano-Pérez,et al.  A Framework for Multiple-Instance Learning , 1997, NIPS.

[14]  Aravind Srinivasan,et al.  Approximating Hyper-Rectangles: Learning and Pseudorandom Sets , 1998, J. Comput. Syst. Sci..

[15]  Oded Maron,et al.  Multiple-Instance Learning for Natural Scene Classification , 1998, ICML.

[16]  M. Angela Sasse,et al.  Successful multiparty audio communication over the Internet , 1998, CACM.

[17]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[18]  John C. Platt,et al.  Fast training of support vector machines using sequential minimal optimization, advances in kernel methods , 1999 .

[19]  Yann Chevaleyre,et al.  Solving multiple-instance and multiple-part learning problems with decision trees and decision rules . Application to the mutagenesis problem , 2000 .

[20]  Jan Ramon,et al.  Multi instance neural networks , 2000, ICML 2000.

[21]  Jun Wang,et al.  Solving the Multiple-Instance Problem: A Lazy Learning Approach , 2000, ICML.

[22]  Sally A. Goldman,et al.  Multiple-Instance Learning of Real-Valued Data , 2001, J. Mach. Learn. Res..

[23]  Yann Chevaleyre,et al.  Solving Multiple-Instance and Multiple-Part Learning Problems with Decision Trees and Rule Sets. Application to the Mutagenesis Problem , 2001, Canadian Conference on AI.

[24]  Qi Zhang,et al.  EM-DD: An Improved Multiple-Instance Learning Technique , 2001, NIPS.

[25]  Thomas Hofmann,et al.  Support Vector Machines for Multiple-Instance Learning , 2002, NIPS.

[26]  Thomas Gärtner,et al.  Multi-Instance Kernels , 2002, ICML.

[27]  Xin Xu,et al.  Statistical Learning in Multiple Instance Problems , 2003 .

[28]  Zhi-Hua Zhou,et al.  Ensembles of Multi-instance Learners , 2003, ECML.

[29]  Alois Potton Spam , 2003, PIK Prax. Informationsverarbeitung Kommun..

[30]  Eibe Frank,et al.  Applying propositional learning algorithms to multi-instance data , 2003 .

[31]  Peter Auer,et al.  A Boosting Approach to Multiple Instance Learning , 2004, ECML.

[32]  Yixin Chen,et al.  Image Categorization by Learning and Reasoning with Regions , 2004, J. Mach. Learn. Res..

[33]  Adam Tauman Kalai,et al.  A Note on Learning from Multiple-Instance Examples , 2004, Machine Learning.

[34]  N. V. Vinodchandran,et al.  SVM-based generalized multiple-instance learning via approximate box counting , 2004, ICML.

[35]  D. Kibler,et al.  Instance-based learning algorithms , 2004, Machine Learning.

[36]  Xin Xu,et al.  Logistic Regression and Boosting for Labeled Bags of Instances , 2004, PAKDD.

[37]  Ashwin Srinivasan,et al.  Multi-instance tree learning , 2005, ICML.

[38]  Mark Craven,et al.  Supervised versus multiple instance learning: an empirical comparison , 2005, ICML.

[39]  Zhi-Hua Zhou,et al.  Locating Regions of Interest in CBIR with Multi-instance Learning Techniques , 2005, Australian Conference on Artificial Intelligence.

[40]  David W. Aha,et al.  Instance-Based Learning Algorithms , 1991, Machine Learning.

[41]  Murat Dundar,et al.  Multiple Instance Learning for Computer Aided Diagnosis , 2006, NIPS.

[42]  Lin Dong,et al.  A Comparison of Multi-instance Learning Algorithms , 2006 .

[43]  Mark Craven,et al.  Multiple-Instance Active Learning , 2007, NIPS.

[44]  Andreas Christmann,et al.  Support vector machines , 2008, Data Mining and Knowledge Discovery Handbook.

[45]  Dongjian He,et al.  Multi-semantic Scene Classification Based on Region of Interest , 2008, 2008 International Conference on Computational Intelligence for Modelling Control & Automation.

[46]  Daniel P. W. Ellis,et al.  Multiple-Instance Learning for Music Information Retrieval , 2008, ISMIR.

[47]  Min-Ling Zhang,et al.  Generalized Multi-Instance Learning: Problems, Algorithms and Data Sets , 2009, 2009 WRI Global Congress on Intelligent Systems.

[48]  I-Cheng Yeh,et al.  Knowledge discovery on RFM model using Bernoulli sequence , 2009, Expert Syst. Appl..

[49]  Ivor W. Tsang,et al.  A Convex Method for Locating Regions of Interest with Multi-instance Learning , 2009, ECML/PKDD.

[50]  James R. Foulds,et al.  A review of multi-instance learning assumptions , 2010, The Knowledge Engineering Review.

[51]  Hendrik Blockeel,et al.  Multi-Instance Learning , 2010, Encyclopedia of Machine Learning.

[52]  Hendrik Blockeel,et al.  Multi-Instance Learning , 2010, Encyclopedia of Machine Learning.

[53]  Eamonn J. Keogh Instance-Based Learning , 2010, Encyclopedia of Machine Learning and Data Mining.

[54]  Jun Zhou,et al.  MILIS: Multiple Instance Learning with Instance Selection , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[55]  Eibe Frank,et al.  Beyond Trees: Adopting MITI to Learn Rules and Ensemble Classifiers for Multi-Instance Data , 2011, Australasian Conference on Artificial Intelligence.

[56]  Guoqing Liu,et al.  Key Instance Detection in Multi-Instance Learning , 2012, ACML.

[57]  Gary Doran,et al.  A theoretical and empirical analysis of support vector machine methods for multiple-instance classification , 2014, Machine Learning.

[58]  Jaume Amores,et al.  Multiple instance classification: Review, taxonomy and comparative study , 2013, Artif. Intell..

[59]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .