MILKDE: A new approach for multiple instance learning based on positive instance selection and kernel density estimation

Abstract Multiple Instance Learning (MIL) is a recent paradigm of learning, which is based on the assignment of a single label to a set of instances called bag. A bag is positive if it contains at least one positive instance, and negative otherwise. This work proposes a new algorithm based on likelihood computation by means of Kernel Density Estimation (KDE) called MILKDE. Using the LogitBoost classifier, its performance was compared to that of forty-three MIL algorithms available in the literature using five data sets. Our proposal outperformed all of them for the Elephant (87.40%), Fox (66.80%) and COREL 2000 data sets (77.8%), and achieved competitive results for the MUSK 1 (89.20%) and MUSK 2 (87.50%) data sets, which are comparable to the higher accuracies obtained by other methods for this data sets. Overall results are statistically comparable to those obtained by the most well known methods for MIL described in the literature.

[1]  Horst Bischof,et al.  MIForests: Multiple-Instance Learning with Randomized Trees , 2010, ECCV.

[2]  J. Friedman Special Invited Paper-Additive logistic regression: A statistical view of boosting , 2000 .

[3]  Jun Wang,et al.  Solving the Multiple-Instance Problem: A Lazy Learning Approach , 2000, ICML.

[4]  Robert P. W. Duin,et al.  Multiple-instance learning as a classifier combining problem , 2013, Pattern Recognit..

[5]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[6]  Li Sun,et al.  ECG Analysis Using Multiple Instance Learning for Myocardial Infarction Detection , 2012, IEEE Transactions on Biomedical Engineering.

[7]  Joachim M. Buhmann,et al.  Ellipsoidal Multiple Instance Learning , 2013, ICML.

[8]  Peter Auer,et al.  On Learning From Multi-Instance Examples: Empirical Evaluation of a Theoretical Approach , 1997, ICML.

[9]  Jun Zhou,et al.  MILIS: Multiple Instance Learning with Instance Selection , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Giancarlo Ruffo,et al.  Learning single and multiple instance decision tree for computer security applications , 2000 .

[11]  R. Tibshirani,et al.  Additive Logistic Regression : a Statistical View ofBoostingJerome , 1998 .

[12]  Wu-Jun Li,et al.  MILD: Multiple-Instance Learning via Disambiguation , 2010, IEEE Transactions on Knowledge and Data Engineering.

[13]  Chao Wen,et al.  Multiple instance learning based on positive instance selection and bag structure construction , 2014, Pattern Recognit. Lett..

[14]  David G. Stork,et al.  Pattern Classification (2nd ed.) , 1999 .

[15]  Ivor W. Tsang,et al.  A Convex Method for Locating Regions of Interest with Multi-instance Learning , 2009, ECML/PKDD.

[16]  Min Yao,et al.  Semantic Image Retrieval Based on Multiple-Instance Learning , 2010, 2010 IEEE International Conference on Data Mining Workshops.

[17]  Yixin Chen,et al.  MILES: Multiple-Instance Learning via Embedded Instance Selection , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Thomas G. Dietterich,et al.  Solving the Multiple Instance Problem with Axis-Parallel Rectangles , 1997, Artif. Intell..

[19]  Thomas Gärtner,et al.  Multi-Instance Kernels , 2002, ICML.

[20]  Zhi-Hua Zhou,et al.  On the relation between multi-instance learning and semi-supervised learning , 2007, ICML '07.

[21]  Marco Loog,et al.  Multiple instance learning with bag dissimilarities , 2013, Pattern Recognit..

[22]  Boris Babenko,et al.  Multiple Instance Learning with Manifold Bags , 2011, ICML.

[23]  Longbing Cao,et al.  A Similarity-Based Classification Framework for Multiple-Instance Learning , 2014, IEEE Transactions on Cybernetics.

[24]  Qiang Wu,et al.  MIL-SKDE: Multiple-instance learning with supervised kernel density estimation , 2013, Signal Process..

[25]  Qi Zhang,et al.  EM-DD: An Improved Multiple-Instance Learning Technique , 2001, NIPS.

[26]  Dan Zhang,et al.  MILEAGE: Multiple Instance LEArning with Global Embedding , 2013, ICML.

[27]  Yuhong Guo Max-margin Multiple-Instance Learning via Semidefinite Programming , 2009, ACML.

[28]  Ivor W. Tsang,et al.  Batch mode Adaptive Multiple Instance Learning for computer vision tasks , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[29]  Jessica K. Hodgins,et al.  Detecting Parkinsons' symptoms in uncontrolled home environments: A multiple instance learning approach , 2012, 2012 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[30]  Xindong Wu,et al.  SMILE: A Similarity-Based Approach for Multiple Instance Learning , 2010, 2010 IEEE International Conference on Data Mining.

[31]  Xin Huang,et al.  An open multiple instance learning framework and its application in drug activity prediction problems , 2003, Third IEEE Symposium on Bioinformatics and Bioengineering, 2003. Proceedings..

[32]  Lin Dong,et al.  A Comparison of Multi-instance Learning Algorithms , 2006 .

[33]  Yann Chevaleyre,et al.  Solving multiple-instance and multiple-part learning problems with decision trees and decision rules . Application to the mutagenesis problem , 2000 .

[34]  R. Storn,et al.  Differential Evolution: A Practical Approach to Global Optimization (Natural Computing Series) , 2005 .

[35]  Ye Xu,et al.  Multi-instance Metric Learning , 2011, 2011 IEEE 11th International Conference on Data Mining.

[36]  Edward W. Wild,et al.  Multiple Instance Classification via Successive Linear Programming , 2008 .

[37]  Diana Inkpen,et al.  A Pattern-Based Model for Generating Text to Express Emotion , 2011, ACII.

[38]  Boris Babenko Multiple Instance Learning: Algorithms and Applications , 2008 .

[39]  Oded Maron,et al.  Learning from Ambiguity , 1998 .

[40]  Ronald M. Summers,et al.  Seeing Is Believing: Video Classification for Computed Tomographic Colonography Using Multiple-Instance Learning , 2012, IEEE Transactions on Medical Imaging.

[41]  Thomas Hofmann,et al.  Support Vector Machines for Multiple-Instance Learning , 2002, NIPS.

[42]  Aykut Erdem,et al.  Multiple-Instance Learning with Instance Selection via Dominant Sets , 2011, SIMBAD.

[43]  David G. Stork,et al.  Pattern classification, 2nd Edition , 2000 .

[44]  Peter V. Gehler,et al.  Deterministic Annealing for Multiple-Instance Learning , 2007, AISTATS.

[45]  Jun Yang Review of Multi-Instance Learning and Its applications , 2005 .

[46]  Fernando De la Torre,et al.  Gaussian Processes Multiple Instance Learning , 2010, ICML.

[47]  Yann Chevaleyre,et al.  Solving Multiple-Instance and Multiple-Part Learning Problems with Decision Trees and Rule Sets. Application to the Mutagenesis Problem , 2001, Canadian Conference on AI.

[48]  Alexandros Kalousis,et al.  Adaptive Distances on Sets of Vectors , 2010, 2010 IEEE International Conference on Data Mining.

[49]  Yixin Chen,et al.  Image Categorization by Learning and Reasoning with Regions , 2004, J. Mach. Learn. Res..

[50]  E. Parzen On Estimation of a Probability Density Function and Mode , 1962 .