Active learning with multi-criteria decision making systems

In active learning, the learner must measure the importance of unlabeled samples in a large dataset and iteratively select the best one. This sample selection process can be treated as a decision making problem, in which a finite set of alternatives is evaluated and ranked and a choice is made among them. Many decision making problems apply multiple criteria, since this usually performs better than a single criterion. Motivated by these facts, this paper proposes an active learning model based on multi-criteria decision making (MCMD). After pairwise comparison of the unlabeled samples, a preference preorder is determined for each criterion. The dominated index and the dominating index are then defined and calculated to evaluate the informativeness of unlabeled samples, providing an effective measure for sample selection. In the multiple-instance learning (MIL) setting, on the other hand, instances (samples) are grouped into bags; a bag is negative only if all of its instances are negative, and positive otherwise. Multiple-instance active learning (MIAL) aims to select and label the most informative bags from numerous unlabeled ones, and to learn a MIL classifier that accurately predicts unseen bags while requesting as few labels as possible. It adopts a MIL algorithm as the base classifier and follows an active learning procedure. To balance learning efficiency and generalization capability, the proposed active learning model is restricted to a specific algorithm in the MIL setting. Experimental results demonstrate the effectiveness of the proposed method.
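The abstract does not give the exact definitions of the dominated and dominating indices, so the following Python sketch is only an illustration under stated assumptions, not the paper's method. It assumes each unlabeled sample has one numeric score per criterion (higher meaning more informative), that the per-criterion preorder is the ordering of those scores, and that the indices are simple Pareto-dominance counts. The names `dominates`, `dominance_indices`, and `bag_label` are hypothetical. The `bag_label` helper encodes the bag-labeling rule stated above.

```python
from typing import List, Sequence, Tuple


def bag_label(instance_labels: Sequence[int]) -> int:
    """MIL rule from the abstract: a bag is negative (0) only if every
    instance is negative; otherwise it is positive (1)."""
    return int(any(label == 1 for label in instance_labels))


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """Assumed dominance relation: a is at least as good as b on every
    criterion and strictly better on at least one (Pareto dominance)."""
    at_least_as_good = all(x >= y for x, y in zip(a, b))
    strictly_better = any(x > y for x, y in zip(a, b))
    return at_least_as_good and strictly_better


def dominance_indices(
    scores: Sequence[Sequence[float]],
) -> Tuple[List[int], List[int]]:
    """Assumed index definitions (not taken from the paper):
    dominating[i] = number of samples that sample i dominates,
    dominated[i]  = number of samples that dominate sample i."""
    n = len(scores)
    dominating = [0] * n
    dominated = [0] * n
    for i in range(n):
        for j in range(n):
            if i != j and dominates(scores[i], scores[j]):
                dominating[i] += 1
                dominated[j] += 1
    return dominating, dominated


# Four unlabeled samples, each scored on two hypothetical criteria.
scores = [(0.9, 0.8), (0.5, 0.4), (0.7, 0.9), (0.2, 0.1)]
dominating, dominated = dominance_indices(scores)
print(dominating)  # [2, 1, 2, 0]
print(dominated)   # [0, 2, 0, 3]
```

Under these assumptions, samples 0 and 2 are not dominated by any other sample and each dominates two others, so a selector that prefers a low dominated index and a high dominating index would query one of them first. How the two indices are actually combined into a single ranking is left to the paper.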
