Active learning on anchorgraph with an improved transductive experimental design

Abstract Anchorgraph based learning methods have met with success in modeling the large data for scalable semi-supervised learning. However, like most graph based learning algorithms, they are usually built with a randomly selected labeled set classified in advance. Although many pool-based active learning methods have been proposed, they often require a relatively large computational and storage consumption, which tends to impose extra burden on the learning system. Thus in this paper, we propose a novel active learning method named anchor-based transductive experimental design (ATED). By fully utilizing the representing power of anchors, the improved method efficiently enhances the performance of the original anchorgraph based learning while introduces much less extra cost on computation and storage. Extensive experimental results on real-world datasets have validated our approach in terms of classifying accuracy and computational efficiency.

[1]  Daphne Koller,et al.  Support Vector Machine Active Learning with Applications to Text Classification , 2000, J. Mach. Learn. Res..

[2]  Tomoshi Otsuki,et al.  Active learning framework with iterative clustering for bioimage classification , 2012, Nature Communications.

[3]  Yuan Yan Tang,et al.  High-Order Distance-Based Multiview Stochastic Learning in Image Classification , 2014, IEEE Transactions on Cybernetics.

[4]  En Zhu,et al.  A Scalable Algorithm for Graph-Based Active Learning , 2008, FAW.

[5]  Eric Horvitz,et al.  An Interactive Approach to Solving Correspondence Problems , 2013, International Journal of Computer Vision.

[6]  Meng Wang,et al.  Multimodal Graph-Based Reranking for Web Image Search , 2012, IEEE Transactions on Image Processing.

[7]  Alexander M. Bronstein,et al.  Numerical Geometry of Non-Rigid Shapes , 2009, Monographs in Computer Science.

[8]  S. Sathiya Keerthi,et al.  Optimization Techniques for Semi-Supervised Support Vector Machines , 2008, J. Mach. Learn. Res..

[9]  Yi Yang,et al.  Interactive Video Indexing With Statistical Active Learning , 2012, IEEE Transactions on Multimedia.

[10]  Jing Wang,et al.  Scalable k-NN graph construction for visual descriptors , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Yihong Gong,et al.  trNon-greedy active learning for text categorization using convex ansductive experimental design , 2008, SIGIR '08.

[12]  Benoit Huet,et al.  Concept detector refinement using social videos , 2010, VLS-MCMR '10.

[13]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[14]  WangMeng,et al.  Active learning in multimedia annotation and retrieval , 2011 .

[15]  Yan Song,et al.  Multi-Concept Multi-Modality Active Learning for Interactive Video Annotation , 2007 .

[16]  Chun Chen,et al.  Document Summarization Based on Data Reconstruction , 2012, AAAI.

[17]  Yi Yang,et al.  Semi-Supervised Multiple Feature Analysis for Action Recognition , 2014, IEEE Transactions on Multimedia.

[18]  Kun Zhou,et al.  Laplacian optimal design for image retrieval , 2007, SIGIR.

[19]  Xiaojin Zhu,et al.  Introduction to Semi-Supervised Learning , 2009, Synthesis Lectures on Artificial Intelligence and Machine Learning.

[20]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[21]  Wei Liu,et al.  Large Graph Construction for Scalable Semi-Supervised Learning , 2010, ICML.

[22]  Jun Yu,et al.  Modern Machine Learning Techniques and Their Applications in Cartoon Animation Research , 2013 .

[23]  Bernhard Schölkopf,et al.  Learning with Hypergraphs: Clustering, Classification, and Embedding , 2006, NIPS.

[24]  Delbert Dueck,et al.  Clustering by Passing Messages Between Data Points , 2007, Science.

[25]  Yanrong Guo,et al.  Active learning based intervertebral disk classification combining shape and texture similarities , 2013, Neurocomputing.

[26]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[27]  Zoubin Ghahramani,et al.  Combining active learning and semi-supervised learning using Gaussian fields and harmonic functions , 2003, ICML 2003.

[28]  Bernhard Schölkopf,et al.  Learning with Local and Global Consistency , 2003, NIPS.

[29]  Rong Jin,et al.  Active Learning by Querying Informative and Representative Examples , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30]  Xuelong Li,et al.  Event-Based Media Enrichment Using an Adaptive Probabilistic Hypergraph Model , 2015, IEEE Transactions on Cybernetics.

[31]  Mei-Ling Shyu,et al.  ASIC: Supervised Multi-class Classification using Adaptive Selection of Information Components , 2007 .

[32]  WangMeng,et al.  Beyond distance measurement , 2009 .

[33]  Deng Cai,et al.  Manifold Adaptive Experimental Design for Text Categorization , 2012, IEEE Transactions on Knowledge and Data Engineering.

[34]  Roman Filipovych,et al.  Semi-supervised pattern classification of medical images: Application to mild cognitive impairment (MCI) , 2011, NeuroImage.

[35]  Meng Wang,et al.  Interactive Video Annotation by Multi-Concept Multi-Modality Active Learning , 2007, Int. J. Semantic Comput..

[36]  Bruno Lévy,et al.  Laplace-Beltrami Eigenfunctions Towards an Algorithm That "Understands" Geometry , 2006, IEEE International Conference on Shape Modeling and Applications 2006 (SMI'06).

[37]  Vittorio Castelli,et al.  On the exponential value of labeled samples , 1995, Pattern Recognit. Lett..

[38]  Jun Yu,et al.  Click Prediction for Web Image Reranking Using Multimodal Sparse Coding , 2014, IEEE Transactions on Image Processing.

[39]  Meng Wang,et al.  Unified Video Annotation via Multigraph Learning , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[40]  Gustavo Camps-Valls,et al.  Semi-Supervised Graph-Based Hyperspectral Image Classification , 2007, IEEE Transactions on Geoscience and Remote Sensing.

[41]  Fan Chung,et al.  Spectral Graph Theory , 1996 .

[42]  Meng Wang,et al.  Semisupervised Multiview Distance Metric Learning for Cartoon Synthesis , 2012, IEEE Transactions on Image Processing.

[43]  Michael I. Jordan,et al.  Robust design of biological experiments , 2005, NIPS.

[44]  Yi Yang,et al.  A Multimedia Retrieval Framework Based on Semi-Supervised Ranking and Relevance Feedback , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[45]  Pietro Perona,et al.  Self-Tuning Spectral Clustering , 2004, NIPS.

[46]  Jinbo Bi,et al.  Active learning via transductive experimental design , 2006, ICML.

[47]  Yihong Gong,et al.  Locality-constrained Linear Coding for image classification , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[48]  Bingbing Ni,et al.  Assistive tagging: A survey of multimedia tagging with human-computer joint exploration , 2012, CSUR.

[49]  Meng Wang,et al.  Adaptive Hypergraph Learning and its Application in Image Classification , 2012, IEEE Transactions on Image Processing.

[50]  Xiaofei He,et al.  Locality Preserving Projections , 2003, NIPS.

[51]  Burr Settles,et al.  Active Learning Literature Survey , 2009 .

[52]  Deniz Erdogmus,et al.  Locally Defined Principal Curves and Surfaces , 2011, J. Mach. Learn. Res..

[53]  A. Emery,et al.  Optimal experiment design , 1998 .

[54]  Meng Wang,et al.  Active learning in multimedia annotation and retrieval: A survey , 2011, TIST.

[55]  Meng Wang,et al.  Beyond Distance Measurement: Constructing Neighborhood Similarity for Video Annotation , 2009, IEEE Transactions on Multimedia.

[56]  Yi Yang,et al.  Multi-Class Active Learning by Uncertainty Sampling with Diversity Maximization , 2015, International Journal of Computer Vision.

[57]  Chun Chen,et al.  EMR: A Scalable Graph-Based Ranking Model for Content-Based Image Retrieval , 2015, IEEE Transactions on Knowledge and Data Engineering.

[58]  Lei Shi,et al.  Fast Algorithm for Approximate k-Nearest Neighbor Graph Construction , 2013, 2013 IEEE 13th International Conference on Data Mining Workshops.