Representative selection based on sparse modeling

Selecting representatives for multimedia analysis applications could greatly reduce the time and memory consumption. Many representative selection methods have been proposed to select a subset from the database as representatives. However, current methods cannot guarantee that the selected subset could represent the global distribution of the entire dataset. In order to evaluate how well the subset represents the global distribution of the whole dataset, we use the distance metric: Kullback-Leibler (KL) divergence between the distribution of the fake dataset reconstructed from the subset and the distribution of the true dataset. In this work, we propose a sparse modeling based method to select representatives. The proposed method formulates the representative selection problem as a discrete dictionary learning problem. Based on the assumption that the dataset can be approximately reconstructed by linear combinations of dictionary items, we design a two-step iterative representative selection algorithm, which can minimize this KL divergence. Experiments evaluate the proposed algorithm in several multimedia analysis applications, including image and video summarization, classification using representative images and classification using representative features, and our method is shown to outperform state-of-the-art methods.

[1]  N. Meinshausen,et al.  High-dimensional graphs and variable selection with the Lasso , 2006, math/0608017.

[2]  K. Fujii,et al.  Visualization for the analysis of fluid motion , 2005, J. Vis..

[3]  Guillermo Sapiro,et al.  See all by looking at a few: Sparse modeling for finding representative objects , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Frédéric Jurie,et al.  Creating efficient codebooks for visual recognition , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[5]  David Gustafson,et al.  Building an Effective Visual Codebook: Is K-Means Clustering Useful? , 2012, ISVC.

[6]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[7]  Rajat Raina,et al.  Self-taught learning: transfer learning from unlabeled data , 2007, ICML '07.

[8]  Deng Cai,et al.  Laplacian Score for Feature Selection , 2005, NIPS.

[9]  Meng Wang,et al.  Semi-automatic video annotation based on active learning with multiple complementary predictors , 2005, MIR '05.

[10]  Jonathan J. Hull,et al.  A Database for Handwritten Text Recognition Research , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Yu Liu,et al.  Bregman Iteration Based Efficient Algorithm for MR Image Reconstruction From Undersampled K-Space Data , 2013, IEEE Signal Processing Letters.

[12]  Guillermo Sapiro,et al.  Dictionary learning and sparse coding for unsupervised clustering , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[13]  R. Tibshirani,et al.  Prototype selection for interpretable classification , 2011, 1202.5933.

[14]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[15]  K. Thangavel,et al.  Rough set based unsupervised feature selection in digital mammogram image using entropy measure , 2012, 2012 International Conference on Biomedical Engineering (ICoBE).

[16]  Yongdong Zhang,et al.  Salient region detection for complex background images using integrated features , 2014, Inf. Sci..

[17]  Eli Shechtman,et al.  In defense of Nearest-Neighbor based image classification , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[18]  Fei Wang,et al.  Cascading outbreak prediction in networks: a data-driven approach , 2013, KDD.

[19]  Jianping Fan,et al.  Image collection summarization via dictionary learning for sparse representation , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[20]  David J. Kriegman,et al.  Acquiring linear subspaces for face recognition under variable lighting , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Yue Gao,et al.  When Amazon Meets Google: Product Visualization by Exploring Multiple Web Sources , 2013, TOIT.

[22]  Andrew Zisserman,et al.  Unifying statistical texture classification frameworks , 2004, Image Vis. Comput..

[23]  Meng Wang,et al.  Active learning in multimedia annotation and retrieval: A survey , 2011, TIST.

[24]  Charles A. Micchelli,et al.  Maximum entropy and maximum likelihood criteria for feature selection from multivariate data , 2000, 2000 IEEE International Symposium on Circuits and Systems. Emerging Technologies for the 21st Century. Proceedings (IEEE Cat No.00CH36353).

[25]  Guillermo Sapiro,et al.  Classification and clustering via dictionary learning with structured incoherence and shared features , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[26]  Stéphane Ayache,et al.  Evaluation of active learning strategies for video indexing , 2007, Signal Process. Image Commun..

[27]  Nicu Sebe,et al.  Feature Selection for Multimedia Analysis by Sharing Information Among Multiple Tasks , 2013, IEEE Transactions on Multimedia.

[28]  Guillermo Sapiro,et al.  Online Learning for Matrix Factorization and Sparse Coding , 2009, J. Mach. Learn. Res..

[29]  Joel A. Tropp,et al.  Column subset selection, matrix factorization, and eigenvalue optimization , 2008, SODA.

[30]  Delbert Dueck,et al.  Clustering by Passing Messages Between Data Points , 2007, Science.

[31]  Thomas S. Huang,et al.  Leveraging Active Learning for Relevance Feedback Using an Information Theoretic Diversity Measure , 2006, CIVR.

[32]  M. Elad,et al.  $rm K$-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation , 2006, IEEE Transactions on Signal Processing.

[33]  David G. Stork,et al.  Pattern Classification , 1973 .

[34]  Jennifer G. Dy Unsupervised Feature Selection , 2007 .

[35]  Bart Thomee,et al.  New trends and ideas in visual concept detection: the MIR flickr retrieval evaluation initiative , 2010, MIR '10.

[36]  Tim K Marks,et al.  SUN: A Bayesian framework for saliency using natural statistics. , 2008, Journal of vision.

[37]  René Vidal,et al.  Recursive identification of switched ARX systems , 2008, Autom..

[38]  Christos Boutsidis,et al.  An improved approximation algorithm for the column subset selection problem , 2008, SODA.

[39]  Edward Y. Chang,et al.  Support vector machine active learning for image retrieval , 2001, MULTIMEDIA '01.

[40]  Xuelong Li,et al.  Visual-Textual Joint Relevance Learning for Tag-Based Social Image Search , 2013, IEEE Transactions on Image Processing.

[41]  Qi Tian,et al.  Social-oriented visual image search , 2014, Comput. Vis. Image Underst..

[42]  Jidong Zhao,et al.  Locality sensitive semi-supervised feature selection , 2008, Neurocomputing.

[43]  Andrew Zisserman,et al.  A Visual Vocabulary for Flower Classification , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[44]  Hujun Bao,et al.  A Variance Minimization Criterion to Feature Selection Using Laplacian Regularization , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[45]  Yihong Gong,et al.  Linear spatial pyramid matching using sparse coding for image classification , 2009, CVPR.

[46]  Guillermo Sapiro,et al.  Supervised Dictionary Learning , 2008, NIPS.

[47]  René Vidal,et al.  Robust classification using structured sparse representation , 2011, CVPR 2011.

[48]  Pedro M. Domingos A few useful things to know about machine learning , 2012, Commun. ACM.

[49]  Yongdong Zhang,et al.  Locally non-negative linear structure learning for interactive image retrieval , 2009, MM '09.

[50]  Elena Marchiori,et al.  Class Conditional Nearest Neighbor for Large Margin Instance Selection , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[51]  Frédéric Jurie,et al.  Sampling Strategies for Bag-of-Features Image Classification , 2006, ECCV.

[52]  M. S. Bazaraa,et al.  Nonlinear Programming , 1979 .

[53]  Allen Y. Yang,et al.  Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[54]  Yi Wu,et al.  Sampling Strategies for Active Learning in Personal Photo Retrieval , 2006, 2006 IEEE International Conference on Multimedia and Expo.