Paper information - Learning optimal visual features from Web sampling in online image retrieval

We use linear discriminant analysis (LDA) to improve a Web image retrieval system. Our work takes place in the official European ImagEVAL 2006 evaluation campaign. The task consists of retrieving Web images using both textual (Web page) and visual information. Our visual features integrate the subband entropy profile with the usual color mean and standard deviation. A simple weighted-norm fusion combines them with a standard tf-idf analysis of the Web page text. Our model is the second-best model of ImagEVAL task 2. We show how, by sampling online image sets from the Web, one can use an approximated Fisher criterion to estimate optimal visual feature subsets for some query concepts and thereby enhance their mean average precision (MAP) by 50%. We discuss the fact that some concepts may not be enhanced so well, but that on average this optimization reduces the visual dimension by 10 without any MAP degradation, yielding a significant CPU cost reduction.
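As a rough illustration of the feature selection idea in the abstract, the sketch below ranks visual feature dimensions with a textbook per-dimension Fisher ratio and keeps the best ones. It is not the authors' approximated Fisher criterion, whose exact form is not given here. The function names (`fisher_scores`, `select_features`), the `keep` parameter, and the toy data are all assumptions made for this example.

```python
from statistics import fmean, pvariance


def fisher_scores(positives, negatives, eps=1e-12):
    """Per-dimension Fisher ratio: (gap between class means)^2
    divided by the sum of the two class variances.

    positives / negatives are lists of feature vectors for images
    sampled for a concept and for background images (an assumption).
    """
    scores = []
    # zip(*rows) iterates over feature dimensions (columns).
    for pos_col, neg_col in zip(zip(*positives), zip(*negatives)):
        gap = fmean(pos_col) - fmean(neg_col)
        spread = pvariance(pos_col) + pvariance(neg_col)
        scores.append(gap * gap / (spread + eps))  # eps avoids division by zero
    return scores


def select_features(positives, negatives, keep):
    """Return the indices of the `keep` highest-scoring dimensions."""
    scores = fisher_scores(positives, negatives)
    ranked = sorted(range(len(scores)), key=lambda d: scores[d], reverse=True)
    return sorted(ranked[:keep])


# Toy data: only dimension 0 separates the two classes.
positives = [[0.9, 0.2, 0.5], [1.1, 0.8, 0.4], [1.0, 0.5, 0.6]]
negatives = [[0.1, 0.3, 0.5], [-0.1, 0.7, 0.6], [0.0, 0.5, 0.4]]
selected = select_features(positives, negatives, keep=1)
print(selected)
```

Retrieval would then compare images on the selected dimensions only, which is where a reduction in visual dimension and CPU cost would come from under this simplified view.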

Hervé Glotin | Sabrina Tollari

[1] Thomas S. Huang, et al. Unifying Keywords and Visual Contents in Image Retrieval, 2002, IEEE Multim.

[2] Hervé Glotin, et al. Shape reasoning on mis-segmented and mis-labeled objects using approximated Fisher criterion, 2006, Comput. Graph.

[3] R. Manmatha, et al. A Model for Learning the Semantics of Pictures, 2003, NIPS.

[4] Patrick Gros, et al. Robust Object Recognition in Images and the Related Database Problems, 2004, Multimedia Tools and Applications.

[5] Hervé Glotin, et al. LDA Versus MMD Approximation on Mislabeled Images for Dependant Selection of Visual Features and Their Heterogeneity, 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[6] Keiji Yanai, et al. Image region entropy: a measure of "visualness" of web images associated with one concept, 2005, MULTIMEDIA '05.

[7] Hervé Glotin, et al. Web image retrieval on ImagEVAL: evidences on visualness and textualness concept dependency in fusion model, 2007, CIVR '07.

[8] James Ze Wang, et al. Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach, 2003, IEEE Trans. Pattern Anal. Mach. Intell.

[9] Peter H. N. de With, et al. Multistage Face Recognition Using Adaptive Feature Selection and Classification, 2005, ACIVS.

[10] David A. Forsyth, et al. Learning the semantics of words and pictures, 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.