论文信息 - Shape reasoning on mis-segmented and mis-labeled objects using approximated Fisher criterion

Shape reasoning on mis-segmented and mis-labeled objects using approximated Fisher criterion

To automatically determine semantics of a shape or to generate a set of keywords that describe the content of a given image is a difficult problem due to: (a) the high-dimensional problem, (b) the unsolved automatic object segmentation (mis-segmentation), and (c) the lack of well-labeled large image database (mis-labeling). In order to tackle (a), despite (b), (c) and the expensive handy image segmentation and labeling, visual features should be automatically selected to convey the most robust and discriminant information without requiring too computational cost. Therefore, we propose a novel method: 'Approximation of Linear Discriminant Analysis' (ALDA), which is more generic than LDA: ALDA does not require explicit class labeling of each training samples. We theoretically show that under weak assumption, ALDA allows efficient ranking estimation of the discriminant powers of the visual features. We apply ALDA on COREL database (10K images, 267 words) with Normalized Cuts segmentation algorithm. First, we demonstrate an image classification gain of 43%, while reducing features set by a factor 10. Secondly, we demonstrate that for some words (like 'Door', 'Flag'), even low-level shape features (convex hull, or moment of inertia) are more discriminant than any color or texture features.

Hervé Glotin | Sabrina Tollari | Pascale Giraudet

[1] Jing Peng,et al. LDA/SVM driven nearest neighbor classification , 2003, IEEE Trans. Neural Networks.

[2] Matthieu Cord,et al. A comparison of active classification methods for content-based image retrieval , 2004, CVDB '04.

[3] David A. Forsyth,et al. The effects of segmentation and feature choice in a translation model of object recognition , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[4] Jitendra Malik,et al. Normalized Cuts and Image Segmentation , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[5] Avinash C. Kak,et al. On combining graph-partitioning with non-parametric clustering for image segmentation , 2004, Comput. Vis. Image Underst..

[6] David A. Forsyth,et al. Matching Words and Pictures , 2003, J. Mach. Learn. Res..

[7] Paul L. Rosin,et al. Measuring rectilinearity , 2005, Comput. Vis. Image Underst..

[8] Hervé Glotin,et al. LDA Versus MMD Approximation on Mislabeled Images for Dependant Selection of Visual Features and Their Heterogeneity , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[9] Mirela Tanase-Avatavului,et al. Shape decomposition and retrieval , 2005 .

[10] Jian Yang,et al. Why can LDA be performed in PCA transformed space? , 2003, Pattern Recognit..

[11] Hervé Glotin,et al. Approximation of Linear Discriminant Analysis for Word Dependent Visual Features Selection , 2005, ACIVS.

[12] Thierry Pun,et al. The Truth about Corel - Evaluation in Image Retrieval , 2002, CIVR.

[13] James Ze Wang,et al. Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[14] Juyang Weng,et al. Using Discriminant Eigenfeatures for Image Retrieval , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[15] Hans-Peter Kriegel,et al. State-of-the-Art in Content-Based Image and Video Retrieval , 2001, Computational Imaging and Vision.

[16] Juergen Luettin,et al. Hierarchical discriminant features for audio-visual LVCSR , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[17] David G. Stork,et al. Pattern Classification , 1973 .

[18] Gerard G. L. Meyer,et al. Geometric linear discriminant analysis for pattern recognition , 2004, Pattern Recognit..

[19] Hervé Glotin,et al. Enhancement of Textual Images Classification Using Segmented Visual Contents for Image Search Engine , 2005, Multimedia Tools and Applications.

[20] Hervé Glotin,et al. Large-vocabulary audio-visual speech recognition: a summary of the Johns Hopkins Summer 2000 Workshop , 2001, 2001 IEEE Fourth Workshop on Multimedia Signal Processing (Cat. No.01TH8564).

[21] Patrick Gros,et al. Robust Object Recognition in Images and the Related Database Problems , 2004, Multimedia Tools and Applications.

[22] Daniel Gatica-Perez,et al. On image auto-annotation with latent space models , 2003, ACM Multimedia.

[23] Remco C. Veltkamp,et al. Features in Content-based Image Retrieval Systems: a Survey , 1999, State-of-the-Art in Content-Based Image and Video Retrieval.