Paper information - Information-theoretic semantic multimedia indexing

Indexing collections that mix text documents, image documents, and documents containing both text and images requires a model that supports heterogeneous document types. In this paper, we show how information theory supplies the tools needed to develop a single model for text, image, and combined text/image retrieval. In our approach, for each possible query keyword we estimate a maximum entropy model based exclusively on preprocessed continuous features. The single continuous feature space shared by text and visual data is constructed by using a minimum description length criterion to find the optimal feature-space representation (optimal from an information-theoretic point of view). We evaluate our approach in three experiments: text-only retrieval, image-only retrieval, and combined text and image retrieval.
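
The per-keyword modelling step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that each document is already represented as a short vector of continuous features, and it relies on the fact that a maximum entropy model with a binary label has the same form as logistic regression. The toy data, the function names (`train_keyword_model`, `score`, `rank`), the plain gradient-descent fit, and the Gaussian-prior strength `l2` are all hypothetical choices made for illustration.

```python
import math


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def train_keyword_model(features, labels, l2=0.01, lr=0.5, epochs=500):
    """Fit a binary maximum entropy (logistic) model for one keyword.

    features: list of equal-length lists of continuous feature values
    labels:   list of 0/1 flags (1 = document is relevant to the keyword)
    l2:       strength of a Gaussian prior on the weights (assumed value)
    """
    n, d = len(features), len(features[0])
    w = [0.0] * d
    b = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * d
        grad_b = 0.0
        for x, y in zip(features, labels):
            p = sigmoid(b + sum(wi * xi for wi, xi in zip(w, x)))
            err = p - y  # gradient of the log-loss w.r.t. the linear score
            for j in range(d):
                grad_w[j] += err * x[j]
            grad_b += err
        # Average gradient plus the penalty term from the Gaussian prior.
        for j in range(d):
            w[j] -= lr * (grad_w[j] / n + l2 * w[j])
        b -= lr * grad_b / n
    return w, b


def score(model, x):
    # Estimated probability that the keyword applies to document x.
    w, b = model
    return sigmoid(b + sum(wi * xi for wi, xi in zip(w, x)))


# Hypothetical toy collection: four documents, two continuous features each.
docs = [[0.9, 0.1], [0.8, 0.2], [0.2, 0.7], [0.1, 0.9]]
keyword_labels = {"beach": [1, 1, 0, 0], "city": [0, 0, 1, 1]}

# One independent model per query keyword.
models = {kw: train_keyword_model(docs, y) for kw, y in keyword_labels.items()}


def rank(keyword):
    # Document indices ordered by decreasing keyword probability.
    return sorted(range(len(docs)),
                  key=lambda i: score(models[keyword], docs[i]),
                  reverse=True)
```

Calling `rank("beach")` returns the document indices ordered by the estimated probability that the keyword applies. Because every keyword has its own model over the same feature space, text, image, and mixed documents can be ranked the same way once they are mapped into that space.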

Stefan M. Rüger | João Magalhães
