论文信息 - Linking image and text for semantic labeling of images and videos

Linking image and text for semantic labeling of images and videos

[1] William I. Grosky,et al. Narrowing the semantic gap - improved text-based web document retrieval using visual features , 2002, IEEE Trans. Multim..

[2] R. Manmatha,et al. Statistical models for automatic video annotation and retrieval , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3] Fran ine Chena,et al. Multi-Modal Browsing of Images in Web Do uments , 1999 .

[4] Peter G. B. Enser,et al. Analysis of user need in image archives , 1997, J. Inf. Sci..

[5] R. Manmatha,et al. Multiple Bernoulli relevance models for image and video annotation , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[6] Gustavo Carneiro,et al. Formulating semantic image annotation as a supervised learning problem , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[7] Nando de Freitas,et al. A Statistical Model for General Contextual Object Recognition , 2004, ECCV.

[8] Sanjeev Khudanpur,et al. Hidden Markov models for automatic annotation and content-based retrieval of images and video , 2005, SIGIR '05.

[9] Shih-Fu Chang,et al. Semantic knowledge construction from annotated image collections , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.

[10] Shih-Fu Chang,et al. Perceptual knowledge construction from annotated image collections , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.

[11] James Ze Wang,et al. Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[12] Marcel Worring,et al. Multimodal Video Indexing : A Review of the State-ofthe-art , 2001 .

[13] David A. Forsyth,et al. Clustering art , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[14] Shih-Fu Chang,et al. Combining text and audio-visual features in video indexing , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[15] Jitendra Malik,et al. Blobworld: Image Segmentation Using Expectation-Maximization and Its Application to Image Querying , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[16] R. Manmatha,et al. Using Maximum Entropy for Automatic Image Annotation , 2004, CIVR.

[17] Oded Maron,et al. Multiple-Instance Learning for Natural Scene Classification , 1998, ICML.

[18] Y. Mori,et al. Image-to-word transformation based on dividing and vector quantizing images with words , 1999 .

[19] Eero Sormunen,et al. End-User Searching Challenges Indexing Practices in the Digital Newspaper Photo Archive , 2004, Information Retrieval.

[20] W. Bruce Croft,et al. Cross-lingual relevance models , 2002, SIGIR '02.

[21] Marcel Worring,et al. Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[22] Jean Ponce,et al. Computer Vision: A Modern Approach , 2002 .

[23] Shi-Kuo Chang,et al. Image Information Systems: Where Do We Go From Here? , 1992, IEEE Trans. Knowl. Data Eng..

[24] David A. Forsyth,et al. Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary , 2002, ECCV.

[25] Jitendra Malik,et al. Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[26] Thomas Hofmann,et al. Learning and representing topic-a hierarchical mixture model for word occurences in document databas , 1998 .

[27] A. Dorado,et al. Semi-automatic image annotation using frequent keyword mining , 2003, Proceedings on Seventh International Conference on Information Visualization, 2003. IV 2003..

[28] Diane Gershon,et al. In the picture , 1990, Nature.

[29] Hermann Ney,et al. A Systematic Comparison of Various Statistical Alignment Models , 2003, CL.

[30] Robert L. Mercer,et al. The Mathematics of Statistical Machine Translation: Parameter Estimation , 1993, CL.

[31] Ching-Yung Lin,et al. Video Collaborative Annotation Forum: Establishing Ground-Truth Labels on Large Multimedia Datasets , 2003, TRECVID.

[32] Daniel Gatica-Perez,et al. PLSA-based image auto-annotation: constraining the latent space , 2004, MULTIMEDIA '04.

[33] Thomas Hofmann,et al. Statistical Models for Co-occurrence Data , 1998 .

[34] Rohini K. Srihari. Extracting visual information from text: using captions to label faces in newspaper photographs , 1992 .

[35] Debra T. Burhans,et al. Visual Semantics: Extracting Visual information from Text Accompanying Pictures , 1994, AAAI.

[36] S. Sclaroff,et al. Combining textual and visual cues for content-based image retrieval on the World Wide Web , 1998, Proceedings. IEEE Workshop on Content-Based Access of Image and Video Libraries (Cat. No.98EX173).

[37] Tat-Seng Chua,et al. A bootstrapping framework for annotating and retrieving WWW images , 2004, MULTIMEDIA '04.

[38] David A. Forsyth,et al. Matching Words and Pictures , 2003, J. Mach. Learn. Res..

[39] Harriet J. Nock,et al. Semantic annotation of multimedia using maximum entropy models , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[40] Christos Faloutsos,et al. Automatic image captioning , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[41] Pinar Duygulu Sahin,et al. Systematic Evaluation of Machine Translation Methods for Image and Video Annotation , 2005, CIVR.

[42] Christos Faloutsos,et al. Automatic multimedia cross-modal correlation discovery , 2004, KDD.

[43] Abby Goodrum,et al. Image Information Retrieval: An Overview of Current Research , 2000, Informing Sci. Int. J. an Emerg. Transdiscipl..

[44] Moses Charikar,et al. Greedy approximation algorithms for finding dense components in a graph , 2000, APPROX.

[45] Michael I. Jordan,et al. Modeling annotated data , 2003, SIGIR.

[46] R. Manmatha,et al. Automatic image annotation and retrieval using cross-media relevance models , 2003, SIGIR.

[47] Shih-Fu Chang,et al. Image Retrieval: Current Techniques, Promising Directions, and Open Issues , 1999, J. Vis. Commun. Image Represent..

[48] Daniel Gatica-Perez,et al. On image auto-annotation with latent space models , 2003, ACM Multimedia.

[49] David A. Forsyth,et al. Learning the semantics of words and pictures , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[50] R. Manmatha,et al. A Model for Learning the Semantics of Pictures , 2003, NIPS.

[51] Jean-Luc Gauvain,et al. The LIMSI Broadcast News transcription system , 2002, Speech Commun..

[52] G LoweDavid,et al. Distinctive Image Features from Scale-Invariant Keypoints , 2004 .