Linking image and text for semantic labeling of images and videos
暂无分享,去创建一个
[1] William I. Grosky,et al. Narrowing the semantic gap - improved text-based web document retrieval using visual features , 2002, IEEE Trans. Multim..
[2] R. Manmatha,et al. Statistical models for automatic video annotation and retrieval , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[3] Fran ine Chena,et al. Multi-Modal Browsing of Images in Web Do uments , 1999 .
[4] Peter G. B. Enser,et al. Analysis of user need in image archives , 1997, J. Inf. Sci..
[5] R. Manmatha,et al. Multiple Bernoulli relevance models for image and video annotation , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..
[6] Gustavo Carneiro,et al. Formulating semantic image annotation as a supervised learning problem , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).
[7] Nando de Freitas,et al. A Statistical Model for General Contextual Object Recognition , 2004, ECCV.
[8] Sanjeev Khudanpur,et al. Hidden Markov models for automatic annotation and content-based retrieval of images and video , 2005, SIGIR '05.
[9] Shih-Fu Chang,et al. Semantic knowledge construction from annotated image collections , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.
[10] Shih-Fu Chang,et al. Perceptual knowledge construction from annotated image collections , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.
[11] James Ze Wang,et al. Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach , 2003, IEEE Trans. Pattern Anal. Mach. Intell..
[12] Marcel Worring,et al. Multimodal Video Indexing : A Review of the State-ofthe-art , 2001 .
[13] David A. Forsyth,et al. Clustering art , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.
[14] Shih-Fu Chang,et al. Combining text and audio-visual features in video indexing , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..
[15] Jitendra Malik,et al. Blobworld: Image Segmentation Using Expectation-Maximization and Its Application to Image Querying , 2002, IEEE Trans. Pattern Anal. Mach. Intell..
[16] R. Manmatha,et al. Using Maximum Entropy for Automatic Image Annotation , 2004, CIVR.
[17] Oded Maron,et al. Multiple-Instance Learning for Natural Scene Classification , 1998, ICML.
[18] Y. Mori,et al. Image-to-word transformation based on dividing and vector quantizing images with words , 1999 .
[19] Eero Sormunen,et al. End-User Searching Challenges Indexing Practices in the Digital Newspaper Photo Archive , 2004, Information Retrieval.
[20] W. Bruce Croft,et al. Cross-lingual relevance models , 2002, SIGIR '02.
[21] Marcel Worring,et al. Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..
[22] Jean Ponce,et al. Computer Vision: A Modern Approach , 2002 .
[23] Shi-Kuo Chang,et al. Image Information Systems: Where Do We Go From Here? , 1992, IEEE Trans. Knowl. Data Eng..
[24] David A. Forsyth,et al. Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary , 2002, ECCV.
[25] Jitendra Malik,et al. Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[26] Thomas Hofmann,et al. Learning and representing topic-a hierarchical mixture model for word occurences in document databas , 1998 .
[27] A. Dorado,et al. Semi-automatic image annotation using frequent keyword mining , 2003, Proceedings on Seventh International Conference on Information Visualization, 2003. IV 2003..
[28] Diane Gershon,et al. In the picture , 1990, Nature.
[29] Hermann Ney,et al. A Systematic Comparison of Various Statistical Alignment Models , 2003, CL.
[30] Robert L. Mercer,et al. The Mathematics of Statistical Machine Translation: Parameter Estimation , 1993, CL.
[31] Ching-Yung Lin,et al. Video Collaborative Annotation Forum: Establishing Ground-Truth Labels on Large Multimedia Datasets , 2003, TRECVID.
[32] Daniel Gatica-Perez,et al. PLSA-based image auto-annotation: constraining the latent space , 2004, MULTIMEDIA '04.
[33] Thomas Hofmann,et al. Statistical Models for Co-occurrence Data , 1998 .
[34] Rohini K. Srihari. Extracting visual information from text: using captions to label faces in newspaper photographs , 1992 .
[35] Debra T. Burhans,et al. Visual Semantics: Extracting Visual information from Text Accompanying Pictures , 1994, AAAI.
[36] S. Sclaroff,et al. Combining textual and visual cues for content-based image retrieval on the World Wide Web , 1998, Proceedings. IEEE Workshop on Content-Based Access of Image and Video Libraries (Cat. No.98EX173).
[37] Tat-Seng Chua,et al. A bootstrapping framework for annotating and retrieving WWW images , 2004, MULTIMEDIA '04.
[38] David A. Forsyth,et al. Matching Words and Pictures , 2003, J. Mach. Learn. Res..
[39] Harriet J. Nock,et al. Semantic annotation of multimedia using maximum entropy models , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..
[40] Christos Faloutsos,et al. Automatic image captioning , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).
[41] Pinar Duygulu Sahin,et al. Systematic Evaluation of Machine Translation Methods for Image and Video Annotation , 2005, CIVR.
[42] Christos Faloutsos,et al. Automatic multimedia cross-modal correlation discovery , 2004, KDD.
[43] Abby Goodrum,et al. Image Information Retrieval: An Overview of Current Research , 2000, Informing Sci. Int. J. an Emerg. Transdiscipl..
[44] Moses Charikar,et al. Greedy approximation algorithms for finding dense components in a graph , 2000, APPROX.
[45] Michael I. Jordan,et al. Modeling annotated data , 2003, SIGIR.
[46] R. Manmatha,et al. Automatic image annotation and retrieval using cross-media relevance models , 2003, SIGIR.
[47] Shih-Fu Chang,et al. Image Retrieval: Current Techniques, Promising Directions, and Open Issues , 1999, J. Vis. Commun. Image Represent..
[48] Daniel Gatica-Perez,et al. On image auto-annotation with latent space models , 2003, ACM Multimedia.
[49] David A. Forsyth,et al. Learning the semantics of words and pictures , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.
[50] R. Manmatha,et al. A Model for Learning the Semantics of Pictures , 2003, NIPS.
[51] Jean-Luc Gauvain,et al. The LIMSI Broadcast News transcription system , 2002, Speech Commun..
[52] G LoweDavid,et al. Distinctive Image Features from Scale-Invariant Keypoints , 2004 .