Translating Images to Words for Recognizing Objects in Large Image and Video Collections
暂无分享,去创建一个
[1] Mary Czerwinski,et al. Semi-Automatic Image Annotation , 2001, INTERACT.
[2] Wei-Ying Ma,et al. Image and Video Retrieval , 2003, Lecture Notes in Computer Science.
[3] Pinar Duygulu Sahin,et al. Systematic Evaluation of Machine Translation Methods for Image and Video Annotation , 2005, CIVR.
[4] Jun Yang,et al. Finding Person X: Correlating Names with Visual Appearances , 2004, CIVR.
[5] James Ze Wang,et al. Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach , 2003, IEEE Trans. Pattern Anal. Mach. Intell..
[6] Oded Maron,et al. Multiple-Instance Learning for Natural Scene Classification , 1998, ICML.
[7] James H. Martin,et al. Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition , 2000 .
[8] R. Manmatha,et al. A Model for Learning the Semantics of Pictures , 2003, NIPS.
[9] Nando de Freitas,et al. A Statistical Model for General Contextual Object Recognition , 2004, ECCV.
[10] Mads Nielsen,et al. Computer Vision — ECCV 2002 , 2002, Lecture Notes in Computer Science.
[11] Sanjeev Khudanpur,et al. Hidden Markov models for automatic annotation and content-based retrieval of images and video , 2005, SIGIR '05.
[12] Y. Mori,et al. Image-to-word transformation based on dividing and vector quantizing images with words , 1999 .
[13] Christos Faloutsos,et al. Automatic multimedia cross-modal correlation discovery , 2004, KDD.
[14] James H. Martin,et al. Speech and language processing: an introduction to natural language processing , 2000 .
[15] R. Manmatha,et al. Multiple Bernoulli relevance models for image and video annotation , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..
[16] Hinrich Schütze,et al. Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.
[17] Ching-Yung Lin,et al. Video Collaborative Annotation Forum: Establishing Ground-Truth Labels on Large Multimedia Datasets , 2003, TRECVID.
[18] Jean-Luc Gauvain,et al. The LIMSI Broadcast News transcription system , 2002, Speech Commun..
[19] Robert L. Mercer,et al. The Mathematics of Statistical Machine Translation: Parameter Estimation , 1993, CL.
[20] Virginia Teller. Review of Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition by Daniel Jurafsky and James H. Martin. Prentice Hall 2000. , 2000 .
[21] Howard D. Wactlar,et al. Associating video frames with text , 2003 .
[22] David A. Forsyth,et al. Matching Words and Pictures , 2003, J. Mach. Learn. Res..
[23] Jean Ponce,et al. Computer Vision: A Modern Approach , 2002 .
[24] David A. Forsyth,et al. Clustering art , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.
[25] David A. Forsyth,et al. Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary , 2002, ECCV.
[26] Daniel Gatica-Perez,et al. On image auto-annotation with latent space models , 2003, ACM Multimedia.
[27] Michael I. Jordan,et al. Modeling annotated data , 2003, SIGIR.
[28] Eric Brill,et al. A Simple Rule-Based Part of Speech Tagger , 1992, HLT.
[29] Jitendra Malik,et al. Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[30] Dan Tufis,et al. Empirical Methods for Exploiting Parallel Texts , 2002, Lit. Linguistic Comput..
[31] David A. Forsyth,et al. Learning the semantics of words and pictures , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.
[32] Hermann Ney,et al. A Systematic Comparison of Various Statistical Alignment Models , 2003, CL.
[33] R. Manmatha,et al. Automatic image annotation and retrieval using cross-media relevance models , 2003, SIGIR.