Learning the Visual Interpretation of Sentences
暂无分享,去创建一个
Lucy Vanderwende | C. Lawrence Zitnick | Devi Parikh | C. L. Zitnick | Devi Parikh | Lucy Vanderwende
[1] I. Biederman,et al. Scene perception: Detecting and judging objects undergoing relational violations , 1982, Cognitive Psychology.
[2] Ken Perlin,et al. Improv: a system for scripting interactive actors in virtual worlds , 1996, SIGGRAPH.
[3] Richard Sproat,et al. WordsEye: an automatic text-to-scene conversion system , 2001, SIGGRAPH.
[4] Andrew McCallum,et al. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.
[5] R. Manmatha,et al. A Model for Learning the Semantics of Pictures , 2003, NIPS.
[6] John R. Smith,et al. Multimedia semantic indexing using model vectors , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).
[7] David A. Forsyth,et al. Matching Words and Pictures , 2003, J. Mach. Learn. Res..
[8] Beat Fasel,et al. Automati Fa ial Expression Analysis: A Survey , 1999 .
[9] Paul Clough,et al. The IAPR TC-12 Benchmark: A New Evaluation Resource for Visual Information Systems , 2006 .
[10] John R. Smith,et al. Large-scale concept ontology for multimedia , 2006, IEEE MultiMedia.
[11] James Ze Wang,et al. The Story Picturing Engine---a system for automatic text illustration , 2006, TOMCCAP.
[12] Nuno Vasconcelos,et al. Bridging the Gap: Query by Semantic Example , 2007, IEEE Transactions on Multimedia.
[13] Antonio Criminisi,et al. TextonBoost for Image Understanding: Multi-Class Object Recognition and Segmentation by Jointly Modeling Texture, Layout, and Context , 2007, International Journal of Computer Vision.
[14] Quanfu Fan,et al. Reducing correspondence ambiguity in loosely labeled training data , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.
[15] Shih-Fu Chang,et al. CuZero: embracing the frontier of interactive visual search for informed users , 2008, MIR '08.
[16] Larry S. Davis,et al. Beyond Nouns: Exploiting Prepositions and Comparative Adjectives for Learning Visual Classifiers , 2008, ECCV.
[17] Shree K. Nayar,et al. FaceTracer: A Search Engine for Large Collections of Images with Faces , 2008, ECCV.
[18] Christoph H. Lampert,et al. Learning to detect unseen object classes by between-class attribute transfer , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.
[19] Ali Farhadi,et al. Describing objects by their attributes , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.
[20] David A. McAllester,et al. Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[21] Ali Farhadi,et al. Attribute-centric recognition for cross-category generalization , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[22] Mike Thelwall,et al. From sentence to emotion: a real-time three-dimensional graphics metaphor of emotions extracted from text , 2010, The Visual Computer.
[23] Bob Coyne,et al. Data collection and normalization for building the Scenario-Based Lexical Knowledge Resource of a text-to-scene conversion system , 2010, 2010 Fifth International Workshop Semantic Media Adaptation and Personalization.
[24] Abhinav Gupta,et al. Beyond active noun tagging: Modeling contextual interactions for multi-class active learning , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[25] Krista A. Ehinger,et al. SUN database: Large-scale scene recognition from abbey to zoo , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[26] Fei-Fei Li,et al. Modeling mutual context of object and human pose in human-object interaction activities , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[27] Alexander C. Berg,et al. Automatic Attribute Discovery and Characterization from Noisy Web Data , 2010, ECCV.
[28] Cyrus Rashtchian,et al. Collecting Image Annotations Using Amazon’s Mechanical Turk , 2010, Mturk@HLT-NAACL.
[29] Yansong Feng,et al. How Many Words Is a Picture Worth? Automatic Caption Generation for News Images , 2010, ACL.
[30] Ahmet Aker,et al. Generating Image Descriptions Using Dependency Relational Patterns , 2010, ACL.
[31] Cyrus Rashtchian,et al. Every Picture Tells a Story: Generating Sentences from Images , 2010, ECCV.
[32] Cordelia Schmid,et al. Combining attributes and Fisher vectors for efficient image retrieval , 2011, CVPR 2011.
[33] Yejin Choi,et al. Baby talk: Understanding and generating simple image descriptions , 2011, CVPR 2011.
[34] Kristen Grauman,et al. Relative attributes , 2011, 2011 International Conference on Computer Vision.
[35] Yiannis Aloimonos,et al. Corpus-Guided Sentence Generation of Natural Images , 2011, EMNLP.
[36] Vicente Ordonez,et al. Im2Text: Describing Images Using 1 Million Captioned Photographs , 2011, NIPS.
[37] Yi Yang,et al. Articulated pose estimation with flexible mixtures-of-parts , 2011, CVPR 2011.
[38] Xiaogang Wang,et al. Query-specific visual semantic spaces for web image re-ranking , 2011, CVPR 2011.
[39] Ali Farhadi,et al. Recognition using visual phrases , 2011, CVPR 2011.
[40] Jianfeng Gao,et al. MSR SPLAT, a language analysis toolkit , 2012, HLT-NAACL.
[41] Adriana Kovashka,et al. WhittleSearch: Image search with relative attribute feedback , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.
[42] P SumathiC.,et al. Automatic Facial Expression Analysis A Survey , 2012 .
[43] C. Lawrence Zitnick,et al. Bringing Semantics into Focus Using Visual Abstraction , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.