Baby Talk : Understanding and Generating Image Descriptions
暂无分享,去创建一个
[1] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.
[2] David A. Forsyth,et al. Matching Words and Pictures , 2003, J. Mach. Learn. Res..
[3] Susan McRoy,et al. DOGHED: A Template-Based Generator for Multimodal Dialog Systems Targeting Heterogeneous Devices , 2003, HLT-NAACL.
[4] Pietro Perona,et al. What do we see when we glance at a scene , 2004 .
[5] Eduard Hovy,et al. Template-Filtered Headline Summarization , 2004 .
[6] Martin J. Wainwright,et al. MAP estimation via agreement on (hyper)trees: Message-passing and linear programming , 2005, ArXiv.
[7] Vladimir Kolmogorov,et al. Convergent Tree-Reweighted Message Passing for Energy Minimization , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[8] Antonio Criminisi,et al. TextonBoost for Image Understanding: Multi-Class Object Recognition and Segmentation by Jointly Modeling Texture, Layout, and Context , 2007, International Journal of Computer Vision.
[9] Andrew Zisserman,et al. Learning Visual Attributes , 2007, NIPS.
[10] Thorsten Brants,et al. Large Language Models in Machine Translation , 2007, EMNLP.
[11] Antonio Torralba,et al. Ieee Transactions on Pattern Analysis and Machine Intelligence 1 80 Million Tiny Images: a Large Dataset for Non-parametric Object and Scene Recognition , 2022 .
[12] Larry S. Davis,et al. Beyond Nouns: Exploiting Prepositions and Comparative Adjectives for Learning Visual Classifiers , 2008, ECCV.
[13] Serge J. Belongie,et al. Object categorization using co-occurrence, location and appearance , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.
[14] Christoph H. Lampert,et al. Learning to detect unseen object classes by between-class attribute transfer , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.
[15] Herman Stehouwer,et al. Language Models for Contextual Error Detection and Correction , 2009 .
[16] Li Fei-Fei,et al. ImageNet: A large-scale hierarchical image database , 2009, CVPR.
[17] Charless C. Fowlkes,et al. Discriminative Models for Multi-Class Object Layout , 2009, 2009 IEEE 12th International Conference on Computer Vision.
[18] Ali Farhadi,et al. Describing objects by their attributes , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.
[19] Shree K. Nayar,et al. Attribute and simile classifiers for face verification , 2009, 2009 IEEE 12th International Conference on Computer Vision.
[20] Liang Lin,et al. I2T: Image Parsing to Text Description , 2010, Proceedings of the IEEE.
[21] Christopher D. Manning,et al. Stanford typed dependencies manual , 2010 .
[22] Antonio Torralba,et al. Using the forest to see the trees: exploiting context for visual object detection and localization , 2010, CACM.
[23] Yansong Feng,et al. How Many Words Is a Picture Worth? Automatic Caption Generation for News Images , 2010, ACL.
[24] Ahmet Aker,et al. Generating Image Descriptions Using Dependency Relational Patterns , 2010, ACL.
[25] Cyrus Rashtchian,et al. Every Picture Tells a Story: Generating Sentences from Images , 2010, ECCV.