暂无分享,去创建一个
Christopher Potts | Angel X. Chang | Christopher D. Manning | Will Monroe | Manolis Savva | Christopher Potts | Will Monroe | M. Savva
[1] Luke S. Zettlemoyer,et al. A Joint Model of Language and Perception for Grounded Attribute Learning , 2012, ICML.
[2] Larry S. Davis,et al. Image ranking and retrieval based on multi-attribute queries , 2011, CVPR 2011.
[3] Armand Joulin,et al. Deep Fragment Embeddings for Bidirectional Image Sentence Mapping , 2014, NIPS.
[4] Lucy Vanderwende,et al. Learning the Visual Interpretation of Sentences , 2013, 2013 IEEE International Conference on Computer Vision.
[5] Mihai Surdeanu,et al. The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.
[6] Terry Winograd,et al. Understanding natural language , 1974 .
[7] Bob Coyne,et al. Annotation Tools and Knowledge Representation for a Text-To-Scene System , 2012, COLING.
[8] Pat Hanrahan,et al. On being the right scale: sizing large collections of 3D models , 2014, SIGGRAPH ASIA Indoor Scene Understanding Where Graphics Meets Vision.
[9] Jayant Krishnamurthy,et al. Jointly Learning to Parse and Perceive: Connecting Natural Language to the Physical World , 2013, TACL.
[10] Deb Roy,et al. Probabilistic grounding of situated speech using plan recognition and reference resolution , 2005, ICMI '05.
[11] George A. Miller,et al. WordNet: A Lexical Database for English , 1995, HLT.
[12] Bob Coyne,et al. Collecting Semantic Data from Mechanical Turk for a Lexical Knowledge Resource in a Text to Picture Generating System , 2011, IWCS.
[13] Angel X. Chang,et al. Learning Spatial Knowledge for Text to 3D Scene Generation , 2014, EMNLP.
[14] Vicente Ordonez,et al. ReferItGame: Referring to Objects in Photographs of Natural Scenes , 2014, EMNLP.
[15] R. Baayen,et al. Mixed-effects modeling with crossed random effects for subjects and items , 2008 .
[16] Lijun Yin,et al. Real-time automatic 3D scene generation from natural language voice and text descriptions , 2006, MM '06.
[17] Deb Roy,et al. Grounded Semantic Composition for Visual Scenes , 2011, J. Artif. Intell. Res..
[18] Dan Klein,et al. A Game-Theoretic Approach to Generating Spatial Descriptions , 2010, EMNLP.
[19] Cyrus Rashtchian,et al. Every Picture Tells a Story: Generating Sentences from Images , 2010, ECCV.
[20] Richard Sproat,et al. WordsEye: an automatic text-to-scene conversion system , 2001, SIGGRAPH.
[21] Pat Hanrahan,et al. Example-based synthesis of 3D object arrangements , 2012, ACM Trans. Graph..
[22] Yejin Choi,et al. Baby talk: Understanding and generating simple image descriptions , 2011, CVPR 2011.
[23] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.
[24] Luke S. Zettlemoyer,et al. Learning Distributions over Logical Forms for Referring Expression Generation , 2013, EMNLP.