The Development of Multimodal Lexical Resources
James Pustejovsky | Nikhil Krishnaswamy | Tuan Do | Gitit Kehat
[1] Jeffrey Mark Siskind, et al. Grounding the Lexical Semantics of Verbs in Visual Perception using Force Dynamics and Event Logic, 1999, J. Artif. Intell. Res.
[2] Bernt Schiele, et al. A dataset for Movie Description, 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[3] James Pustejovsky, et al. Annotation Methodologies for Vision and Language Dataset Creation, 2016, ArXiv.
[4] Christopher Potts, et al. Text to 3D Scene Generation with Rich Lexical Grounding, 2015, ACL.
[5] Nancy Ide, et al. An Open Linguistic Infrastructure for Annotated Corpora, 2013, The People's Web Meets NLP.
[6] Jiaxuan Wang, et al. HICO: A Benchmark for Recognizing Human-Object Interactions in Images, 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[7] Margaret Mitchell, et al. VQA: Visual Question Answering, 2015, International Journal of Computer Vision.
[8] Alberto Del Bimbo, et al. Event detection and recognition for semantic annotation of video, 2010, Multimedia Tools and Applications.
[9] Pietro Perona, et al. Microsoft COCO: Common Objects in Context, 2014, ECCV.
[10] J. Pustejovsky. Dynamic Event Structure and Habitat Theory, 2013.
[11] Rada Mihalcea, et al. Mining semantic affordances of visual object categories, 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[12] James Pustejovsky, et al. ECAT: Event Capture Annotation Tool, 2016, ArXiv.
[13] Richard Sproat, et al. WordsEye: an automatic text-to-scene conversion system, 2001, SIGGRAPH.
[14] James Pustejovsky, et al. The Qualitative Spatial Dynamics of Motion in Language, 2011, Spatial Cogn. Comput.
[15] Michael S. Bernstein, et al. Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations, 2016, International Journal of Computer Vision.
[16] Pietro Perona, et al. Describing Common Human Visual Actions in Images, 2015, BMVC.
[17] Sheena Rogers, et al. Reasons for Realism: Selected Essays of James J. Gibson ed. by Edward Reed, Rebecca Jones (review), 2017.
[18] Will Goldstone. Unity Game Development Essentials, 2009.
[19] James Pustejovsky, et al. VoxML: A Visualization Modeling Language, 2016, LREC.
[20] James Pustejovsky, et al. Interpreting Motion - Grounded Representations for Spatial Language, 2012, Explorations in language and space.
[21] Ali Farhadi, et al. Situation Recognition: Visual Semantic Role Labeling for Image Understanding, 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[22] Gang Wang, et al. NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis, 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[23] J. Jacko, et al. The human-computer interaction handbook: fundamentals, evolving technologies and emerging applications, 2002.
[24] James Pustejovsky, et al. The Generative Lexicon, 1995, CL.
[25] Anupam Agrawal, et al. Vision based hand gesture recognition for human computer interaction: a survey, 2012, Artificial Intelligence Review.
[26] James Pustejovsky, et al. Where Things Happen: On the Semantics of Event Localization, 2013.
[27] Matthew Turk, et al. Multimodal interaction: A review, 2014, Pattern Recognit. Lett.
[28] James Pustejovsky, et al. Multimodal Semantic Simulations of Linguistically Underspecified Motion Events, 2016, Spatial Cognition.
[29] Frank Keller, et al. Unsupervised Visual Sense Disambiguation for Verbs using Multimodal Embeddings, 2016, NAACL.
[30] J. Yolton. Reasons for Realism. Selected Essays of James J. Gibson. Edited by EDWARD REED and REBECCA JONES. New Jersey: Lawrence Erlbaum Associates, 1982. Pp. xvi + 449. $39.95, 1984.
[31] James Pustejovsky, et al. Generating Simulations of Motion Events from Verbal Descriptions, 2014, *SEMEVAL.
[32] James Pustejovsky, et al. On the Representation of Inferences and their Lexicalization, 2013.