Predicting Motivations of Actions by Leveraging Text
暂无分享,去创建一个
Antonio Torralba | Hamed Pirsiavash | Carl Vondrick | Deniz Oktay | A. Torralba | Carl Vondrick | H. Pirsiavash | Deniz Oktay
[1] Sanja Fidler,et al. Skip-Thought Vectors , 2015, NIPS.
[2] S. Carey,et al. Understanding other minds: linking developmental psychology and functional neuroimaging. , 2004, Annual review of psychology.
[3] Bolei Zhou,et al. Learning Deep Features for Scene Recognition using Places Database , 2014, NIPS.
[4] Li Fei-Fei,et al. Reasoning about Object Affordances in a Knowledge Base Representation , 2014, ECCV.
[5] George A. Miller,et al. WordNet: A Lexical Database for English , 1995, HLT.
[6] Yejin Choi,et al. Baby talk: Understanding and generating simple image descriptions , 2011, CVPR 2011.
[7] R Saxe,et al. People thinking about thinking people The role of the temporo-parietal junction in “theory of mind” , 2003, NeuroImage.
[8] ZissermanAndrew,et al. The Pascal Visual Object Classes Challenge , 2015 .
[9] H. Wimmer,et al. Beliefs about beliefs: Representation and constraining function of wrong beliefs in young children's understanding of deception , 1983, Cognition.
[10] Ella M. Atkins,et al. Human Intent Prediction Using Markov Decision Processes , 2015, J. Aerosp. Inf. Syst..
[11] Fernando De la Torre,et al. Max-Margin Early Event Detectors , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.
[12] David F. Fouhey,et al. Predicting Object Dynamics in Scenes , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[13] Jaime Valls Miró,et al. Language for learning complex human-object interactions , 2013, 2013 IEEE International Conference on Robotics and Automation.
[14] Fei-Fei Li,et al. Deep visual-semantic alignments for generating image descriptions , 2015, CVPR.
[15] Monica N. Nicolescu,et al. Deep networks for predicting human intent with respect to objects , 2012, 2012 7th ACM/IEEE International Conference on Human-Robot Interaction (HRI).
[16] Raffaella Bernardi,et al. Exploiting language models to recognize unseen actions , 2013, ICMR '13.
[17] Antonio Torralba,et al. Where are they looking? , 2015, NIPS.
[18] Thorsten Joachims,et al. Cutting-plane training of structural SVMs , 2009, Machine Learning.
[19] Gregory Shakhnarovich,et al. Diverse M-Best Solutions in Markov Random Fields , 2012, ECCV.
[20] Trevor Darrell,et al. DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition , 2013, ICML.
[21] Lucy Vanderwende,et al. Learning the Visual Interpretation of Sentences , 2013, 2013 IEEE International Conference on Computer Vision.
[22] Hema Swetha Koppula,et al. Anticipating Human Activities Using Object Affordances for Reactive Robotic Response , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[23] Margaret Mitchell,et al. VQA: Visual Question Answering , 2015, International Journal of Computer Vision.
[24] Pietro Perona,et al. Microsoft COCO: Common Objects in Context , 2014, ECCV.
[25] E. Lawler. A PROCEDURE FOR COMPUTING THE K BEST SOLUTIONS TO DISCRETE OPTIMIZATION PROBLEMS AND ITS APPLICATION TO THE SHORTEST PATH PROBLEM , 1972 .
[26] Xinlei Chen,et al. NEIL: Extracting Visual Knowledge from Web Data , 2013, 2013 IEEE International Conference on Computer Vision.
[27] Chris L. Baker,et al. Action understanding as inverse planning , 2009, Cognition.
[28] Martial Hebert,et al. Activity Forecasting , 2012, ECCV.
[29] Stefan Carlsson,et al. CNN Features Off-the-Shelf: An Astounding Baseline for Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.
[30] Song-Chun Zhu,et al. Inferring "Dark Matter" and "Dark Energy" from Videos , 2013, 2013 IEEE International Conference on Computer Vision.
[31] J.K. Aggarwal,et al. Human activity analysis , 2011, ACM Comput. Surv..
[32] Aaas News,et al. Book Reviews , 1893, Buffalo Medical and Surgical Journal.
[33] Deva Ramanan,et al. Dual coordinate solvers for large-scale structural SVMs , 2013, ArXiv.
[34] Hao Su,et al. Object Bank: A High-Level Image Representation for Scene Classification & Semantic Feature Sparsification , 2010, NIPS.
[35] Koby Crammer,et al. On the Algorithmic Implementation of Multiclass Kernel-based Vector Machines , 2002, J. Mach. Learn. Res..
[36] Ronald Poppe,et al. A survey on vision-based human action recognition , 2010, Image Vis. Comput..
[37] C. Lawrence Zitnick,et al. Bringing Semantics into Focus Using Visual Abstraction , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.
[38] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.
[39] Sanja Fidler,et al. MovieQA: Understanding Stories in Movies through Question-Answering , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[40] Kenneth Heafield,et al. KenLM: Faster and Smaller Language Model Queries , 2011, WMT@EMNLP.
[41] Martial Hebert,et al. Patch to the Future: Unsupervised Visual Prediction , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[42] Darius Burschka,et al. Predicting human intention in visual observations of hand/object interactions , 2013, 2013 IEEE International Conference on Robotics and Automation.
[43] Ali Farhadi,et al. Learning Everything about Anything: Webly-Supervised Visual Concept Learning , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[44] Jos Elfring,et al. Learning intentions for improved human motion prediction , 2013, 2013 16th International Conference on Advanced Robotics (ICAR).
[45] Kenneth Heafield,et al. N-gram Counts and Language Models from the Common Crawl , 2014, LREC.
[46] Samy Bengio,et al. Show and tell: A neural image caption generator , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[47] Song-Chun Zhu,et al. C^4: Exploring Multiple Solutions in Graphical Models by Cluster Sampling , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[48] Katja Markert,et al. Learning Models for Object Recognition from Natural Language Descriptions , 2009, BMVC.
[49] Song-Chun Zhu,et al. Visual Persuasion: Inferring Communicative Intents of Images , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.