Semantic Image Retrieval via Active Grounding of Visual Situations
[1] Jitendra Malik, et al. Visual Semantic Role Labeling, 2015, ArXiv.
[2] Ali Farhadi, et al. Situation Recognition: Visual Semantic Role Labeling for Image Understanding, 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[3] Guodong Guo, et al. A survey on still image based human action recognition, 2014, Pattern Recognit.
[4] Yee Whye Teh, et al. Searching for objects driven by context, 2012, NIPS.
[5] Trevor Darrell, et al. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation, 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[6] Michael S. Bernstein, et al. Image retrieval using scene graphs, 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[7] Fei-Fei Li, et al. What, where and who? Classifying events by scene and object recognition, 2007, 2007 IEEE 11th International Conference on Computer Vision.
[8] Yu Qiao, et al. Object-Scene Convolutional Neural Networks for event recognition in images, 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).
[9] Dana H. Ballard, et al. Animate Vision, 1991, Artif. Intell.
[10] Alan L. Yuille, et al. Generation and Comprehension of Unambiguous Object Descriptions, 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[11] Dumitru Erhan, et al. Show and Tell: Lessons Learned from the 2015 MSCOCO Image Captioning Challenge, 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[12] Abel Gonzalez-Garcia, et al. An active search strategy for efficient object class detection, 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[13] Gregory J. Zelinsky, et al. Scene context guides eye movements during visual search, 2006, Vision Research.
[14] C. Summerfield, et al. Expectation (and attention) in visual cognition, 2009, Trends in Cognitive Sciences.
[15] Yoshua Bengio, et al. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention, 2015, ICML.
[16] Larry S. Davis, et al. Modeling Context Between Objects for Referring Expression Understanding, 2016, ECCV.
[17] Trevor Darrell, et al. Grounding of Textual Phrases in Images by Reconstruction, 2015, ECCV.
[18] Koray Kavukcuoglu, et al. Visual Attention, 2020, Computational Models for Cognitive Vision.
[19] Alex Graves, et al. Recurrent Models of Visual Attention, 2014, NIPS.
[20] Michael S. Bernstein, et al. Visual Relationship Detection with Language Priors, 2016, ECCV.
[21] Luc Van Gool, et al. The Pascal Visual Object Classes (VOC) Challenge, 2010, International Journal of Computer Vision.
[22] Pietro Perona, et al. Microsoft COCO: Common Objects in Context, 2014, ECCV.
[23] Li Fei-Fei, et al. ImageNet: A large-scale hierarchical image database, 2009, CVPR.
[24] Melanie Mitchell, et al. The Copycat project: a model of mental fluidity and analogy-making, 1995.
[25] Vladlen Koltun, et al. Geodesic Object Proposals, 2014, ECCV.
[26] Michael S. Bernstein, et al. Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations, 2016, International Journal of Computer Vision.
[27] Svetlana Lazebnik, et al. Active Object Localization with Deep Reinforcement Learning, 2015, 2015 IEEE International Conference on Computer Vision (ICCV).