Comparing Data Sources and Architectures for Deep Visual Representation Learning in Semantics
暂无分享,去创建一个
[1] Andrew Zisserman,et al. Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.
[2] George A. Miller,et al. WordNet: A Lexical Database for English , 1995, HLT.
[3] Elia Bruni,et al. Multimodal Distributional Semantics , 2014, J. Artif. Intell. Res..
[4] Marie-Francine Moens,et al. Multi-Modal Representations for Improved Bilingual Lexicon Learning , 2016, ACL.
[5] Dumitru Erhan,et al. Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[6] Roi Reichart,et al. Judgment Language Matters: Multilingual Vector Space Models for Judgment Language Aware Lexical Semantics , 2015, ArXiv.
[7] Douwe Kiela. MMFeat: A Toolkit for Extracting Multi-Modal Features , 2016, ACL.
[8] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.
[9] Stevan Harnad,et al. Symbol grounding problem , 1990, Scholarpedia.
[10] Michael S. Bernstein,et al. ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.
[11] J. Bullinaria,et al. Extracting semantic representations from word co-occurrence statistics: A computational study , 2007, Behavior research methods.
[12] Stephen Clark,et al. Visual Bilingual Lexicon Induction with Transferred ConvNet Features , 2015, EMNLP.
[13] Patrick Pantel,et al. From Frequency to Meaning: Vector Space Models of Semantics , 2010, J. Artif. Intell. Res..
[14] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[15] Li Fei-Fei,et al. ImageNet: A large-scale hierarchical image database , 2009, CVPR.
[16] Pietro Perona,et al. Learning object categories from Google's image search , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.
[17] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.
[18] Shimon Ullman,et al. Do You See What I Mean? Visual Resolution of Linguistic Ambiguities , 2015, EMNLP.
[19] Stephen Clark,et al. Vision and Feature Norms: Improving automatic feature norm learning through cross-modal maps , 2016, NAACL.
[20] Geoffrey E. Hinton,et al. Deep Learning , 2015, Nature.
[21] Geoffrey E. Hinton,et al. Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.
[22] Jeffrey Dean,et al. Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.
[23] Sabine Schulte im Walde,et al. A Multimodal LDA Model integrating Textual, Cognitive and Visual Modalities , 2013, EMNLP.
[24] David G. Lowe,et al. Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.
[25] Jean Maillard,et al. Black Holes and White Rabbits: Metaphor Identification with Visual Features , 2016, NAACL.
[26] Stephen Clark,et al. Exploiting Image Generality for Lexical Entailment Detection , 2015, ACL.
[27] Randy Goebel,et al. Using Visual Information to Predict Lexical Preference , 2011, RANLP.
[28] Felix Hill,et al. SimLex-999: Evaluating Semantic Models With (Genuine) Similarity Estimation , 2014, CL.
[29] Ilya Kostrikov,et al. PlaNet - Photo Geolocation with Convolutional Neural Networks , 2016, ECCV.
[30] Carina Silberer,et al. Learning Grounded Meaning Representations with Autoencoders , 2014, ACL.
[31] Nitish Srivastava,et al. Improving neural networks by preventing co-adaptation of feature detectors , 2012, ArXiv.
[32] Stephen Clark,et al. Vector Space Models of Lexical Meaning , 2015 .
[33] Léon Bottou,et al. Learning Image Embeddings using Convolutional Neural Networks for Improved Multi-Modal Semantics , 2014, EMNLP.
[34] Benjamin Van Durme,et al. Learning Bilingual Lexicons Using the Visual Similarity of Labeled Web Images , 2011, IJCAI.
[35] Gemma Boleda,et al. Distributional Semantics in Technicolor , 2012, ACL.
[36] Stephen Clark,et al. A Systematic Study of Semantic Vector Space Model Parameters , 2014, CVSC@EACL.
[37] Laura A. Dabbish,et al. Labeling images with a computer game , 2004, AAAI Spring Symposium: Knowledge Collection from Volunteer Contributors.
[38] Angeliki Lazaridou,et al. Combining Language and Vision with a Multimodal Skip-gram Model , 2015, NAACL.
[39] Qiang Chen,et al. Network In Network , 2013, ICLR.