Paper information - Deep embodiment: grounding semantics in perceptual modalities

Deep embodiment: grounding semantics in perceptual modalities

Multi-modal distributional semantic models address the fact that text-based semantic models, which represent word meanings as a distribution over other words, suffer from the grounding problem. This thesis advances the field of multi-modal semantics in two directions. First, it shows that transferred convolutional neural network representations outperform the traditional bag of visual words method for obtaining visual features. It is then shown that these representations may be applied successfully to various natural language processing tasks. Second, it performs the first ever experiments with grounding in the non-visual modalities of auditory and olfactory perception using raw data. Deep learning, a natural fit for deriving grounded representations, is used to obtain the highest-quality representations compared to more traditional approaches. Multi-modal representation learning leads to improvements over language-only models in a variety of tasks. If we want to move towards human-level artificial intelligence, we will need to build multi-modal models that represent the full complexity of human meaning, including its grounding in our various perceptual modalities.

Douwe Kiela
