Imageability Estimation using Visual and Language Features

Imageability is a concept from Psycholinguistics quantizing the human perception of words. However, existing datasets are created through subjective experiments and are thus very small. Therefore, methods to automatically estimate the imageability can be helpful. For an accurate automatic imageability estimation, we extend the idea of a psychological hypothesis called Dual-Coding Theory, that discusses the connection of our perception towards visual information and language information, and also focus on the relationship between the pronunciation of a word and its imageability. In this research, we propose a method to estimate imageability of words using both visual and language features extracted from corresponding data. For the estimation, we use visual features extracted from low- and high-level image features, and language features extracted from textual features and phonetic features of words. Evaluations show that our proposed method can estimate imageability more accurately than comparative methods, implying the contribution of each feature to the imageability.

[1]  David A. Shamma,et al.  YFCC100M , 2015, Commun. ACM.

[2]  Ali Farhadi,et al.  YOLO9000: Better, Faster, Stronger , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Mingda Zhang,et al.  Equal But Not The Same: Understanding the Implicit Relationship Between Persuasive Images and Text , 2018, BMVC.

[4]  Hiroshi Murase,et al.  Estimating the imageability of words by mining visual characteristics from crawled image data , 2020, Multimedia Tools and Applications.

[5]  A. Paivio,et al.  Concreteness, imagery, and meaningfulness values for 925 nouns. , 1968, Journal of experimental psychology.

[6]  Allan Paivio,et al.  Dual Coding Theory , 1990 .

[7]  Anita Peti-Stantic,et al.  Predicting Concreteness and Imageability of Words Within and Across Languages via Word Embeddings , 2018, Rep4NLP@ACL.

[8]  Jamie Reilly,et al.  Formal Distinctiveness of High- and Low-Imageability Nouns: Analyses and Theoretical Implications , 2007, Cogn. Sci..

[9]  Michael J Cortese,et al.  Imageability ratings for 3,000 monosyllabic words , 2004, Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc.

[10]  Benjamin Naumann,et al.  Mental Representations A Dual Coding Approach , 2016 .

[11]  Tomas Mikolov,et al.  Advances in Pre-Training Distributed Word Representations , 2017, LREC.

[12]  Tomas Mikolov,et al.  Enriching Word Vectors with Subword Information , 2016, TACL.

[13]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[14]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.