Sound–meaning association biases evidenced across thousands of languages

Significance The independence between sound and meaning is believed to be a crucial property of language: across languages, sequences of different sounds are used to express similar concepts (e.g., Russian “ptitsa,” Swahili “ndege,” and Japanese “tori” all mean “bird”). However, a careful statistical examination of words from nearly two-thirds of the world’s languages reveals that unrelated languages very often use (or avoid) the same sounds for specific referents. For instance, words for tongue tend to have l or u, “round” often appears with r, and “small” with i. These striking similarities call for a reexamination of the fundamental assumption of the arbitrariness of the sign. It is widely assumed that one of the fundamental properties of spoken language is the arbitrary relation between sound and meaning. Some exceptions in the form of nonarbitrary associations have been documented in linguistics, cognitive science, and anthropology, but these studies only involved small subsets of the 6,000+ languages spoken in the world today. By analyzing word lists covering nearly two-thirds of the world’s languages, we demonstrate that a considerable proportion of 100 basic vocabulary items carry strong associations with specific kinds of human speech sounds, occurring persistently across continents and linguistic lineages (linguistic families or isolates). Prominently among these relations, we find property words (“small” and i, “full” and p or b) and body part terms (“tongue” and l, “nose” and n). The areal and historical distribution of these associations suggests that they often emerge independently rather than being inherited or borrowed. Our results therefore have important implications for the language sciences, given that nonarbitrary associations have been proposed to play a critical role in the emergence of cross-modal mappings, the acquisition of language, and the evolution of our species’ unique communication system.

[1]  Simon J. Greenhill,et al.  Language Phylogenies Reveal Expansion Pulses and Pauses in Pacific Settlement , 2009, Science.

[2]  Robert Forkel,et al.  Glottolog 2.0 , 2013 .

[3]  Gerhard Jäger,et al.  Support for linguistic macrofamilies from weighted sequence alignment , 2015, Proceedings of the National Academy of Sciences.

[4]  D. Maurer,et al.  The shape of boubas: sound-shape correspondences in toddlers and adults. , 2006, Developmental science.

[5]  Edith A. Moravcsik,et al.  Universals of Human Language, II: Phonology , 1980 .

[6]  Harald Hammarström,et al.  Some principles on the use of macro-areas in typological comparison , 2014 .

[7]  Mark Dingemanse,et al.  Iconicity in the lab: a review of behavioral, developmental, and neuroimaging research into sound-symbolism , 2015, Front. Psychol..

[8]  L. Nygaard,et al.  Sound to meaning correspondences facilitate word learning , 2009, Cognition.

[9]  P. Sham,et al.  A note on the calculation of empirical P values from Monte Carlo procedures. , 2002, American journal of human genetics.

[10]  F. D. Saussure Cours de linguistique générale , 1924 .

[11]  Lyle Campbell,et al.  Language Classification: Frontmatter , 2008 .

[12]  R. Jakobson Why "Mama" and "Papa"? , 1972 .

[13]  David Sankoff,et al.  Time Warps, String Edits, and Macromolecules: The Theory and Practice of Sequence Comparison , 1983 .

[14]  P. MacNeilage,et al.  On the origin of internal structure of word forms. , 2000, Science.

[15]  Edith A. Moravcsik,et al.  Universals of human language , 1978 .

[16]  Søren Wichmann,et al.  Sound Symbolism in Basic Vocabulary , 2010, Entropy.

[17]  B. Bergen The Psychological Reality of Phonaesthemes , 2004 .

[18]  W. Köhler Gestalt psychology , 1967 .

[19]  Søren Wichmann,et al.  Explorations in automated language classification , 2008 .

[20]  Sue Gawned The language of classification , 1989 .

[21]  Jamie Reilly,et al.  Non-Arbitrariness in Mapping Word Form to Meaning: Cross-Linguistic Formal Markers of Word Concreteness , 2017, Cogn. Sci..

[22]  J. Algeo Words and Rules: The Ingredients of Language , 2000 .

[23]  Arthur Ransom Origin of Languages , 1870, Nature.

[24]  Morten H. Christiansen,et al.  The arbitrariness of the sign: learning advantages from the structure of the vocabulary. , 2011, Journal of experimental psychology. General.

[25]  Simon Kirby,et al.  How arbitrary is language? , 2014, Philosophical Transactions of the Royal Society B: Biological Sciences.

[26]  Nicoleta Bateman,et al.  On the Typology of Palatalization , 2011, Lang. Linguistics Compass.

[27]  V. Ramachandran,et al.  Synaesthesia? A window into perception, thought and language , 2001 .

[28]  Francisco Torreira,et al.  Is “Huh?” a Universal Word? Conversational Infrastructure and the Convergent Evolution of Linguistic Items , 2013, PloS one.

[29]  M. Urban Conventional sound symbolism in terms for organs of speech: A cross-linguistic study , 2011 .

[30]  M. Swadesh Towards Greater Accuracy in Lexicostatistic Dating , 1955, International Journal of American Linguistics.

[31]  Michael Smithson,et al.  A better lemon squeezer? Maximum-likelihood regression with beta-distributed dependent variables. , 2006, Psychological methods.

[32]  S. Kita,et al.  The sound symbolism bootstrapping hypothesis for language acquisition and language evolution , 2014, Philosophical Transactions of the Royal Society B: Biological Sciences.

[33]  P. Luce,et al.  Phonological Neighborhood Effects in Spoken Word Perception and Production , 2016 .

[34]  J. Nuckolls THE CASE FOR SOUND SYMBOLISM , 1999 .

[35]  Shane S. Sturrock,et al.  Time Warps, String Edits, and Macromolecules – The Theory and Practice of Sequence Comparison . David Sankoff and Joseph Kruskal. ISBN 1-57586-217-4. Price £13.95 (US$22·95). , 2000 .

[36]  Michael Cysouw,et al.  A Pipeline for Computational Historical Linguistics , 2011 .

[37]  Andrew Meade,et al.  Ultraconserved words point to deep language ancestry across Eurasia , 2013, Proceedings of the National Academy of Sciences.

[38]  Korbinian Strimmer,et al.  fdrtool: a versatile R package for estimating local and tail area-based false discovery rates , 2008, Bioinform..

[39]  Sharon Peperkamp,et al.  Consonants are More Important than Vowels in the Bouba-kiki Effect , 2015, Language and speech.

[40]  Mark Dingemanse,et al.  Advances in the Cross-Linguistic Study of Ideophones , 2012, Lang. Linguistics Compass.

[41]  J. Zlatev,et al.  Motivations for Sound Symbolism in Spatial Deixis: A Typological Study of 101 Languages , 1970 .

[42]  Uri Tadmor,et al.  Borrowability and the notion of basic vocabulary , 2012 .

[43]  S. Kirby,et al.  Synesthesia, Cross-Modality, and Language Evolution , 2013 .

[44]  Julia Simner,et al.  What is the link between synaesthesia and sound symbolism? , 2015, Cognition.

[45]  Claire Bowern,et al.  Sound Symbolism in the Languages of Australia , 2014, PloS one.

[46]  Cecil H. Brown,et al.  Sound Correspondences in the World's Languages , 2013 .

[47]  S. Pinker Words and Rules: The Ingredients of Language , 1999 .

[48]  Morten H. Christiansen,et al.  Arbitrariness, Iconicity, and Systematicity in Language , 2015, Trends in Cognitive Sciences.

[49]  William J. Poser,et al.  Language Classification: Acknowledgments , 2008 .

[50]  Morten H. Christiansen,et al.  The phonological-distributional coherence hypothesis: Cross-linguistic evidence in language acquisition , 2007, Cognitive Psychology.

[51]  W. Heeringa,et al.  Perceptive evaluation of Levenshtein dialect distance measurements using Norwegian dialect data , 2004, Language Variation and Change.

[52]  Simon J. Greenhill,et al.  Rate of language evolution is affected by population size , 2015, Proceedings of the National Academy of Sciences.

[53]  R. Paget The Origin of Speech , 1927, Nature.

[54]  G. Lupyan,et al.  Iconicity in English and Spanish and Its Relation to Lexical Category and Age of Acquisition , 2015, PloS one.

[55]  P. Lewis Ethnologue : languages of the world , 2009 .

[56]  D. Napoli,et al.  On the linguistic effects of articulatory ease, with a focus on sign languages , 2014 .

[57]  Simon J. Greenhill,et al.  Evolved structure of language shows lineage-specific trends in word-order universals , 2011, Nature.

[58]  M. Proctor,et al.  Gestural characterization of a phonological class: The liquids , 2009 .

[59]  D. Rendall,et al.  The sound of round: evaluating the sound-symbolic role of consonants in the classic Takete-Maluma phenomenon. , 2011, Canadian journal of experimental psychology = Revue canadienne de psychologie experimentale.

[60]  David Gil,et al.  The World Atlas of Language Structures , 2005 .

[61]  M. Ruhlen On the Origin of Languages: Studies in Linguistic Taxonomy , 1994 .

[62]  T. Matsuzawa,et al.  Visuoauditory mappings between high luminance and high pitch are shared by chimpanzees (Pan troglodytes) and humans , 2011, Proceedings of the National Academy of Sciences.