Linearisation during language production: evidence from scene meaning and saliency maps

ABSTRACT Speaking (1989) inspired research on topics such as word selection, syntactic formulation, and dialogue, but an issue that remains understudied is linearisation: the question of how speakers organise a series of utterances into a coherent sequence, allowing both speaker and listener to keep track of what has been said and what will come next. In this paper we describe a new line of research investigating linearisation during scene description tasks, and we argue that, as Pim Levelt suggested in 1981 and in the 1989 book, the need to linearise arises from attentional constraints in the language system. Our work shows that attentional, visual, and linguistic processes are flexibly coordinated during scene descriptions, and that speakers not only respond to what the eye sees, but also to what the mind anticipates finding in the visual world.

[1]  J. Henderson Gaze Control as Prediction , 2017, Trends in Cognitive Sciences.

[2]  Antje S. Meyer,et al.  Syntactic flexibility and planning scope: the effect of verb bias on advance planning during sentence recall , 2014, Front. Psychol..

[3]  J. K. Bock Syntactic persistence in language production , 1986, Cognitive Psychology.

[4]  K. Bock,et al.  Framing sentences , 1990, Cognition.

[5]  Zenzi M. Griffin,et al.  Structural Priming as Implicit Learning: A Comparison of Models of Sentence Production , 2000, Journal of psycholinguistic research.

[6]  Brad Wyble,et al.  Detecting meaning in RSVP at 13 ms per picture , 2013, Attention, perception & psychophysics.

[7]  A. Maes,et al.  Who is where referred to how, and why? The influence of visual saliency on referent accessibility in spoken language production , 2013 .

[8]  Antonio Torralba,et al.  Contextual guidance of eye movements and attention in real-world scenes: the role of global features in object search. , 2006, Psychological review.

[9]  Zenzi M. Griffin,et al.  Why Look? Reasons for Eye Movements Related to Language Production. , 2004 .

[10]  Fernanda Ferreira,et al.  Scene Perception for Psycholinguists. , 2004 .

[11]  Kathryn Bock,et al.  Exploring Levels of Processing in Sentence Production , 1987 .

[12]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[13]  Holly P. Branigan,et al.  Parallel processing in language production , 2014 .

[14]  Micha Elsner,et al.  Visual Complexity and Its Effects on Referring Expression Generation. , 2018, Cognitive science.

[15]  Miguel P Eckstein,et al.  Temporal and peripheral extraction of contextual cues from scenes during visual search. , 2017, Journal of vision.

[16]  Max-Planck-Institutfuir Psycholinguistik The speaker's linearization problem , 2016 .

[17]  M. Tanenhaus,et al.  Watching the eyes when talking about size: An investigation of message formulation and utterance planning , 2006 .

[18]  D. E. Irwin,et al.  Minding the clock , 2003 .

[19]  M. Garrett Processes in language production , 1988 .

[20]  J. Trueswell,et al.  Getting the gist of events: recognition of two-participant actions from brief displays. , 2013, Journal of experimental psychology. General.

[21]  Brian McMahan,et al.  Why are the batteries in the microwave?: Use of semantic information under uncertainty in a search task , 2016, Cognitive research: principles and implications.

[22]  K Ball,et al.  Putting first things first. , 1999, Today's surgical nurse.

[23]  Lester C. Loschky,et al.  The cognitive systems of visual and multimodal narratives , 2018, CogSci.

[24]  M. Chun,et al.  Contextual cueing of visual attention , 2022 .

[25]  Taylor R. Hayes,et al.  Meaning-based guidance of attention in scenes as revealed by meaning maps , 2017, Nature Human Behaviour.

[26]  L. Gleitman,et al.  On the give and take between event apprehension and utterance formulation. , 2007, Journal of memory and language.

[27]  Gwendolyn Rehrig,et al.  Meaning Guides Attention during Real-World Scene Description , 2018, Scientific Reports.

[28]  Taylor R. Hayes,et al.  Meaning guides attention in real-world scene images: Evidence from eye movements and meaning maps , 2017, bioRxiv.

[29]  Christoph Scheepers,et al.  Visual Attention and Structural Choice in Sentence Production Across Languages , 2011, Lang. Linguistics Compass.

[30]  H. H. Clark Speech errors as linguistic evidence. , 1975 .

[31]  Pietro Perona,et al.  Graph-Based Visual Saliency , 2006, NIPS.

[32]  C. Koch,et al.  Computational modelling of visual attention , 2001, Nature Reviews Neuroscience.

[33]  F. Ferreira,et al.  How incremental is language production? Evidence from the production of utterances requiring the computation of arithmetic sums , 2002 .

[34]  W. Levelt,et al.  Speaking: From Intention to Articulation , 1990 .

[35]  Irving Biederman,et al.  On the Semantics of a Glance at a Scene , 2017 .

[36]  Maryellen C. MacDonald,et al.  How language production shapes language form and comprehension , 2012, Front. Psychol..

[37]  C. Dobel,et al.  Seeing for speaking: Semantic and lexical information provided by briefly presented, naturalistic action scenes , 2018, PloS one.

[38]  Zenzi M. Griffin,et al.  PSYCHOLOGICAL SCIENCE Research Article WHAT THE EYES SAY ABOUT SPEAKING , 2022 .

[39]  V. Ferreira,et al.  The Oxford Handbook of Language Production , 2014 .

[40]  Miguel P Eckstein,et al.  Beyond Scene Gist: Objects Guide Search More Than Scene Background , 2017, Journal of experimental psychology. Human perception and performance.

[41]  J. Henderson,et al.  The influence of color on the perception of scene gist. , 2008, Journal of experimental psychology. Human perception and performance.

[42]  Anna Papafragou,et al.  Event Structure Influences Language Production: Evidence from Structural Priming in Motion Event Description. , 2013, Journal of memory and language.

[43]  S. Brown-Schmidt,et al.  Processes of incremental message planning during conversation , 2015, Psychonomic bulletin & review.

[44]  J. Henderson,et al.  Initial scene representations facilitate eye movement guidance in visual search. , 2007, Journal of experimental psychology. Human perception and performance.

[45]  J. Henderson,et al.  Linearization strategies during language production , 1998, Memory & cognition.

[46]  Taylor R. Hayes,et al.  Meaning guides attention during scene viewing, even when it is irrelevant , 2018, Attention, perception & psychophysics.