Distal prosody influences lexical interpretation in online sentence processing

Distal prosody influences lexical interpretation in online sentence processing Meredith Brown (mbrown@bcs.rochester.edu) Department of Brain & Cognitive Sciences, University of Rochester Meliora Hall, Box 270268, Rochester, NY 14627-0268 Anne Pier Salverda (asalverda@bcs.rochester.edu) Department of Brain & Cognitive Sciences, University of Rochester Meliora Hall, Box 270268, Rochester, NY 14627-0268 Laura C. Dilley (ldilley@msu.edu) Department of Communicative Sciences & Disorders, Michigan State University 116 Oyer, East Lansing, MI 48824 Michael K. Tanenhaus (mtan@bcs.rochester.edu) Department of Brain & Cognitive Sciences, University of Rochester Meliora Hall, Box 270268, Rochester, NY 14627-0268 Abstract Previous work examining prosodic cues in online spoken word recognition has focused primarily on local cues to word identity. However, recent studies have suggested that sentence-level prosodic patterns can also influence the downstream interpretation of lexically ambiguous syllables (Dilley & McAuley, 2008; Dilley, Mattys, & Vinke, 2010). To test the hypothesis that these distal prosody effects are based on expectations about the organization of upcoming material, we conducted a visual world experiment using fixations to competing alternatives such as pan and panda, which differ in the presence or absence of a prosodic boundary after pan(-). As predicted, the acoustic properties of distal sentence material affected the proportion of fixations to the monosyllabic competitor beginning 200 ms after the onset of the target word. These findings support the hypothesis that expectations based on perceived prosodic patterns in the distal context influence lexical segmentation and recognition. Keywords: Prosody; spoken word recognition; lexical competition; perceptual organization; visual world paradigm Introduction Expectation and prediction are increasingly playing a major role in models of language processing (e.g., Levy, 2008; Ju- rafsky, 1996; Hale, 2001), and effects of expectations based on lexical and structural properties of sentences are well doc- umented (reviewed in Kamide, 2008). However, another pos- sible source of expectations that may be useful in language processing are global prosodic patterns, including prosodic phenomena that are distal to (i.e., several syllables removed from) the locus of processing. Such patterns may also in- fluence the interpretation of proximal sentence material by contributing to listeners’ expectations about the lexical struc- ture of upcoming material. Most work at the interface of prosody and spoken word recognition to date has focused on the effects of local prosodic cues to word identity, such as stress, duration, and the strength of local prosodic bound- aries (Cutler & van Donselaar, 2001; Salverda, Dahan, & McQueen, 2003; Christophe, Peperkamp, Pallier, Block, & Mehler, 2004). In this paper, we provide evidence that dis- tal prosody generates expectations based on perceived global prosodic patterns, and that these expectations constrain word segmentation and recognition. Background Previous work using non-linguistic auditory stimuli ranging from simple tone sequences to musical passages has shown that listeners detect pitch, temporal, and/or amplitude pat- terning in distal (i.e., non-local) auditory context. Such pat- terning influences the metrical and grouping structures that listeners perceive downstream (reviewed in Handel, 1989; Huron, 2006). For example, listeners tend to perceptually organize a continuous stream of tones of equal duration, tem- poral separation and amplitude that differ only in terms of pitch height (as high or low pitched) into repeating sequences of either high-low or low-high perceived groups, with the first element in each group perceived as accented (Woodrow, 1911; Thomassen, 1982). Although high tones are more fre- quently perceived as more prominent than low tones, the per- ceived relative prominence of high and low tones within each bitonal group depends on the context. To describe such per- ceptual proclivities in the domain of music processing, Ler- dahl and Jackendoff (1983) proposed parallelism preference rules, which state that when a musical passage contains seg- ments that are perceived as similar or repetitive, parallel parts of segments will be construed as having similar metrical and grouping structure. These studies suggest that distal regu- larities in the acoustic characteristics of auditory stimuli can influence the processing of proximal material. Reports from linguistics suggest that prosodic context of- ten exhibits regularities in pitch, duration, and/or ampli- tude. In particular, speech intonation and rhythm often show characteristics that listeners perceive as patterning (Couper-

[1]  Tom Barney English Speech Rhythm: Form and Function in Everyday Verbal Interaction , 1994 .

[2]  Laura C. Dilley,et al.  Altering Context Speech Rate Can Cause Words to Appear or Disappear , 2010, Psychological science.

[3]  A. Christophea,et al.  Phonological phrase boundaries constrain lexical access I . Adult data q , 2003 .

[4]  Paul Boersma,et al.  Praat, a system for doing phonetics by computer , 2002 .

[5]  L. Holt Temporally Nonadjacent Nonlinguistic Sounds Affect Speech Categorization , 2005, Psychological science.

[6]  D. Mirman,et al.  Statistical and computational models of the visual world paradigm: Growth curves and individual differences. , 2008, Journal of memory and language.

[7]  A. Cutler,et al.  Voornaam is not (really) a Homophone: Lexical Prosody and Lexical Access in Dutch , 2001, Language and speech.

[8]  Laura C. Dilley,et al.  Distal prosodic context affects word segmentation and lexical processing , 2008 .

[9]  Eric Moulines,et al.  Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones , 1989, Speech Commun..

[10]  J. McQueen,et al.  Speaking rate from proximal and distal contexts is used during word segmentation. , 2011, Journal of experimental psychology. Human perception and performance.

[11]  Anne Pier Salverda,et al.  The role of prosodic boundaries in the resolution of lexical embedding in speech comprehension , 2003, Cognition.

[12]  Daniel Jurafsky,et al.  A Probabilistic Model of Lexical and Syntactic Access and Disambiguation , 1996, Cogn. Sci..

[13]  H. Woodrow,et al.  The role of pitch in rhythm. , 1911 .

[14]  R. Jackendoff,et al.  A Generative Theory of Tonal Music , 1985 .

[15]  J. Mehler,et al.  Phonological phrase boundaries constrain lexical access II. Infant data , 2004 .

[16]  Audra Dainora,et al.  An empirically based probabilistic model of intonation in American English , 2001 .

[17]  J. Thomassen Melodic accent: Experiments and a tentative model , 1982 .

[18]  E. Couper-Kuhlen English speech rhythm , 1993 .

[19]  Julie C. Sedivy,et al.  Subject Terms: Linguistics Language Eyes & eyesight Cognition & reasoning , 1995 .

[20]  R. Levy Expectation-based syntactic comprehension , 2008, Cognition.

[21]  S. Handel,et al.  Listening: An Introduction to the Perception of Auditory Events , 1993 .

[22]  M. Pearce,et al.  Sweet Anticipation : Music and the Psychology of Expectation , 2007 .

[23]  Yuki Kamide,et al.  Anticipatory Processes in Sentence Processing , 2008, Lang. Linguistics Compass.

[24]  R. Collier Prosodic Systems and Intonation in English , 1969 .

[25]  M. Pickering,et al.  Architectures and Mechanisms for Language Processing , 1999 .

[26]  Laura C. Dilley,et al.  Potent prosody: Comparing the effects of distal prosody, proximal prosody, and semantic context on word segmentation☆ , 2010 .

[27]  Janet B. Pierrehumbert,et al.  Tonal Elements and Their Alignment , 2000 .

[28]  John Hale,et al.  A Probabilistic Earley Parser as a Psycholinguistic Model , 2001, NAACL.