An Empirical Analysis of the Correlation of Syntax and Prosody

The relation of syntax and prosody (the syntax--prosody interface) has been an active area of research, mostly in linguistics and typically studied under controlled conditions. More recently, prosody has also been successfully used in the data-based training of syntax parsers. However, there is a gap between the controlled and detailed study of the individual effects between syntax and prosody and the large-scale application of prosody in syntactic parsing with only a shallow analysis of the respective influences. In this paper, we close the gap by investigating the significance of correlations of prosodic realization with specific syntactic functions using linear mixed effects models in a very large corpus of read-out German encyclopedic texts. Using this corpus, we are able to analyze prosodic structuring performed by a diverse set of speakers while they try to optimize factual content delivery. After normalization by speaker, we obtain significant effects, e.g. confirming that the subject function, as compared to the object function, has a positive effect on pitch and duration of a word, but a negative effect on loudness.

[1]  S. Nooteboom,et al.  THE PROSODY OF SPEECH: MELODY AND RHYTHM , 2001 .

[2]  Mattias Heldner,et al.  Exploring the prosody-syntax interface in conversations , 2003 .

[3]  F. Grosjean,et al.  Analyse contrastive des variables temporelles de l’anglais et du français: vitesse de parole et variables composantes, phénomènes d’hésitation , 1975 .

[4]  Cheryl M. Beach,et al.  The interpretation of prosodic patterns at points of syntactic structure ambiguity: Evidence for cue trading relations☆ , 1991 .

[5]  Angela D. Friederici,et al.  It's Early: Event-related Potential Evidence for Initial Interaction of Syntax and Prosody in Speech Comprehension , 2006, Journal of Cognitive Neuroscience.

[6]  Mari Ostendorf,et al.  The use of prosody in syntactic disambiguation , 1991 .

[7]  Mari Ostendorf,et al.  Parsing Speech: a Neural Approach to Integrating Lexical and Acoustic-Prosodic Information , 2017, NAACL.

[8]  Matthew Lease,et al.  Effective Use of Prosody in Parsing Conversational Speech , 2005, HLT.

[9]  Gaëlle Ferré,et al.  Prosodic boundaries in subordinate syntactic constructions , 2016 .

[10]  Andrew Rosenberg,et al.  AutoBI - a tool for automatic toBI annotation , 2010, INTERSPEECH.

[11]  Harlan Lane,et al.  The patterns of silence: Performance structures in sentence production , 1979, Cognitive Psychology.

[12]  Timo Baumann,et al.  The Spoken Wikipedia Corpus collection: Harvesting, alignment and an application to hyperlistening , 2019, Lang. Resour. Evaluation.

[13]  Hugo Quené On the just‐noticeable difference for tempo in speech , 2004 .

[14]  D. Bates,et al.  Fitting Linear Mixed-Effects Models Using lme4 , 2014, 1406.5823.

[15]  Markus Brauer,et al.  Linear Mixed-Effects Models and the Analysis of Nonindependent Data: A Unified Framework to Analyze Categorical and Continuous Independent Variables that Vary Within-Subjects and/or Within-Items , 2017, Psychological methods.

[16]  M. Grice,et al.  The role of prosody in the interpretation of structural ambiguities: A study of anticipatory eye movements , 2006, Cognition.

[17]  Andreas Stolcke,et al.  Prosody Modeling for Automatic Speech Recognition and Understanding , 2004 .

[18]  Sampo Pyysalo,et al.  Universal Dependencies v1: A Multilingual Treebank Collection , 2016, LREC.

[19]  Eugene Charniak,et al.  Sentence-Internal Prosody Does not Help Parsing the Way Punctuation Does , 2004, NAACL.

[20]  Mark Steedman,et al.  The NXT-format Switchboard Corpus: a rich resource for investigating the syntax, semantics, pragmatics and prosody of dialogue , 2010, Lang. Resour. Evaluation.

[21]  Wolfgang Menzel,et al.  Because Size Does Matter: The Hamburg Dependency Treebank , 2014, LREC.

[22]  Marc Schröder,et al.  The German Text-to-Speech Synthesis System MARY: A Tool for Research, Development and Teaching , 2003, Int. J. Speech Technol..

[23]  Noah A. Smith,et al.  Turning on the Turbo: Fast Third-Order Non-Projective Turbo Parsers , 2013, ACL.