Prosody in a corpus of French spontaneous speech: perception, annotation and prosody-syntax interaction

Our study addresses prosodic annotation and the prosody-syntax interface in conversation, and is based on a large corpus of conversational speech in French. Inter-transcriber agreement tests show that two expert transcribers are consistent in their labeling of prosodic phrasing, with agreement well above chance. A qualitative analysis reveals the transcribers' individual strategies, notably with respect to Intermediate Phrases, which are sometimes found for French in specific intonation patterns. The syntactic division of the corpus, both into syntactic chunks and into pseudo-phrases, is then analyzed in its interaction with the distribution of major prosodic breaks. In more than 60% of cases, the boundaries of pseudo-phrases co-occur with the boundaries of major prosodic units (Intonational Phrases, IPs). At the same time, 50% of IP boundaries are aligned with smaller syntactic constituents. In addition, the beginnings of Intonational Phrases are more often misaligned with syntactic constituent boundaries than their ends. We also discuss the annotation of conversational corpora in terms of prosodic units, given the specific constraints on planning and execution in spontaneous speech.
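To illustrate what agreement "above chance" means for two transcribers, the following is a minimal sketch that assumes Cohen's kappa as the chance-corrected measure; the abstract does not name the statistic actually used. The function name `cohen_kappa`, the label set ("IP", "AP", "none") and the toy label sequences are hypothetical and are not taken from the corpus.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Chance-corrected agreement between two annotators (Cohen's kappa).

    Both arguments are equal-length sequences of category labels,
    one label per annotated position (for example, per word boundary).
    """
    if len(labels_a) != len(labels_b) or len(labels_a) == 0:
        raise ValueError("label sequences must be non-empty and of equal length")

    n = len(labels_a)

    # Observed agreement: proportion of positions with identical labels.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n

    # Expected agreement under independence, from each annotator's
    # marginal label frequencies.
    counts_a = Counter(labels_a)
    counts_b = Counter(labels_b)
    expected = sum(counts_a[k] * counts_b[k] for k in counts_a) / (n * n)

    if expected == 1.0:
        # Both annotators used a single identical label throughout.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Hypothetical boundary labels from two transcribers (illustration only).
transcriber_1 = ["IP", "none", "AP", "IP", "none", "none", "AP", "IP"]
transcriber_2 = ["IP", "none", "AP", "none", "none", "AP", "AP", "IP"]

kappa = cohen_kappa(transcriber_1, transcriber_2)
print(f"kappa = {kappa:.3f}")
```

A value of 0 corresponds to agreement no better than chance and 1 to perfect agreement, so a kappa well above 0 is one way to make a claim of consistency "well above chance" concrete.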
