Automatic Analysis of Speech Prosody in Dutch

In this paper we present a publicly available tool for automatic analysis of speech prosody (AASP) in Dutch. Incorporating the state-of-the-art analytical frameworks, AASP enables users to analyze prosody at two levels from different theoretical perspectives. Holistically, by means of the Functional Principal Component Analysis (FPCA) it generates mathematical functions that capture changes in the shape of a pitch contour. The tool outputs the weights of principal components in a table for users to process in further statistical analysis. Structurally, AASP analyzes prosody in terms of prosodic events within the auto-segmental metrical framework, hypothesizing prosodic labels in accordance with Transcription of Dutch Intonation (ToDI) with accuracy comparable to similar tools for other languages. Published as a Docker container, the tool can be set up on various operating systems in only two steps. Moreover, the tool is accessed through a graphic user interface, making it accessible to users with limited programming skills.

[1]  Mary E. Beckman,et al.  The Parsing of Prosody , 1996 .

[2]  Xuejing Sun,et al.  Pitch accent prediction using ensemble machine learning , 2002, INTERSPEECH.

[3]  Pilar Prieto,et al.  Intonational meaning. , 2015, Wiley interdisciplinary reviews. Cognitive science.

[4]  Lou Boves,et al.  What's in a word: Sounding sarcastic in British English , 2018, Journal of the International Phonetic Association.

[5]  Lou Boves,et al.  Using Functional Data Analysis for investigating multidimensional dynamic phonetic contrasts , 2015, J. Phonetics.

[6]  Mari Ostendorf,et al.  TOBI: a standard for labeling English prosody , 1992, ICSLP.

[7]  Aoju Chen,et al.  What's in a Rise: Evidence for an Off-ramp Analysis of Dutch Intonation , 2011, ICPhS.

[8]  David Escudero Mancebo,et al.  A fuzzy classifier to deal with similarity between labels on automatic prosodic labeling , 2014, Comput. Speech Lang..

[9]  Carlos Gussenhoven Correction: Analysis of Intonation: the Case of MAE_ToBI , 2016 .

[10]  Michael Hammond Prosodic Phonology , 2020, The Handbook of English Linguistics.

[11]  Andrew Rosenberg,et al.  Cross-Language Prominence Detection , 2012 .

[12]  Judith Hanssen,et al.  Regional variation in the realization of intonation contours in the Netherlands , 2006 .

[13]  Gina-Anne Levow,et al.  Context in multi-lingual tone and pitch accent recognition , 2005, INTERSPEECH.

[14]  C. Gussenhoven,et al.  Prosodic effects of focus in Dutch declaratives , 2008, Speech Prosody 2008.

[15]  Louise Corti,et al.  A CLARIN Transcription Portal for Interview Data , 2020, LREC.

[16]  Michele Gubian,et al.  L1 Prosodic transfer and priming effects: A quantitative study on semi-spontaneous dialogues , 2012 .

[17]  Andrew Rosenberg,et al.  AutoBI - a tool for automatic toBI annotation , 2010, INTERSPEECH.

[18]  Argyro Katsika,et al.  VARIABILITY AND CATEGORY OVERLAP IN THE REALIZATION OF INTONATION , 2019 .

[19]  Andreas Stolcke,et al.  Prosody Modeling for Automatic Speech Recognition and Understanding , 2004 .

[20]  Bo Xu,et al.  Automatic Prosodic Events Detection by Using Syllable-Based Acoustic, Lexical and Syntactic Features , 2011, INTERSPEECH.

[21]  J. Cole,et al.  Prosody in context: a review , 2015 .

[22]  Oliver Jokisch,et al.  Intonation-based classification of language proficiency using FDA , 2014 .

[23]  Lou Boves,et al.  Joint analysis of f0 and speech rate with Functional Data Analysis , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[24]  Ann Cutler,et al.  Prosody in the Comprehension of Spoken Language: A Literature Review , 1997, Language and speech.

[25]  Mari Ostendorf,et al.  Automatic labeling of prosodic patterns , 1994, IEEE Trans. Speech Audio Process..

[26]  Carlos Gussenhoven,et al.  Semantic judgments as evidence for the intonational structure of Dutch , 2008 .

[27]  Yang Liu,et al.  Automatic prosodic events detection using syllable-based acoustic and syntactic features , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[28]  David Escudero Mancebo,et al.  Improving Automatic Classification of Prosodic Events by Pairwise Coupling , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[29]  Carlos Gussenhoven,et al.  Transcription of Dutch intonation , 2005 .