A Manually Annotated Resource for the Investigation of Nasal Grunts

This paper presents an annotation framework for nasal grunts of the whole French CID corpus (Bertrand et al., 2008). The acoustic components under scrutiny are justified and the annotation guidelines are described. We carefully characterise the acoustic cues and visual cues followed by the annotator, especially for non-modal phonation types. The conventions followed for the annotation of interactional and positional properties of grunts are explained. The resulting datasets after data extraction with Praat scripts (Boersma and Weenink, 2019) are analysed with R (R Core Team, 2017), focusing on duration. We analyse the effect of non-modal phonation (especially ingressive phonation) on duration and discuss a specialisation of grunts observed in the CID for grunts with ingressive phonation. The more general aim of this research is to establish putative core and additive properties of grunts and a tentative typology of grunts in spoken interactions.

[1]  Esther Le Grézause,et al.  Um and Uh, and the expression of stance in conversational speech , 2017 .

[2]  Paul Boersma,et al.  Praat: doing phonetics by computer , 2003 .

[3]  W. Nigel,et al.  Pragmatic functions of prosodic features in non-lexical utterances , 2004, Speech Prosody 2004.

[4]  J. Hillenbrand,et al.  Acoustic correlates of breathy vocal quality. , 1994, Journal of speech and hearing research.

[5]  Lynnelle Rhinier Brown,et al.  Requesting the Context: A Context Analysis of Let Statement and If Statement Requests and Commands in the Santa Barbara Corpus of Spoken American English , 2014 .

[6]  David Crystal,et al.  A dictionary of linguistics and phonetics , 1997 .

[7]  Nigel Ward Issues in the Transcription of English Conversational Grunts , 2000, SIGDIAL Workshop.

[8]  R. Espesser,et al.  Le CID - Corpus of Interactional Data. Annotation et exploitation multimodale de parole conversationnelle [The “Corpus of Interactional Data” (CID) - Multimodal annotation of conversational speech”] , 2008, ICON.

[9]  Patricia A. Keating,et al.  Voicesauce: A Program for Voice Analysis , 2009, ICPhS.

[10]  Nigel Ward,et al.  Non-lexical conversational sounds in American English , 2006 .

[11]  H. H. Clark,et al.  Using uh and um in spontaneous speaking , 2002, Cognition.

[12]  Laurence Anthony,et al.  AntConc: A Learner and Classroom Friendly, Multi-Platform Corpus Analysis Toolkit , 2004 .

[13]  R. Eklund Pulmonic ingressive phonation: Diachronic and synchronic characteristics, distribution and function in animal and human sound production and in human speech , 2008, Journal of the International Phonetic Association.

[14]  Mitchell P. Marcus,et al.  Text Chunking using Transformation-Based Learning , 1995, VLC@ACL.

[15]  Jody Kreiman,et al.  Acoustic properties of different kinds of creaky voice , 2015, ICPhS.

[16]  G. Tottie From pause to word: uh, um and er in written American English , 2017, English Language and Linguistics.

[17]  Roxane Bertrand,et al.  Influence de la transcription sur la phonétisation automatique de corpus oraux (what is the impact of the transcription on the phonetization) [in French] , 2012, JEP/TALN/RECITAL.

[18]  Heather L. Balog,et al.  Do children produce the melody before the words? A review of developmental intonation research , 2002 .

[19]  Erik R. Thom,et al.  University of Pennsylvania Working Papers in Linguistics , 2007 .

[20]  Richard Ogden,et al.  An Introduction to English Phonetics , 2009, Phonetica.

[21]  Laurent Prevot,et al.  CoFee-Toward a multidimensional analysis of conversational feedback, the case of French language , 2012 .

[22]  Y. Meynadier La syllabe phonétique et phonologique : une introduction , 2001 .

[23]  Ashish Verma,et al.  Formant-based technique for automatic filled-pause detection in spontaneous spoken english , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[24]  Allard Jongman,et al.  Acoustic correlates of breathy and clear vowels: the case of Khmer , 2003, J. Phonetics.

[25]  R. Eklund Pulmonic ingressive speech: a neglected universal? , 2007 .

[26]  Nivja H. Jong,et al.  Praat script to detect syllable nuclei and measure speech rate automatically , 2009, Behavior research methods.

[27]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .