Analysis and Automatic Classification of Some Discourse Particles on a Large Set of French Spoken Corpora

In French, many words and expressions are frequently used as discourse particles in spoken language, especially in spontaneous speech. The semantic load of these words or expressions differs depending on whether they are used as discourse particles or not, so correctly identifying their discourse function is important. This paper studies the distribution of discourse versus non-discourse uses, and of the detailed discourse functions, of some of these words on a large set of French corpora ranging from prepared speech (e.g. storytelling and broadcast news) to spontaneous speech (e.g. interviews and interactions between people). The paper focuses on a subset of discourse particles that recur in the considered corpora. The discourse function of a few thousand occurrences of these words has been manually annotated. A statistical analysis of the functions of the words is presented and discussed with respect to the types of spoken corpora. Finally, some statistics on a few prosodic correlates of the discourse particles are presented, along with results of automatic classification and detection of the word function (discourse particle or not) using prosodic features.
