The Aix-MARSEC project: an evolutive database of spoken British English

This paper presents the Aix-MARSEC project, an evolutive database of spoken British English. It details the grapheme-to-phoneme conversion of the orthographic transcripts, the optimisation of the phonetic transcription by elision rules, the automatic phoneme-level alignment, and the automatic higher-level processing (syllables, subsyllabic structure, rhythmic units and MOMEL-INTSINT intonation coding). Users’ contributions will be integrated within the general framework of GNU GPL licensing. Preliminary pragmatic and prosodic studies are presented in the final part of the paper.
