Enhanced Rule-Based Phonetic Transcription for the Romanian Language

This paper presents a rule-based approach to the phonetic transcription of Romanian. We integrate this phonetic analysis into the text processing component of a text-to-speech system for Romanian. Grapheme-to-phoneme rules are constructed from expert information in the DOOMII dictionary. Where the rules are insufficient, we employ decision trees built on engineered training sets so that the classifiers can learn the exceptions of the language.
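As an illustration of what context-sensitive grapheme-to-phoneme rules look like, the following minimal sketch applies a few well-known Romanian spelling conventions (c and g soften before e and i, ch and gh stay hard, and diacritic letters map to single phonemes). It is not the rule set derived from DOOMII in the paper. The names `transcribe`, `SIMPLE`, and the `exceptions` lexicon are hypothetical, and the lexicon lookup is only a stand-in for the decision-tree classifiers, which are not reproduced here.

```python
# Illustrative sketch only: a handful of textbook Romanian spelling rules,
# not the paper's DOOMII-derived rules.
FRONT_VOWELS = {"e", "i"}

# Context-free letter-to-phoneme mappings (IPA); other letters map to themselves.
SIMPLE = {"ă": "ə", "â": "ɨ", "î": "ɨ", "ș": "ʃ", "ț": "ts", "j": "ʒ"}


def transcribe(word, exceptions=None):
    """Return a list of phoneme symbols for a single word.

    `exceptions` is a hypothetical word -> phoneme-list lexicon that stands
    in for a learned classifier handling words the rules get wrong.
    """
    word = word.lower()
    if exceptions and word in exceptions:
        return list(exceptions[word])

    phones = []
    i = 0
    while i < len(word):
        ch = word[i]
        nxt = word[i + 1] if i + 1 < len(word) else ""
        nxt2 = word[i + 2] if i + 2 < len(word) else ""
        if ch in ("c", "g"):
            hard = "k" if ch == "c" else "g"
            soft = "tʃ" if ch == "c" else "dʒ"
            if nxt == "h" and nxt2 in FRONT_VOWELS:
                # "che", "chi", "ghe", "ghi": hard consonant, the h is silent
                phones.append(hard)
                i += 2
            elif nxt in FRONT_VOWELS:
                # "ce", "ci", "ge", "gi": affricate
                phones.append(soft)
                i += 1
            else:
                phones.append(hard)
                i += 1
            continue
        phones.append(SIMPLE.get(ch, ch))
        i += 1
    return phones


if __name__ == "__main__":
    for w in ["cer", "chem", "casă", "ger", "ghem", "țară"]:
        print(w, transcribe(w))
```

The sketch treats every written vowel as a full vowel, so it does not handle cases such as non-syllabic i or diphthongs. Words of that kind are the sort of exceptions for which a fallback beyond simple rules is needed.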
