An open-source rule-based syllabification tool for Brazilian Portuguese

BackgroundThe automatic syllabification process is an essential prerequisite for speech synthesis systems. However, the task is not trivial, and several techniques have been adopted over the last decade. Furthermore, while there are many public resources for some languages (e.g., English and Japanese), the resources for Brazilian Portuguese (BP) are still limited. This paper discusses ways to diminish this drawback, through the implementation of an open-source syllabification system for BP.MethodsThe proposed tool is based on published rule-based algorithms, with some new proposals, especially in the treatment of words with diphthongs and hiatus.ResultsComputer experiments were performed on a randomly chosen extract of the CETEN-Folha text corpus, and the results showed the percentage of correctly syllabified words of 99%.ConclusionsA subjective evaluation was also conducted in order to compare the elaborated syllabification algorithm with the reference one within a text-to-speech system for BP. All developed codes and databases are publicly available.

[1]  Marc Schröder,et al.  The German Text-to-Speech Synthesis System MARY: A Tool for Research, Development and Teaching , 2003, Int. J. Speech Technol..

[2]  António J. S. Teixeira,et al.  On european Portuguese automatic syllabification , 2005, INTERSPEECH.

[3]  Fernando Perdigão,et al.  Investigating New Syllable Prototypes for the Portuguese Language , 2011, ICPhS.

[4]  Fábio Violaro,et al.  Um Sistema de Conversão Texto-Fala Para o Português Falado no Brasil , 2000 .

[5]  Arlo Faria Applied Phonetics : Portuguese Text-to-Speech , 2003 .

[6]  Aldebaro Klautau,et al.  A Rule-based Syllabification Algorithm with Stress Determination for Brazilian Portuguese Natural Language Processing , 2011, ICPhS.

[7]  Sandra G. Kafka,et al.  Parâmetros Lingüísticos Utilizados para a Geração Automática de Prosódia em Sistemas de Síntese de Fala , 2004, Anais do XXI Simpósio Brasileiro de Telecomunicações.

[8]  Maria Helena Mira Mateus Introdução a estudos de fonologia do português brasileiro , 2000 .

[9]  Keiichi Tokuda,et al.  Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis , 1999, EUROSPEECH.

[10]  Isabel Trancoso,et al.  WFST based Unit Selection for Concatenative Speech Synthesis in European Portuguese , 2003 .

[11]  Ranniery Maia,et al.  An Open Source HMM-based Text-to-Speech System for Brazilian Portuguese , 2010 .

[12]  Heiga Zen,et al.  An HMM-Based Brazilian Portuguese Speech Synthesizer and Its Characteristics DOI: 10.14209/jcis.2006.11 , 2015 .

[13]  Pedro Silva,et al.  HMM-based Brazilian Portuguese TTS , 2008 .

[14]  Jorge Proença,et al.  Rule-Based Algorithms for Automatic Pronunciation of Portuguese Verbal Inflections , 2014, PROPOR.

[15]  João Paulo Teixeira A prosody model to TTS systems , 2004 .

[16]  Paul Taylor,et al.  Text-to-Speech Synthesis , 2009 .

[17]  João Paulo da Silva Neto,et al.  Syllable onset detection applied to the portuguese language , 1999, EUROSPEECH.

[18]  Markus Rupp,et al.  Reproducible research in signal processing , 2009, IEEE Signal Processing Magazine.

[19]  C. Oliveira,et al.  On Automatic European Portuguese Syllabification , 2006 .

[20]  Jelena Kovacevic,et al.  Reproducible research in signal processing , 2009, IEEE Signal Process. Mag..

[21]  Diamantino Freitas,et al.  Towards an intonation module for a portuguese TTS system , 2002, INTERSPEECH.

[22]  João Paulo Teixeira,et al.  Divisão silábica automática do texto escrito e falado , 2000 .

[23]  Alan W. Black,et al.  Unit selection in a concatenative speech synthesis system using a large speech database , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.