Application of Genetic Algorithm in unit selection for Malay speech synthesis system

Corpus-based speech synthesis can produce high-quality synthetic speech because of its high sensitivity to unit context. A large speech database is embedded in the synthesis system, and a search algorithm (unit selection) is needed to find the optimal unit sequence. The speech features that serve as the target cost are estimated from the input text. The acoustic parameters that serve as the join cost are derived from mel-frequency cepstral coefficients (MFCCs) using Euclidean distance. In this paper, a Genetic Algorithm (GA) is proposed as a new method to search for the optimal unit sequence. A GA is a population-based stochastic search algorithm for solving optimization problems, built on the biological principles of selection, reproduction, crossover and mutation. The speech unit sequence with the minimum join cost is then synthesized into the complete waveform.
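
The search described above can be illustrated with a minimal sketch. It is not the paper's implementation; it only assumes that each target position has a list of candidate units, that each candidate is represented by the MFCC vectors of its first and last frames, and that the join cost between neighbouring units is the Euclidean distance between those boundary vectors. The names `total_join_cost` and `ga_unit_selection`, the parameter values, tournament selection, one-point crossover and elitism are all illustrative assumptions, and the target cost is left out for brevity.

```python
import math
import random


def total_join_cost(chromosome, candidates):
    """Sum of Euclidean distances across every join in the sequence.

    candidates[i] is the list of candidate units for position i; each unit
    is a (first_frame_mfcc, last_frame_mfcc) pair (an assumed representation).
    chromosome[i] is the index of the unit chosen for position i.
    """
    total = 0.0
    for i in range(len(chromosome) - 1):
        left = candidates[i][chromosome[i]]
        right = candidates[i + 1][chromosome[i + 1]]
        # Last frame of the left unit against first frame of the right unit.
        total += math.dist(left[1], right[0])
    return total


def ga_unit_selection(candidates, pop_size=30, generations=100,
                      crossover_rate=0.9, mutation_rate=0.05, seed=0):
    """Search for a low join-cost unit sequence with a simple GA (sketch)."""
    rng = random.Random(seed)
    n = len(candidates)

    def cost(chromosome):
        return total_join_cost(chromosome, candidates)

    def tournament(population):
        # Binary tournament: the cheaper of two random individuals wins.
        a, b = rng.sample(population, 2)
        return a if cost(a) <= cost(b) else b

    # Initial population: one random candidate index per position.
    population = [[rng.randrange(len(c)) for c in candidates]
                  for _ in range(pop_size)]
    best = min(population, key=cost)

    for _ in range(generations):
        next_population = [best[:]]  # elitism: keep the best sequence so far
        while len(next_population) < pop_size:
            p1, p2 = tournament(population), tournament(population)
            child = p1[:]
            if n > 1 and rng.random() < crossover_rate:
                cut = rng.randrange(1, n)  # one-point crossover
                child = p1[:cut] + p2[cut:]
            for i in range(n):
                if rng.random() < mutation_rate:
                    # Mutation: swap in another candidate for this position.
                    child[i] = rng.randrange(len(candidates[i]))
            next_population.append(child)
        population = next_population
        best = min(population, key=cost)

    return best, cost(best)
```

Calling `ga_unit_selection(candidates)` returns the chosen candidate index for each position together with the total join cost of that sequence. Because the search is stochastic, the result is a good sequence rather than a guaranteed optimum; the fixed `seed` only makes a run repeatable.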
