Adapting Prosody in a Text-to-Speech System
[1] D H Klatt,et al. Review of text-to-speech conversion for English , 1987, The Journal of the Acoustical Society of America.
[2] Holzapfel Martin. HMM-based database segmentation and unit selection for concatenative speech synthesis , 1999 .
[3] Geoffrey E. Hinton,et al. Learning internal representations by error propagation , 1986 .
[4] Hans-Georg Zimmermann,et al. A data-driven method for input feature selection within neural prosody generation , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[5] Justin Fackrell,et al. Designing prosodic databases for automatic modelling in 6 languages , 1998, SSW.
[6] Jacob Benesty,et al. Springer handbook of speech processing , 2007, Springer Handbooks.
[7] Ralf Kompe,et al. Prosody in Speech Understanding Systems , 1997, Lecture Notes in Computer Science.
[8] Barbara Heuft,et al. Prosody generation with a neural network , 1996 .
[9] Bogomir Horvat,et al. Labeling of Symbolic Prosody Breaks for the Slovenian Language , 2003, Int. J. Speech Technol.
[10] Rüdiger Hoffmann,et al. Natural F0 contours with a new neural-network-hybrid approach , 2000, INTERSPEECH.
[11] Christof Traber. F0 generation with a data base of natural F0 patterns and with a neural network , 1990, SSW.
[12] Horst-Udo Hain. Automation of the training procedures for neural networks performing multi-lingual grapheme to phoneme conversion , 1999, EUROSPEECH.
[13] Horst-Udo Hain,et al. A multi-lingual system for the determination of phonetic word stress using soft feature selection by neural networks , 2001, SSW.
[14] Matej Rojc,et al. Design of Optimal Slovenian Speech Corpus for Use in the Concatenative Speech Synthesis System , 2000, LREC.
[15] Justin Fackrell,et al. Automatic prosodic labeling of 6 languages , 1998, ICSLP.
[16] N. Campbell,et al. Voice Quality: the 4th Prosodic Dimension , 2004 .
[17] Barbara Heuft,et al. Prosody generation with a neural network: weighing the importance of input parameters , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[18] P. Boersma. Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound , 1993 .
[19] Lutz Prechelt,et al. Early Stopping - But When? , 1996, Neural Networks: Tricks of the Trade.
[20] Hans-Georg Zimmermann,et al. Segmental duration control by time delay neural networks with asymmetric causal and retro-causal information flows , 2002, ESANN.
[21] Halewijn Vereecken,et al. Improving the phonetic annotation by means of prosodic phrasing , 1997, EUROSPEECH.
[22] Thierry Dutoit. Corpus-Based Speech Synthesis , 2008 .
[23] Ralph Neuneier,et al. Robust generation of symbolic prosody by a neural classifier based on autoassociators , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[24] Fabio Tamburini,et al. Automatic detection of prosodic prominence in continuous speech , 2002, LREC.
[25] Rüdiger Hoffmann,et al. Robust unit selection based on syllable prosody parameters , 2002, Proceedings of 2002 IEEE Workshop on Speech Synthesis.
[26] Bogomir Horvat,et al. Designing Prosodic Databases for Automatic Modeling of Slovenian Language in a Multilingual TTS System , 2002, LREC.
[27] Rüdiger Hoffmann,et al. Data-driven importance analysis of linguistic and phonetic information , 2000, INTERSPEECH.
[28] Christopher M. Bishop,et al. Neural networks for pattern recognition , 1995 .
[29] Vincent J. van Heuven,et al. Acoustic correlates of linguistic stress and accent in Dutch and American English , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.
[30] Ralph Neuneier,et al. Modeling Dynamical Systems by Error Correction Neural Networks , 2002 .
[31] J. V. Santen,et al. The analysis of contextual effects on segmental duration , 1990 .
[32] Paul Taylor,et al. Assigning phrase breaks from part-of-speech sequences , 1997, Comput. Speech Lang.
[33] E. Nöth,et al. Recognition of Selected Prosodic Events in Slovenian Speech , 2022 .
[34] Slovenian Lang,et al. An Environment for Word Prominence Classification in Slovenian Language , 2003 .
[35] Peter Jackson,et al. Overview of Current Text-to-Speech Techniques: Part II - Prosody and Speech Generation , 1996 .
[36] Nick Campbell,et al. A nonlinear unit selection strategy for concatenative speech synthesis based on syllable level features , 1998, ICSLP.