Segmentation of Monologues in Audio Books for Building Synthetic Voices
暂无分享,去创建一个
[1] Keiichi Tokuda,et al. Mapping from articulatory movements to vocal tract spectrum with Gaussian mixture model for articulatory speech synthesis , 2004, SSW.
[2] Alan W. Black,et al. The CMU Arctic speech databases , 2004, SSW.
[3] Pedro J. Moreno,et al. A factor automaton approach for the forced alignment of long speech recordings , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.
[4] Simon King,et al. Thousands of Voices for HMM-Based Speech Synthesis–Analysis and Application of TTS Systems Built on Various ASR Corpora , 2009, IEEE Transactions on Audio, Speech, and Language Processing.
[5] Tanja Schultz,et al. Synthesizer voice quality of new languages calibrated with mean mel cepstral distortion , 2008, SLTU.
[6] Luís Carriço,et al. Spoken language technologies applied to digital talking books , 2006, INTERSPEECH.
[7] Alan W. Black,et al. CLUSTERGEN: a statistical parametric synthesizer using trajectory modeling , 2006, INTERSPEECH.
[8] Alan W. Black,et al. Optimizing segment label boundaries for statistical speech synthesis , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.
[9] Andreas Stolcke,et al. Automatic linguistic segmentation of conversational speech , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.
[10] Claudia Barolo,et al. Automatic diphone extraction for an Italian text-to-speech synthesis system , 1997, EUROSPEECH.
[11] Pedro J. Moreno,et al. A recursive algorithm for the forced alignment of very long audio segments , 1998, ICSLP.