Text-to-Speech Synthesis
暂无分享,去创建一个
Jinfu Ni | Yoshinori Shiga | Kentaro Tachibana | Takuma Okamoto | T. Okamoto | Kentaro Tachibana | Y. Shiga | Jinfu Ni
[1] Nick Campbell,et al. Optimising selection of units from speech databases for concatenative synthesis , 1995, EUROSPEECH.
[2] Tomoki Toda,et al. An Investigation of Subband Wavenet Vocoder Covering Entire Audible Frequency Range with Limited Acoustic Features , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[3] L. Rabiner,et al. Multirate digital signal processing: Prentice-Hall, Inc. Englewood Cliffs, New Jersey 07362, 1983, 411 pp., ISBN 0-13-605162-6 , 1983 .
[4] Dennis H. Klatt,et al. Software for a cascade/parallel formant synthesizer , 1980 .
[5] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.
[6] Tomoki Toda,et al. Speaker-Dependent WaveNet Vocoder , 2017, INTERSPEECH.
[7] Heiga Zen,et al. Statistical parametric speech synthesis using deep neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[8] Keiichi Tokuda,et al. Speech parameter generation algorithms for HMM-based speech synthesis , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).
[9] Yoshua Bengio,et al. SampleRNN: An Unconditional End-to-End Neural Audio Generation Model , 2016, ICLR.
[10] Adam Coates,et al. Deep Voice: Real-time Neural Text-to-Speech , 2017, ICML.
[11] Samy Bengio,et al. Tacotron: Towards End-to-End Speech Synthesis , 2017, INTERSPEECH.
[12] Tomoki Toda,et al. Subband wavenet with overlapped single-sideband filterbanks , 2017, 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU).
[13] Heiga Zen,et al. Hidden Semi-Markov Model Based Speech Synthesis System , 2006 .
[14] Yoshua Bengio,et al. Char2Wav: End-to-End Speech Synthesis , 2017, ICLR.
[15] Eric Moulines,et al. Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones , 1989, Speech Commun..
[16] Hisashi Kawai,et al. Global Syllable Vectors for Building TTS Front-End with Deep Learning , 2017, INTERSPEECH.
[17] Claire Cardie,et al. Opinion Mining with Deep Recurrent Neural Networks , 2014, EMNLP.