Voice Conversion for Whispered Speech Synthesis
暂无分享,去创建一个
[1] S. Jovicic,et al. Acoustic analysis of consonants in whispered speech. , 2008, Journal of voice : official journal of the Voice Foundation.
[2] Andrew J Oxenham,et al. Intelligibility of whispered speech in stationary and modulated noise maskers. , 2012, The Journal of the Acoustical Society of America.
[3] Prasanta Kumar Ghosh,et al. Reconstruction of articulatory movements during neutral speech from those during whispered speech. , 2018, The Journal of the Acoustical Society of America.
[4] Junichi Yamagishi,et al. The Voice Conversion Challenge 2018: Promoting Development of Parallel and Nonparallel Methods , 2018, Odyssey.
[5] Tomoki Toda,et al. Refined WaveNet Vocoder for Variational Autoencoder Based Voice Conversion , 2018, 2019 27th European Signal Processing Conference (EUSIPCO).
[6] Kazuya Takeda,et al. Acoustic analysis and recognition of whispered speech , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[7] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[8] Anders Krogh,et al. A Simple Weight Decay Can Improve Generalization , 1991, NIPS.
[9] Nathalie Henrich Bernardoni,et al. The spectrum of glottal flow models , 2006 .
[10] Seyed Hamidreza Mohammadi,et al. Voice conversion using deep neural networks with speaker-independent pre-training , 2014, 2014 IEEE Spoken Language Technology Workshop (SLT).
[11] Song Han,et al. Learning both Weights and Connections for Efficient Neural Network , 2015, NIPS.
[12] Satoshi Imai,et al. Cepstral analysis synthesis on the mel frequency scale , 1983, ICASSP.
[13] Moncef Gabbouj,et al. Voice Conversion Using Partial Least Squares Regression , 2010, IEEE Transactions on Audio, Speech, and Language Processing.
[14] Tomoki Toda,et al. sprocket: Open-Source Voice Conversion Software , 2018, Odyssey.
[15] Keiichi Tokuda,et al. A Speech Parameter Generation Algorithm Considering Global Variance for HMM-Based Speech Synthesis , 2007, IEICE Trans. Inf. Syst..
[16] John H. L. Hansen,et al. Advancements in whisper-island detection within normally phonated audio streams , 2009, INTERSPEECH.
[17] Shinnosuke Takamichi,et al. Voice Conversion Using Input-to-Output Highway Networks , 2017, IEICE Trans. Inf. Syst..
[18] Eric Moulines,et al. Continuous probabilistic transform for voice conversion , 1998, IEEE Trans. Speech Audio Process..
[19] Masanori Morise,et al. WORLD: A Vocoder-Based High-Quality Speech Synthesis System for Real-Time Applications , 2016, IEICE Trans. Inf. Syst..
[20] Tomoki Toda,et al. Voice Conversion Based on Maximum-Likelihood Estimation of Spectral Parameter Trajectory , 2007, IEEE Transactions on Audio, Speech, and Language Processing.
[21] K. Kallail,et al. Formant-frequency differences between isolated whispered and phonated vowel samples produced by adult female subjects. , 1984, Journal of speech and hearing research.
[22] I. Mcloughlin,et al. A comprehensive vowel space for whispered speech. , 2012, Journal of voice : official journal of the Voice Foundation.
[23] V. Tartter. What's in a whisper? , 1989, The Journal of the Acoustical Society of America.
[24] Abeer Alwan,et al. Glottal source processing: From analysis to applications , 2014, Comput. Speech Lang..
[25] Terry K Koo,et al. A Guideline of Selecting and Reporting Intraclass Correlation Coefficients for Reliability Research. , 2016, Journal Chiropractic Medicine.
[26] Kou Tanaka,et al. StarGAN-VC: non-parallel many-to-many Voice Conversion Using Star Generative Adversarial Networks , 2018, 2018 IEEE Spoken Language Technology Workshop (SLT).
[27] Gunnar Fant,et al. Acoustic Theory Of Speech Production , 1960 .
[28] Yu Tsao,et al. Voice Conversion from Unaligned Corpora Using Variational Autoencoding Wasserstein Generative Adversarial Networks , 2017, INTERSPEECH.
[29] J. Liljencrants,et al. Dept. for Speech, Music and Hearing Quarterly Progress and Status Report a Four-parameter Model of Glottal Flow , 2022 .
[30] L. Bottou. Stochastic Gradient Learning in Neural Networks , 1991 .
[31] Kun Li,et al. Voice conversion using deep Bidirectional Long Short-Term Memory based Recurrent Neural Networks , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[32] Roland Maas,et al. LSTM-Based Whisper Detection , 2018, 2018 IEEE Spoken Language Technology Workshop (SLT).
[33] A Donner,et al. The estimation of intraclass correlation in the analysis of family data. , 1980, Biometrics.
[34] Haizhou Li,et al. Average Modeling Approach to Voice Conversion with Non-Parallel Data , 2018, Odyssey.
[35] Junichi Yamagishi,et al. High-Quality Nonparallel Voice Conversion Based on Cycle-Consistent Adversarial Network , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[36] Nitish Srivastava,et al. Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..
[37] Prasanta Kumar Ghosh,et al. Whispered Speech to Neutral Speech Conversion Using Bidirectional LSTMs , 2018, INTERSPEECH.
[38] W. Zemlin,et al. Quantitative study of whisper. , 1984, Folia phoniatrica.
[39] Kou Tanaka,et al. ATTS2S-VC: Sequence-to-sequence Voice Conversion with Attention and Context Preservation Mechanisms , 2018, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[40] Boon Pang Lim,et al. Computational differences between whispered and non-whispered speech , 2011 .
[41] Kou Tanaka,et al. ACVAE-VC: Non-parallel many-to-many voice conversion with auxiliary classifier variational autoencoder , 2018, ArXiv.
[42] A. Oppenheim,et al. Homomorphic analysis of speech , 1968 .
[43] Ian Vince McLoughlin,et al. Reconstruction of Normal Sounding Speech for Laryngectomy Patients Through a Modified CELP Codec , 2010, IEEE Transactions on Biomedical Engineering.