Accent Modeling of Low-Resourced Dialect in Pitch Accent Language Using Variational Autoencoder
暂无分享,去创建一个
[1] Thomas Drugman,et al. Camp: A Two-Stage Approach to Modelling Prosody in Context , 2020, ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[2] David A. Landgrebe,et al. A survey of decision tree classifier methodology , 1991, IEEE Trans. Syst. Man Cybern..
[3] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.
[4] T. Toda,et al. The NAIST Text-to-Speech System for the Blizzard Challenge 2015 , 2015, The Blizzard Challenge 2015.
[5] Masanori Morise,et al. WORLD: A Vocoder-Based High-Quality Speech Synthesis System for Real-Time Applications , 2016, IEICE Trans. Inf. Syst..
[6] Junichi Yamagishi,et al. Modeling and interpolation of Austrian German and Viennese dialect in HMM-based speech synthesis , 2010, Speech Commun..
[7] Oriol Vinyals,et al. Neural Discrete Representation Learning , 2017, NIPS.
[8] Tomoki Koriyama,et al. JSUT and JVS: Free Japanese voice corpora for accelerating speech synthesis research , 2020, Acoustical Science and Technology.
[9] Yuxuan Wang,et al. Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis , 2018, ICML.
[10] Vincent Wan,et al. CHiVE: Varying Prosody in Speech Synthesis with a Linguistically Driven Dynamic Hierarchical Conditional Variational Network , 2019, ICML.
[11] Tomoki Koriyama,et al. Semi-Supervised Prosody Modeling Using Deep Gaussian Process Latent Variable Model , 2019, INTERSPEECH.
[12] Xin Wang,et al. Investigation of Enhanced Tacotron Text-to-speech Synthesis Systems with Self-attention for Pitch Accent Language , 2018, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[13] Jennifer Williams,et al. Improved Prosody from Learned F0 Codebook Representations for VQ-VAE Speech Waveform Reconstruction , 2020, INTERSPEECH.