Vocaine the vocoder and applications in speech synthesis
暂无分享,去创建一个
[1] G. Goertzel. An Algorithm for the Evaluation of Finite Trigonometric Series , 1958 .
[2] S. Imai,et al. Mel Log Spectrum Approximation (MLSA) filter for speech synthesis , 1983 .
[3] Thomas F. Quatieri,et al. Sinusoidal transform coding , 1988 .
[4] R. J. McAulay,et al. Computationally efficient sine-wave synthesis and its application to sinusoidal transform coding , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.
[5] Jae S. Lim,et al. Multiband excitation vocoder , 1988, IEEE Transactions on Acoustics, Speech, and Signal Processing.
[6] Julius O. Smith,et al. Spectral modeling synthesis: A sound analysis/synthesis based on a deterministic plus stochastic decomposition , 1990 .
[7] Xavier Serra,et al. A sound analysis/synthesis system based on a deterministic plus stochastic decomposition , 1990 .
[8] Eric Moulines,et al. Voice transformation using PSOLA technique , 1991, Speech Commun..
[9] T. Barnwell,et al. A mixed excitation LPC vocoder model for low bit rate speech coding , 1995, IEEE Trans. Speech Audio Process..
[10] Yannis Stylianou,et al. Harmonic plus noise models for speech, combined with statistical methods, for speech and speaker modification , 1996 .
[11] Petros Maragos,et al. Speech analysis and synthesis using an AM-FM modulation model , 1999, Speech Commun..
[12] Alan McCree,et al. A 14 kb/s wideband speech coder with a parametric highband model , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).
[13] Keiichi Tokuda,et al. Speech parameter generation algorithms for HMM-based speech synthesis , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).
[14] Jan Skoglund,et al. On time-frequency masking in voiced speech , 2000, IEEE Trans. Speech Audio Process..
[15] Y. Stylianou. A simple and fast way of generating a harmonic signal , 2000, IEEE Signal Processing Letters.
[16] Thomas Quatieri,et al. Discrete-Time Speech Signal Processing: Principles and Practice , 2001 .
[17] Yannis Stylianou,et al. Applying the harmonic plus noise model in concatenative speech synthesis , 2001, IEEE Trans. Speech Audio Process..
[18] Axel R¨obel. A NEW APPROACH TO TRANSIENT PROCESSING IN THE PHASE VOCODER , 2003 .
[19] Fabrice Labeau,et al. Discrete Time Signal Processing , 2004 .
[20] Yannis Stylianou,et al. Combined estimation/coding of highband spectral envelopes for speech spectrum expansion , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[21] D. Mehta,et al. Synthesis, analysis, and pitch modification of the breathy vowel , 2005, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2005..
[22] Hideki Kawahara,et al. STRAIGHT, exploitation of the other aspect of VOCODER: Perceptually isomorphic decomposition of speech sounds , 2006 .
[23] K. Honda,et al. Cyclicity of laryngeal cavity resonance due to vocal fold vibration. , 2006, The Journal of the Acoustical Society of America.
[24] Yannis Stylianou,et al. Fast Analysis/Synthesis of Harmonic Signals , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.
[25] Heiga Zen,et al. Statistical Parametric Speech Synthesis , 2007, IEEE International Conference on Acoustics, Speech, and Signal Processing.
[26] Olivier Rosec,et al. Towards flexible speech coding for speech synthesis: an LF + modulated noise vocoder , 2008, INTERSPEECH.
[27] Yannis Stylianou,et al. Improving the modeling of the noise part in the harmonic plus noise model of speech , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.
[28] Hideki Kawahara,et al. Tandem-STRAIGHT: A temporally stable power spectral representation for periodic signals and applications to interference-free spectrum, F0, and aperiodicity estimation , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.
[29] Yannis Stylianou,et al. Wrapped Gaussian Mixture Models for Modeling and High-Rate Quantization of Phase Data of Speech , 2009, IEEE Transactions on Audio, Speech, and Language Processing.
[30] Olivier Rosec,et al. ARX-LF-based source-filter methods for voice modification and transformation , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.
[31] Junichi Yamagishi,et al. An experimental comparison of multiple vocoder types , 2013, SSW.
[32] Heiga Zen,et al. Statistical parametric speech synthesis using deep neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[33] Inma Hernáez,et al. Harmonics Plus Noise Model Based Vocoder for Statistical Parametric Speech Synthesis , 2014, IEEE Journal of Selected Topics in Signal Processing.
[34] Heiga Zen,et al. Unidirectional long short-term memory recurrent neural network with recurrent output layer for low-latency speech synthesis , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).