Nearly defect-free F0 trajectory extraction for expressive speech modifications based on STRAIGHT
暂无分享,去创建一个
Hideki Kawahara | Toshio Irino | Alain de Cheveigné | Toru Takahashi | Hideki Banno | T. Irino | Hideki Kawahara | Toru Takahashi | Hideki Banno | A. Cheveigné
[1] Satoshi Nakamura,et al. Robust fundamental frequency estimation using instantaneous frequencies of harmonic components , 2000, INTERSPEECH.
[2] Peter F Assmann,et al. Synthesis fidelity and time-varying spectral change in vowels. , 2005, The Journal of the Acoustical Society of America.
[3] Diane Kewley-Port,et al. Vowel formant discrimination for high-fidelity speech. , 2004, The Journal of the Acoustical Society of America.
[4] Hideki Kawahara,et al. YIN, a fundamental frequency estimator for speech and music. , 2002, The Journal of the Acoustical Society of America.
[5] Hideki Kawahara,et al. Acappella synthesis demonstrations using RWC music database , 2004, NIME.
[6] Roy D. Patterson,et al. Segregating information about the size and shape of the vocal tract using a time-domain auditory model: The stabilised wavelet-Mellin transform , 2002, Speech Commun..
[7] Roy D. Patterson,et al. Fixed point analysis of frequency to instantaneous frequency mapping for accurate estimation of F0 and periodicity , 1999, EUROSPEECH.
[8] Hideki Kawahara,et al. Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds , 1999, Speech Commun..
[9] Hideki Kawahara,et al. Auditory morphing based on an elastic perceptual distance metric in an interference-free time-frequency representation , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..
[10] Richard E. Turner,et al. The processing and perception of size information in speech sounds. , 2005, The Journal of the Acoustical Society of America.
[11] Hideki Kawahara,et al. Algorithm amalgam: morphing waveform based methods, sinusoidal models and STRAIGHT , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[12] Hideki Kawahara,et al. Investigation of emotionally morphed speech perception and its structure using a high quality speech manipulation system , 2003, INTERSPEECH.