Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
Hideki Kawahara | Alain de Cheveigné | Ikuyo Masuda-Katsuse
[1] Albert S. Bregman,et al. Auditory Scene Analysis , 2001 .
[2] J. Jiang,et al. Vocal fold physiology. , 2000, Otolaryngologic clinics of North America.
[3] S. L. Hanauer,et al. Speech Analysis and Synthesis by Linear Prediction of the Speech Wave , 2000 .
[4] A. de Cheveigné. Cancellation model of pitch perception. , 1998 .
[5] A. de Cheveigné. Cancellation model of pitch perception. , 1998, The Journal of the Acoustical Society of America.
[6] Hideki Kawahara,et al. Speech representation and transformation using adaptive interpolation of weighted spectrum: vocoder revisited , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[7] Takao Kobayashi,et al. Robust pitch estimation with harmonics enhancement in noisy environments based on instantaneous frequency , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.
[8] Malcolm Slaney,et al. Automatic audio morphing , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.
[9] Raymond N. J. Veldhuis,et al. Time-scale and pitch modifications of speech signals and resynthesis from the discrete short-time Fourier transform , 1996, Speech Commun..
[10] Takao Kobayashi,et al. Harmonics Estimation Based on Instantaneous Frequency and Its Application to Pitch Determination of Speech , 1995, IEICE Trans. Inf. Syst..
[11] Bayya Yegnanarayana,et al. Transformation of formants for voice conversion using artificial neural networks , 1995, Speech Commun..
[12] Eric Moulines,et al. High-quality speech modification based on a harmonic + noise model , 1995, EUROSPEECH.
[13] Martin Cooke,et al. Modelling auditory processing and organisation , 1993, Distinguished dissertations in computer science.
[14] Thierry Dutoit,et al. An analysis of the performances of the MBE model when used in the context of a text-to-speech system , 1993, EUROSPEECH.
[15] Richard R. Fay,et al. The Mammalian Auditory Pathway: Neuroanatomy , 1992, Springer Handbook of Auditory Research.
[16] Boualem Boashash,et al. Estimating and interpreting the instantaneous frequency of a signal. I. Fundamentals , 1992, Proc. IEEE.
[17] Boualem Boashash,et al. Estimating and interpreting the instantaneous frequency of a signal. II. Algorithms and applications , 1992, Proc. IEEE.
[18] Amro El-Jaroudi,et al. Discrete all-pole modeling , 1991, IEEE Trans. Signal Process..
[19] Isabel Trancoso,et al. Hybrid sinusoidal modeling of speech without voicing decision , 1991, EUROSPEECH.
[20] L. Cohen,et al. Time-frequency distributions-a review , 1989, Proc. IEEE.
[21] Jae S. Lim,et al. Multiband excitation vocoder , 1988, IEEE Transactions on Acoustics, Speech, and Signal Processing.
[22] R. Patterson,et al. A pulse ribbon model of monaural phase perception. , 1987, The Journal of the Acoustical Society of America.
[23] B. Atal,et al. Role of multi-pulse excitation in synthesis of natural-sounding voiced speech , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.
[24] Thomas F. Quatieri,et al. Speech analysis/Synthesis based on a sinusoidal representation , 1986, IEEE Trans. Acoust. Speech Signal Process..
[25] George R. Doddington,et al. An integrated pitch tracking algorithm for speech systems , 1983, ICASSP.
[26] H. Barlow. Vision: A computational investigation into the human representation and processing of visual information: David Marr. San Francisco: W. H. Freeman, 1982. pp. xvi + 397 , 1983 .
[27] J. Blauert,et al. Group delay distortions in electroacoustical systems , 1978 .
[28] E. A. Flinn. Comments on “Speech Analysis and Synthesis by Linear Prediction of the Speech Wave” [B. S. Atal and S. L. Hanauer, J. Acoust. Soc. Amer. 50, 637–655 (1971)] , 1972 .
[29] B. Atal,et al. Speech analysis and synthesis by linear prediction of the speech wave. , 1971, The Journal of the Acoustical Society of America.
[30] F. Itakura,et al. A statistical method for estimation of speech spectral density and formant frequencies , 1970 .
[31] B. L. Cardozo,et al. Pitch of the Residue , 1962 .