Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds

[1]  Albert S. Bregman,et al.  Auditory Scene Analysis , 2001 .

[2]  J. Jiang,et al.  Vocal fold physiology. , 2000, Otolaryngologic clinics of North America.

[3]  S. L. Hanauer,et al.  Speech Analysis and Synthesis by Linear Prediction of the Speech Wave , 2000 .

[4]  A. de Cheveigné Cancellation model of pitch perception. , 1998 .

[5]  A. de Cheveigné Cancellation model of pitch perception. , 1998, The Journal of the Acoustical Society of America.

[6]  Hideki Kawahara,et al.  Speech representation and transformation using adaptive interpolation of weighted spectrum: vocoder revisited , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[7]  Takao Kobayashi,et al.  Robust pitch estimation with harmonics enhancement in noisy environments based on instantaneous frequency , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[8]  Malcolm Slaney,et al.  Automatic audio morphing , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[9]  Raymond N. J. Veldhuis,et al.  Time-scale and pitch modifications of speech signals and resynthesis from the discrete short-time Fourier transform , 1996, Speech Commun..

[10]  Takao Kobayashi,et al.  Harmonics Estimation Based on Instantaneous Frequency and Its Application to Pitch Determination of Speech , 1995, IEICE Trans. Inf. Syst..

[11]  Bayya Yegnanarayana,et al.  Transformation of formants for voice conversion using artificial neural networks , 1995, Speech Commun..

[12]  Eric Moulines,et al.  High-quality speech modification based on a harmonic + noise model , 1995, EUROSPEECH.

[13]  Martin Cooke,et al.  Modelling auditory processing and organisation , 1993, Distinguished dissertations in computer science.

[14]  Thierry Dutoit,et al.  An analysis of the performances of the MBE model when used in the context of a text-to-speech system , 1993, EUROSPEECH.

[15]  Richard R. Fay,et al.  The Mammalian Auditory Pathway: Neuroanatomy , 1992, Springer Handbook of Auditory Research.

[16]  Boualem Boashash,et al.  Estimating and interpreting the instantaneous frequency of a signal. I. Fundamentals , 1992, Proc. IEEE.

[17]  Boualem Boashash,et al.  Estimating and interpreting the instantaneous frequency of a signal. II. Algorithms and applications , 1992, Proc. IEEE.

[18]  Amro El-Jaroudi,et al.  Discrete all-pole modeling , 1991, IEEE Trans. Signal Process..

[19]  Isabel Trancoso,et al.  Hybrid sinusoidal modeling of speech without voicing decision , 1991, EUROSPEECH.

[20]  L. Cohen,et al.  Time-frequency distributions-a review , 1989, Proc. IEEE.

[21]  Jae S. Lim,et al.  Multiband excitation vocoder , 1988, IEEE Transactions on Acoustics, Speech, and Signal Processing.

[22]  R. Patterson,et al.  A pulse ribbon model of monaural phase perception. , 1987, The Journal of the Acoustical Society of America.

[23]  B. Atal,et al.  Role of multi-pulse excitation in synthesis of natural-sounding voiced speech , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[24]  Thomas F. Quatieri,et al.  Speech analysis/Synthesis based on a sinusoidal representation , 1986, IEEE Trans. Acoust. Speech Signal Process..

[25]  George R. Doddington,et al.  An integrated pitch tracking algorithm for speech systems , 1983, ICASSP.

[26]  H. Barlow Vision: A computational investigation into the human representation and processing of visual information: David Marr. San Francisco: W. H. Freeman, 1982. pp. xvi + 397 , 1983 .

[27]  J. Blauert,et al.  Group delay distortions in electroacoustical systems , 1978 .

[28]  E. A. Flinn Comments on “Speech Analysis and Synthesis by Linear Prediction of the Speech Wave” [B. S. Atal and S. L. Hanauer, J. Acoust. Soc. Amer. 50, 637–655 (1971)] , 1972 .

[29]  B. Atal,et al.  Speech analysis and synthesis by linear prediction of the speech wave. , 1971, The Journal of the Acoustical Society of America.

[30]  F. Itakura,et al.  A statistical method for estimation of speech spectral density and formant frequencies , 1970 .

[31]  B. L. Cardozo,et al.  Pitch of the Residue , 1962 .