Modelling and extraction of fundamental frequency in speech signals

[1]  D.P. Skinner,et al.  The cepstrum: A guide to processing , 1977, Proceedings of the IEEE.

[2]  Wei-Ping Zhu,et al.  Robust pitch estimation at very low SNR exploiting time and frequency domain cues , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[3]  F. Harris On the use of windows for harmonic analysis with the discrete Fourier transform , 1978, Proceedings of the IEEE.

[4]  David Friedman Multichannel zero-crossing-interval pitch estimation , 1979, ICASSP.

[5]  Yannis Stylianou,et al.  Applying the harmonic plus noise model in concatenative speech synthesis , 2001, IEEE Trans. Speech Audio Process..

[6]  G. Muhammad,et al.  Noise Robust Pitch Detection Based on Extended AMDF , 2008, 2008 IEEE International Symposium on Signal Processing and Information Technology.

[7]  Keikichi Hirose,et al.  A scheme for pitch extraction of speech using autocorrelation function with frame length proportional to the time lag , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[8]  José Carlos Príncipe,et al.  A Pitch Detector Based on a Generalized Correlation Function , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[9]  Zhi-Yong Tao,et al.  Detection of Dynamic Structures of Speech Fundamental Frequency in Tonal Languages , 2010, IEEE Signal Processing Letters.

[10]  A. Shah,et al.  Robust pitch estimation using an event based adaptive Gaussian derivative filter , 2002, 2002 IEEE International Symposium on Circuits and Systems. Proceedings (Cat. No.02CH37353).

[11]  Takao Kobayashi,et al.  Robust pitch estimation with harmonics enhancement in noisy environments based on instantaneous frequency , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[12]  Hirokazu Kameoka,et al.  Speech Spectrum Modeling for Joint Estimation of Spectral Envelope and Fundamental Frequency , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[13]  Claude Barras,et al.  Speech fundamental frequency estimation using the alternate comb , 2007, INTERSPEECH.

[14]  David Talkin,et al.  A Robust Algorithm for Pitch Tracking (RAPT) , 2005 .

[15]  C Manfredi,et al.  A comparative analysis of fundamental frequency estimation methods with application to pathological voices. , 2000, Medical engineering & physics.

[16]  Thomas F. Quatieri,et al.  Pitch estimation and voicing detection based on a sinusoidal speech model , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[17]  Andreas Spanias,et al.  Cepstrum-based pitch detection using a new statistical V/UV classification algorithm , 1999, IEEE Trans. Speech Audio Process..

[18]  Ursula Gisela Goldstein,et al.  An articulatory model for the vocal tracts of growing children , 1980 .

[19]  Patrick J. Wolfe,et al.  Model-based estimation of instantaneous pitch in noisy speech , 2009, INTERSPEECH.

[20]  M. Ross,et al.  Average magnitude difference function pitch extractor , 1974 .

[21]  Saudi Arabia,et al.  A High Resolution Pitch Detection Algorithm Based on AMDF and ACF , 2009 .

[22]  Tayfun Akgül,et al.  Discrete all-pole modeling using higher-order spectra , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[23]  Sanjit K. Mitra,et al.  Pitch estimation of speech signal based on adaptive lattice notch filter , 2005, Signal Process..

[24]  David Gerhard,et al.  Pitch Extraction and Fundamental Frequency: History and Current Techniques , 2003 .

[25]  Guo Shize,et al.  Improving AMDF for pitch period detection , 2009, 2009 9th International Conference on Electronic Measurement & Instruments.

[26]  Moeness G. Amin A frequency-domain LMS comb filter , 1991 .

[27]  J. Bartošek A Pitch Detection Algorithm for Continuous Speech Signals Using Viterbi Traceback with Temporal Forgetting , 2011 .

[28]  D. Yavuz,et al.  Algorithm for pitch extraction using zero-crossing interval sequence , 1977 .

[29]  D. Tuffelli,et al.  A pitch detection algorithm with hypothesis and test strategy by means of fast surface AMDF , 1984, ICASSP.

[30]  Z. Milivojevic,et al.  Estimation of the fundamental frequency of the speech signal modeled by the SYMPES method , 2009 .

[31]  Jae S. Lim,et al.  Multiband excitation vocoder , 1988, IEEE Transactions on Acoustics, Speech, and Signal Processing.

[32]  Lawrence R. Rabiner,et al.  On the use of autocorrelation analysis for pitch detection , 1977 .

[33]  John J. Ohala,et al.  Speed of Pitch Change , 1973 .

[34]  Pitch determination of noisy speech using higher order statistics , 2004 .

[35]  Tomohiro Nakatani,et al.  Fundamental frequency of infants' and parents' utterances in longitudinal recordings. , 2006, The Journal of the Acoustical Society of America.

[36]  Jerry M. Mendel,et al.  Tutorial on higher-order statistics (spectra) in signal processing and system theory: theoretical results and some applications , 1991, Proc. IEEE.

[37]  S. Koh,et al.  Application of instantaneous frequency estimation for fundamental frequency detection , 1994, Proceedings of IEEE-SP International Symposium on Time- Frequency and Time-Scale Analysis.

[38]  Lawrence R. Rabiner,et al.  Application of an LPC distance measure to the voiced-unvoiced-silence detection problem , 1977 .

[39]  Gang Xu,et al.  Pitch estimation based on Circular AMDF , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[40]  T. Irino,et al.  Robust and accurate fundamental frequency estimation based on dominant harmonic components. , 2004, The Journal of the Acoustical Society of America.

[41]  Shigeru Ando,et al.  An Optimal Comb Filter for Time-Varying Harmonics Extraction (Special Section on Digital Signal Processing) , 1998 .

[42]  J. Liljencrants,et al.  A four-parameter model of glottal flow , 1985, Dept. for Speech, Music and Hearing Quarterly Progress and Status Report.

[43]  Xudong Jiang Fundamental frequency estimation by higher order spectrum , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[44]  F. Cesbron Pitch detection using the short-term phase spectrum , 1992 .

[45]  Martin Vondra,et al.  Speech Spectrum Envelope Modeling , 2007, COST 2102 Workshop.

[46]  T. Parks,et al.  Maximum likelihood pitch estimation , 1976 .

[47]  Greg Kochanski,et al.  Precision of phoneme boundaries derived using hidden Markov models , 2009, INTERSPEECH.

[48]  Saeed Vaseghi,et al.  Pitch extraction using modified higher order moments , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[49]  Aaron E. Rosenberg,et al.  A comparative performance study of several pitch detection algorithms , 1976 .

[50]  Ronald W. Schafer,et al.  Digital Processing of Speech Signals , 1978 .

[51]  E. Chilton,et al.  The spectral autocorrelation applied to the linear prediction residual of speech for robust pitch detection , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[52]  E. P. Neuburg On estimating rate of change of pitch , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[53]  Mike Brookes,et al.  A Pitch Estimation Filter robust to high levels of noise (PEFAC) , 2011, 2011 19th European Signal Processing Conference.

[54]  Yoshiaki Tadokoro,et al.  Pitch detection of musical sounds noticing minimum output of parallel connected comb filters , 2003, TENCON 2003. Conference on Convergent Technologies for Asia-Pacific Region.

[55]  D. Paul The spectral envelope estimation vocoder , 1981 .

[56]  M. Sondhi,et al.  New methods of pitch extraction , 1968 .

[57]  Saeed V. Vaseghi,et al.  Advanced Digital Signal Processing and Noise Reduction , 2006 .

[58]  C. L. Nikias,et al.  Signal processing with higher-order spectra , 1993, IEEE Signal Processing Magazine.

[59]  Paul Taylor,et al.  A Phonetic Model of English Intonation , 1992 .

[60]  C. Espy-Wilson,et al.  Maximum likelihood pitch estimation using sinusoidal modeling , 2011, 2011 International Conference on Communications and Signal Processing.

[61]  Stephen A. Zahorian,et al.  Yet Another Algorithm for Pitch Tracking , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[62]  Ben P. Milner,et al.  A comparison of estimated and MAP-predicted formants and fundamental frequencies with a speech reconstruction application , 2007, INTERSPEECH.

[63]  Robert J. McAulay Maximum likelihood pitch estimation using state-variable techniques , 1978, ICASSP.

[64]  Saeed Vaseghi,et al.  Fundamental Frequency Estimation Using Modified Higher Order Moments and Multiple Windows , 2011, INTERSPEECH.

[65]  Sanjit K. Mitra,et al.  An efficient method for the removal of impulse noise from speech and audio signals , 1998, ISCAS '98. Proceedings of the 1998 IEEE International Symposium on Circuits and Systems (Cat. No.98CH36187).

[66]  A. Noll Cepstrum pitch determination. , 1967, The Journal of the Acoustical Society of America.

[67]  A M Noll,et al.  Clipstrum pitch determination. , 1968, Journal of the Acoustical Society of America.

[68]  Weiping Zhu,et al.  On the estimation of pitch of noisy speech based on time and frequency domain representations , 2008, 2008 Canadian Conference on Electrical and Computer Engineering.

[69]  Hajime Kobayashi,et al.  Weighted autocorrelation for pitch extraction of noisy speech , 2001, IEEE Trans. Speech Audio Process..

[70]  C. Nadeu,et al.  Pitch determination using the cepstrum of the one-sided autocorrelation sequence , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[71]  Masaki Nishio,et al.  Changes in Speaking Fundamental Frequency Characteristics with Aging , 2005, Folia Phoniatrica et Logopaedica.

[72]  Y. H. Gu HMM-based noisy-speech pitch contour estimation , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[73]  Ronald W. Schafer,et al.  Real-time digital hardware pitch detector , 1976 .

[74]  Salina Abdul Samad,et al.  Pitch detection of speech signals using the cross-correlation technique , 2000, 2000 TENCON Proceedings. Intelligent Systems and Technologies for the New Millennium (Cat. No.00CH37119).

[75]  Chong Kwan Un,et al.  A performance comparison of pitch extraction algorithms for noisy speech , 1984, ICASSP.

[76]  Eliathamby Ambikairajah,et al.  A Novel Method for Automatic Tonal and Non-Tonal Language Classification , 2007, 2007 IEEE International Conference on Multimedia and Expo.

[77]  Zhenyang Wu,et al.  Modified AMDF pitch detection algorithm , 2003, Proceedings of the 2003 International Conference on Machine Learning and Cybernetics (IEEE Cat. No.03EX693).

[78]  J. Flanagan Speech Analysis, Synthesis and Perception , 1971 .

[79]  Shih-Chien Yang,et al.  A pitch extraction algorithm based on LPC inverse filtering and AMDF , 1977 .

[80]  Yannis Stylianou,et al.  Modeling Speech Based on Harmonic Plus Noise Models , 2004, Summer School on Neural Networks.

[81]  Eliathamby Ambikairajah,et al.  Automatic Tonal and Non-Tonal Language Classification and Language Identification Using Prosodic Information , 2006 .

[82]  Jhing-Fa Wang,et al.  Extraction of pitch information in noisy speech using wavelet transform with aliasing compensation , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[83]  Shlomo Dubnov,et al.  Maximum a-posteriori probability pitch tracking in noisy environments using harmonic model , 2004, IEEE Transactions on Speech and Audio Processing.

[84]  David Laurenson,et al.  Estimating clean speech thresholds for perceptual based speech enhancement , 1999, Proceedings of the 1999 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. WASPAA'99 (Cat. No.99TH8452).

[85]  Hongbing Hu,et al.  A spectral/temporal method for robust fundamental frequency tracking. , 2008, The Journal of the Acoustical Society of America.

[86]  Hideki Kawahara,et al.  YIN, a fundamental frequency estimator for speech and music. , 2002, The Journal of the Acoustical Society of America.

[87]  A. Noll Short‐Time Spectrum and “Cepstrum” Techniques for Vocal‐Pitch Detection , 1964 .

[88]  Hideki Kawahara,et al.  Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds , 1999, Speech Commun..

[89]  Taoufik En-Najjary,et al.  A new method for pitch prediction from spectral envelope and its application in voice conversion , 2003, INTERSPEECH.

[90]  Xavier Rodet,et al.  Speech analysis and synthesis methods based on spectral envelopes and voiced/unvoiced functions , 1987, ECST.

[91]  Anders Eriksson,et al.  The frequency range of the voice fundamental in the speech of male and female adults , 1993 .

[92]  Xavier Rodet,et al.  Fundamental frequency estimation and tracking using maximum likelihood harmonic matching and HMMs , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[93]  Xuejing Sun,et al.  Pitch determination and voice quality analysis using Subharmonic-to-Harmonic Ratio , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[94]  Tohru Takagi,et al.  A method for pitch extraction of speech signals using autocorrelation functions through multiple window lengths , 2000 .

[95]  Douglas A. Reynolds Gaussian Mixture Models , 2009, Encyclopedia of Biometrics.

[96]  Ahmet M. Kondoz,et al.  Pitch detection of speech signals using segmented autocorrelation , 1995 .

[97]  Leah H. Jamieson,et al.  A probabilistic approach to AMDF pitch detection , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[98]  Shubha Kadambe,et al.  Application of the wavelet transform for pitch detection of speech signals , 1992, IEEE Trans. Inf. Theory.

[99]  Goran S. Jovanovic A new algorithm for speech fundamental frequency estimation , 1986, IEEE Trans. Acoust. Speech Signal Process..

[100]  Hui Li,et al.  A Pitch Detection Algorithm Based on AMDF and ACF , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[101]  I. Titze The physics of small-amplitude oscillation of the vocal folds. , 1988, The Journal of the Acoustical Society of America.

[102]  R. Gray,et al.  Distortion measures for speech processing , 1980 .

[103]  P. Boersma Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound , 1993 .

[104]  Xu Gang,et al.  Speech pitch period estimation using circular AMDF , 2003 .

[105]  A. Lacroix,et al.  Accurate pitch estimation using digital filters , 1977 .

[106]  S. Mahmoud,et al.  The third-order cumulant of speech signals with application to reliable pitch estimation , 1998, Ninth IEEE Signal Processing Workshop on Statistical Signal and Array Processing (Cat. No.98TH8381).

[107]  Jean Rouat,et al.  A pitch determination and voiced/unvoiced decision algorithm for noisy speech , 1995, Speech Commun..

[108]  Chin-Teng Lin,et al.  Fundamental frequency estimation based on the joint time-frequency analysis of harmonic spectral structure , 2001, IEEE Trans. Speech Audio Process..

[109]  Amro El-Jaroudi,et al.  Discrete all-pole modeling , 1991, IEEE Trans. Signal Process..

[110]  Hwai-Tsu Hu,et al.  Usefulness of the Comb Filtering Output for Voiced/Unvoiced Classification and Pitch Detection , 2009, 2009 International Conference on Signal Processing Systems.

[111]  Wei-Ping Zhu,et al.  A Robust Pitch Estimation Algorithm in Noise , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[112]  J. L. Miller Interactions in processing segmental and suprasegmental features of speech. , 1978, Perception & psychophysics.

[113]  J. Markel,et al.  The SIFT algorithm for fundamental frequency estimation , 1972 .

[114]  Olivier Rosec,et al.  Speech spectral envelope estimation through explicit control of peak evolution in time , 2010, 10th International Conference on Information Science, Signal Processing and their Applications (ISSPA 2010).

[115]  Ian S. Burnett,et al.  Low Delay Pitch Detection Using Dynamic-Programming/Viterbi Techniques , 1996, Fourth International Symposium on Signal Processing and Its Applications.

[116]  R. Kumaresan,et al.  Model-based approach to envelope and positive instantaneous frequency estimation of signals with speech applications , 1999 .

[118]  Weihua Zhang,et al.  Investigation of the spectral envelope estimation vocoder and improved pitch estimation based on the sinusoidal speech model , 1997, Proceedings of ICICS, 1997 International Conference on Information, Communications and Signal Processing. Theme: Trends in Information Systems Engineering and Wireless Multimedia Communications (Cat..

[119]  W. H. Holmes,et al.  Harmonic-plus-noise decomposition and its application in voiced/unvoiced classification , 1997, TENCON '97 Brisbane - Australia. Proceedings of IEEE TENCON '97. IEEE Region 10 Annual Conference. Speech and Image Technologies for Computing and Telecommunications (Cat. No.97CH36162).

[120]  Daniel W. Griffin,et al.  Multi-band excitation vocoder , 1987 .

[121]  Philippe Martin Comparison of pitch detection by cepstrum and spectral comb analysis , 1982, ICASSP.

[122]  Eugene Coyle,et al.  Multi Pitch Estimation by using IIR Comb Filters , 2005 .

[123]  Eliathamby Ambikairajah,et al.  Estimating the pitch period of voiced speech , 1980 .

[124]  Zhenmin Tang,et al.  A Method Combining LPC-Based Cepstrum and Harmonic Product Spectrum for Pitch Detection , 2006, 2006 International Conference on Intelligent Information Hiding and Multimedia.

[125]  Yannis Stylianou,et al.  HNM: a simple, efficient harmonic+noise model for speech , 1993, Proceedings of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[126]  Saeed Vaseghi,et al.  Multimedia Signal Processing: Theory and Applications in Speech, Music and Communications , 2007 .

[127]  F. Marir,et al.  The fourth order cumulant of speech signals applied to pitch estimation , 2004, 2004 IEEE International Conference on Industrial Technology, 2004. IEEE ICIT '04..

[128]  Wolfgang Hess,et al.  Accurate time-domain pitch determination of speech signals by means of a laryngograph , 1987, Speech Commun..

[129]  Wei-Ping Zhu,et al.  Pitch Estimation Based on a Harmonic Sinusoidal Autocorrelation Model and a Time-Domain Matching Scheme , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[130]  Ronald W. Schafer,et al.  Theory and Applications of Digital Speech Processing , 2010 .

[131]  Takao Kobayashi,et al.  Harmonics tracking and pitch extraction based on instantaneous frequency , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[132]  G. Rozinaj,et al.  A hybrid pitch period estimation method based on HNM model , 2007, ELMAR 2007.

[133]  J. Moorer,et al.  The optimum comb method of pitch period analysis of continuous digitized speech , 1974 .