A pitch determination algorithm based on subharmonic-to-harmonic ratio

In the present paper, a pitch determination algorithm (PDA) based on Subharmonic-to-Harmonic Ratio (SHR) is proposed. The algorithm is motivated by the results of a recent study on the perceived pitch of alternate pulse cycles in speech [1]. The algorithm employs a logarithmic frequency scale and a spectrum shifting technique to obtain the amplitude summation of harmonics and subharmonics, respectively. Through comparing the amplitude ratio of subharmonics and harmonics with the pitch perception results, the pitch of normal speech as well as speech with alternate pulse cycles (APC) can be determined. . Evaluation of the algorithm is performed on CSTR’s database and on synthesized speech with APC. The results show that this algorithm is one of the most reliable PDAs. Furthermore, superior to most other algorithms, it handles subharmonics reasonably well.

[1]  Ingo R. Titze,et al.  Principles of voice production , 1994 .

[2]  B Gold,et al.  Parallel processing techniques for estimating pitch periods of speech in the time domain. , 1969, The Journal of the Acoustical Society of America.

[3]  M. Schroeder Period histogram and product spectrum: new methods for fundamental-frequency measurement. , 1968, The Journal of the Acoustical Society of America.

[4]  Michael S. Phillips A feature‐based time domain pitch tracker , 1985 .

[5]  Wolfgang J. Hess,et al.  Pitch and voicing determination , 1992 .

[6]  D. J. Hermes,et al.  Measurement of pitch by subharmonic summation. , 1988, The Journal of the Acoustical Society of America.

[7]  Leah H. Jamieson,et al.  A probabilistic approach to AMDF pitch detection , 1994, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[8]  George R. Doddington,et al.  An integrated pitch tracking algorithm for speech systems , 1983, ICASSP.

[9]  Xuejing Sun The perceived pitch of synthesized vowels with alternate pulse cycles , 2000 .

[10]  A. Noll Cepstrum pitch determination. , 1967, The Journal of the Acoustical Society of America.

[11]  Eyal Yair,et al.  Super resolution pitch determination of speech signals , 1991, IEEE Trans. Signal Process..