A Multipitch Tracking Algorithm Based on Wavelet Packet Analysis

In view of the limitation of many multipitch detection methods in single-channel speech, a robust and accurate multipitch estimation method for multiple voices is processed. In this study, our approach is based on the spectral analysis of the wavelet packet analysis. It utilizes the quasi-periodicity in a short time frame of speech, and gets candidate pitche in every frame by using peak selection and searching algorithm. The multipitch envelope of the mixed signal is obtained by neighborhood analysis and determined from these candidates by virtue of the fact that pitch should change rather smoothly in consecutive frame. Simulation results showed that the proposed method can robustly estimate fundamental frequency for mixed speech from single-channel speech.

[1]  Matti Karjalainen,et al.  A computationally efficient multipitch analysis model , 2000, IEEE Trans. Speech Audio Process..

[2]  Hema A. Murthy,et al.  Two-pitch tracking in co-channel speech using modified group delay functions , 2017, Speech Commun..

[3]  Guy J. Brown,et al.  A multi-pitch tracking algorithm for noisy speech , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4]  Zhou Chao Detection of double pitches for Chinese Putonghua , 2011 .

[5]  John S. Baras,et al.  Properties of the multiscale maxima and zero-crossings representations , 1993, IEEE Trans. Signal Process..

[6]  Andreas Büchner,et al.  Wavelet Packet Filterbank for Speech Processing Strategies in Cochlear Implants , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[7]  Guy J. Brown,et al.  Separation of speech from interfering sounds based on oscillatory correlation , 1999, IEEE Trans. Neural Networks.

[8]  Poorva G. Parande,et al.  A study of the cocktail party problem , 2017, 2017 International Conference on Electrical and Computing Technologies and Applications (ICECTA).

[9]  Franz Pernkopf,et al.  A Probabilistic Interaction Model for Multipitch Tracking With Factorial Hidden Markov Models , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[10]  DeLiang Wang,et al.  Robust pitch tracking in noisy speech using speaker-dependent deep neural networks , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[11]  Noureddine Ellouze,et al.  Spectral Multi-Scale Analysis for Multi-Pitch Tracking , 2009, 2009 IEEE 13th Digital Signal Processing Workshop and 5th IEEE Signal Processing Education Workshop.

[12]  Shubha Kadambe,et al.  Application of the wavelet transform for pitch detection of speech signals , 1992, IEEE Trans. Inf. Theory.