IMPROVED HIDDEN MARKOV MODEL PARTIAL TRACKING THROUGH TIME-FREQUENCY ANALYSIS

In this article we propose a modification to the combinatorial hidden Markov model developed in [1] for tracking partial frequency trajectories. We employ the Wigner-Ville distribution and Hough transform in order to (re)estimate the frequency and chirp rate of partials in each analysis frame. We estimate the initial phase and amplitude of each partial by minimizing the squared error in the time-domain. We then formulate a new scoring criterion for the hidden Markov model which makes the tracker more robust for non-stationary and noisy signals. We achieve good performance tracking crossing linear chirps and crossing FM signals in white noise as well as real instrument recordings.

[1]  Jeremy J. Wells,et al.  REAL-TIME PARTIAL TRACKING IN AN AUGMENTED ADDITIVE SYNTHESIS SYSTEM , 2002 .

[2]  Gaël Richard,et al.  Estimation of Frequency for AM/FM Models Using the Phase Vocoder Framework , 2008, IEEE Transactions on Signal Processing.

[3]  L. Cohen,et al.  Time-frequency distributions-a review , 1989, Proc. IEEE.

[4]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[5]  Mathieu Lagrange,et al.  Tracking partials for the sinusoidal modeling of polyphonic sounds , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[6]  Xavier Serra,et al.  A sound analysis/synthesis system based on a deterministic plus stochastic decomposition , 1990 .

[7]  Mathieu Lagrange,et al.  Enhanced Partial Tracking Using Linear Prediction , 2003 .

[8]  E. Wigner On the quantum correction for thermodynamic equilibrium , 1932 .

[9]  Lippold Haken,et al.  Bandwidth Enhanced Sinusoidal Modeling in Lemur , 1995, ICMC.

[10]  Xavier Rodet,et al.  Tracking of partials for additive sound synthesis using hidden Markov models , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[11]  Thomas F. Quatieri,et al.  Speech analysis/Synthesis based on a sinusoidal representation , 1986, IEEE Trans. Acoust. Speech Signal Process..

[12]  Sergio Barbarossa,et al.  Analysis of multicomponent LFM signals by a combined Wigner-Hough transform , 1995, IEEE Trans. Signal Process..

[13]  Axel Röbel,et al.  Adaptive additive modeling with continuous parameter trajectories , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[14]  Julius O. Smith,et al.  AM/FM rate estimation for time-varying sinusoidal modeling , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[15]  Julius O. Smith,et al.  Spectral modeling synthesis: A sound analysis/synthesis based on a deterministic plus stochastic decomposition , 1990 .

[16]  T. Claasen,et al.  THE WIGNER DISTRIBUTION - A TOOL FOR TIME-FREQUENCY SIGNAL ANALYSIS , 1980 .