Harmonic decomposition of audio signals with matching pursuit

We introduce a dictionary of elementary waveforms, called harmonic atoms, that extends the Gabor dictionary and is well suited to the natural harmonic structures of audio signals. By modifying the "standard" matching pursuit, we define a new pursuit along with a fast algorithm, the fast harmonic matching pursuit, which approximates N-dimensional audio signals with a linear combination of M harmonic atoms. Our algorithm has a computational complexity of O(MKN), where K is the number of partials in a given harmonic atom. The decomposition method is demonstrated on musical recordings, and we describe a simple note detection algorithm that shows how a harmonic matching pursuit can be used to detect notes even in difficult situations, e.g., very different note durations, strong reverberation, and overlapping notes.
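To make the idea concrete, here is a minimal sketch of a greedy pursuit over harmonic atoms, not the fast algorithm of the paper. It rests on several simplifying assumptions that are not taken from the abstract: a single fixed Hann window spanning the whole signal (no scale or time-shift search), a user-supplied grid of candidate fundamentals in cycles per sample, exactly harmonic partials at k times the fundamental, and partials treated as approximately orthogonal so that the energy captured by an atom is the sum of squared correlations. The names `hann_window`, `partial_coeff`, and `harmonic_pursuit` are hypothetical. Each iteration of this naive version costs O(FKN) for F candidate fundamentals.

```python
import cmath
import math


def hann_window(n):
    """Unit-norm Hann window of length n (assumed fixed analysis window)."""
    w = [0.5 - 0.5 * math.cos(2.0 * math.pi * i / (n - 1)) for i in range(n)]
    norm = math.sqrt(sum(v * v for v in w))
    return [v / norm for v in w]


def partial_coeff(residual, window, freq):
    """Complex correlation of the residual with a windowed complex sinusoid."""
    return sum(
        r * w * cmath.exp(-2j * math.pi * freq * i)
        for i, (r, w) in enumerate(zip(residual, window))
    )


def harmonic_pursuit(signal, f0_grid, num_partials, num_atoms):
    """Greedy selection of harmonic atoms on a grid of fundamentals.

    Frequencies are in cycles per sample; every partial k * f0 is assumed
    to lie well inside (0, 0.5) so that the cosine and sine components of a
    partial are nearly orthogonal.
    Returns (atoms, residual), where each atom is (f0, [c_1, ..., c_K]).
    """
    residual = list(signal)
    window = hann_window(len(signal))
    atoms = []
    for _ in range(num_atoms):
        best = None
        for f0 in f0_grid:
            coeffs = [
                partial_coeff(residual, window, k * f0)
                for k in range(1, num_partials + 1)
            ]
            # Approximate captured energy, assuming quasi-orthogonal partials.
            energy = sum(abs(c) ** 2 for c in coeffs)
            if best is None or energy > best[0]:
                best = (energy, f0, coeffs)
        _, f0, coeffs = best
        # Subtract the approximate real-valued projection onto each partial.
        for k, c in enumerate(coeffs, start=1):
            for i, w in enumerate(window):
                phase = cmath.exp(2j * math.pi * k * f0 * i)
                residual[i] -= 2.0 * (c * phase).real * w
        atoms.append((f0, coeffs))
    return atoms, residual


if __name__ == "__main__":
    # Synthetic note: fundamental at 0.05 cycles/sample with three partials.
    n = 512
    w = hann_window(n)
    note = [
        w[i] * sum(math.cos(2 * math.pi * k * 0.05 * i + 0.3 * k) / k
                   for k in range(1, 4))
        for i in range(n)
    ]
    atoms, residual = harmonic_pursuit(note, [0.025, 0.05, 0.075, 0.1], 3, 1)
    print("selected fundamental:", atoms[0][0])
    print("residual energy:", sum(r * r for r in residual))
```

Grouping the K partials into one atom is what lets the pursuit favor the true fundamental over candidates that match only a single partial of the note; a note detector along the lines described above would then read pitch and timing off the selected atoms.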
