Single and Piecewise Polynomials for Modeling of Pitched Sounds

We present a compact approach to simultaneous modeling of non-stationary harmonic and transient components in pitched sound sources. The harmonic and transient components are described by separate models which are built from a common sinusoidal basis modified by a joint action of single and linear piecewise time polynomials respectively. A single polynomial accounts for slow and continuous signal time variations, while various piecewise polynomials can capture fast signal changes on smaller subintervals within the analysis window. The resulting model is linear-in-parameters and the solution to the corresponding linear system of equations provides correct model parameter estimates according to the signal content in the analysis window. The model is extended to deal with mixtures of sounds, where harmonics clustered in a small bandwidth are jointly modeled as a single harmonic. The comparative results suggest that the proposed model outperforms two reference modeling methods in terms of modeling errors and number of parameters.

[1]  Michael Unser,et al.  Splines: a perfect fit for signal and image processing , 1999, IEEE Signal Process. Mag..

[2]  Robert Strandh,et al.  FAST ADDITIVE SOUND SYNTHESIS USING POLYNOMIALS , 2006 .

[3]  Michael M. Goodwin Multiresolution sinusoidal modeling using adaptive segmentation , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[4]  Mark B. Sandler,et al.  A tutorial on onset detection in music signals , 2005, IEEE Transactions on Speech and Audio Processing.

[5]  Johan Schoukens,et al.  Time-variant harmonic and transient signal modeling by joint polynomial and piecewise linear approximation , 2010, 2010 18th European Signal Processing Conference.

[6]  Richard Heusdens,et al.  Modifying transients for efficient coding of audio , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[7]  Rémi Gribonval,et al.  Harmonic decomposition of audio signals with matching pursuit , 2003, IEEE Trans. Signal Process..

[8]  Ju Sung Park,et al.  Multiresolution sinusoidal model with dynamic segmentation for timescale modification of polyphonic audio signals , 2005, IEEE Transactions on Speech and Audio Processing.

[9]  Edward A. Lee,et al.  Adaptive Signal Models: Theory, Algorithms, and Audio Applications , 1998 .

[10]  R. T. Schumacher,et al.  ON THE OSCILLATIONS OF MUSICAL-INSTRUMENTS , 1983 .

[11]  Giovanni Bucci,et al.  New ADC with piecewise linear characteristic: case study-implementation of a smart humidity sensor , 2000, IEEE Trans. Instrum. Meas..

[12]  Johan Schoukens,et al.  Time-variant harmonic signal modeling by using polynomial approximation and fully automated spectral analysis , 2009, 2009 17th European Signal Processing Conference.

[13]  Martin Vetterli,et al.  Atomic signal models based on recursive filter banks , 1997, Conference Record of the Thirty-First Asilomar Conference on Signals, Systems and Computers (Cat. No.97CB36136).

[14]  D. A. Luce,et al.  Physical Correlates of Brass‐Instrument Tones , 1967 .

[15]  Andreas Jakobsson,et al.  Multi-Pitch Estimation , 2009, Multi-Pitch Estimation.

[16]  Joerg F. Hipp,et al.  Time-Frequency Analysis , 2014, Encyclopedia of Computational Neuroscience.

[17]  Nicolás Ruiz-Reyes,et al.  Polyphonic transcription based on temporal evolution of spectral similarity of gaussian mixture models , 2009, 2009 17th European Signal Processing Conference.

[18]  Michael M. Goodwin,et al.  Adaptive Signal Models , 1998 .

[19]  José Manuel Iñesta Quereda,et al.  Multiple fundamental frequency estimation using Gaussian smoothness , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[20]  Anssi Klapuri,et al.  Multipitch Analysis of Polyphonic Music and Speech Signals Using an Auditory Model , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[21]  Roland Badeau,et al.  High-resolution spectral analysis of mixtures of complex exponentials modulated by polynomials , 2006, IEEE Transactions on Signal Processing.

[22]  Michael A. Saunders,et al.  Atomic Decomposition by Basis Pursuit , 1998, SIAM J. Sci. Comput..

[23]  Gang Li,et al.  Signal representation based on instantaneous amplitude models with application to speech synthesis , 2000, IEEE Trans. Speech Audio Process..

[24]  Sabine Van Huffel,et al.  Speech compression based on exact modeling and structured total least norm optimization , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[25]  Laurent Daudet,et al.  A Review on Techniques for the Extraction of Transients in Musical Signals , 2005, CMMR.

[26]  Julius O. Smith,et al.  Alias-free, multiresolution sinusoidal modeling for polyphonic, wideband audio , 1997, Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics.

[27]  A. Willsky,et al.  HIGH RESOLUTION PURSUIT FOR FEATURE EXTRACTION , 1998 .

[28]  Johan Schoukens,et al.  On The Polynomial Approximation for Time-Variant Harmonic Signal Modeling , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[29]  Karim Abed-Meraim,et al.  Efficient parametric modeling for audio transients , 2002 .

[30]  Stéphane Mallat,et al.  Matching pursuits with time-frequency dictionaries , 1993, IEEE Trans. Signal Process..

[31]  Karim Abed-Meraim,et al.  Damped and delayed sinusoidal model for transient signals , 2005, IEEE Transactions on Signal Processing.

[32]  Patrick J. Wolfe,et al.  Analysis of reassigned spectrograms for musical transcription , 2001, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575).

[33]  John Strawn,et al.  Approximation and Syntactic Analysis of Amplitude and Frequency Functions for Digital Sound Synthesis , 1980, ICMC.

[34]  Patrick Flandrin,et al.  Improving the readability of time-frequency and time-scale representations by the reassignment method , 1995, IEEE Trans. Signal Process..

[35]  Thierry Blu,et al.  Linear interpolation revitalized , 2004, IEEE Transactions on Image Processing.

[36]  Sabine Van Huffel,et al.  Perceptual audio modeling with exponentially damped sinusoids , 2005, Signal Process..

[37]  M.G. Christensen,et al.  Multi-Pitch Estimation Using Harmonic Music , 2006, 2006 Fortieth Asilomar Conference on Signals, Systems and Computers.

[38]  Jesper Jensen,et al.  Exponential sinusoidal modeling of transitional speech segments , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[39]  Julius O. Smith,et al.  Detection and modeling of transient audio signals with prior information , 2005 .

[40]  Axel Röbel,et al.  Transient detection and preservation in the phase vocoder , 2003, ICMC.

[41]  Laurent Daudet,et al.  Sparse and structured decompositions of signals with the molecular matching pursuit , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[42]  Shie Qian,et al.  Signal representation using adaptive normalized Gaussian functions , 1994, Signal Process..

[43]  Leon O. Chua,et al.  Canonical piecewise-linear representation , 1988 .

[44]  Jim Woodhouse,et al.  Plucked guitar transients: Comparison of measurements and synthesis (vol 90, pg 945, 2004) , 2004 .

[45]  Kelly Fitz,et al.  Correction to: 'On the Use of Time/Frequency Reassignment in Additive Sound Modeling' , 2002 .

[46]  Kyogu Lee,et al.  Explicit onset modeling of sinusoids using time reassignment , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[47]  Jesper Jensen,et al.  A perceptual subspace approach for modeling of speech and audio signals with damped sinusoids , 2004, IEEE Transactions on Speech and Audio Processing.

[48]  J. Keeler The attack transients of some organ pipes , 1972 .

[49]  Jim Woodhouse,et al.  On the Synthesis of Guitar Plucks , 2004 .

[50]  Manuel Rosa-Zurera,et al.  Transient modeling by matching pursuits with a wavelet dictionary for parametric audio coding , 2004, IEEE Signal Processing Letters.

[51]  J. Chick,et al.  Transient behaviour in the motion of the brass player's lips during a lip slur , 2009 .

[52]  M. David Freedman Analysis of Musical Instrument Tones , 1967 .

[53]  J. Keeler Piecewise-periodic analysis of almost-periodic sounds and musical transients , 1972 .

[54]  N. Ruiz Reyes,et al.  Adaptive Signal Modeling Based on Sparse Approximations for Scalable Parametric Audio Coding , 2010, IEEE Transactions on Audio, Speech, and Language Processing.