Alias-free, multiresolution sinusoidal modeling for polyphonic, wideband audio

We describe an improved method for estimating more accurate sinusoidal parameters (amplitude, frequency, and phase) from a wideband polyphonic audio source in a multiresolution, alias-free fashion. This improves significantly on previous sinusoidal modeling work, which assumes a single-pitched monophonic source such as speech or an individual musical instrument. Besides making the analysis more general, the method allows high-quality transformations such as time-stretching and pitch-shifting to be applied to polyphonic audio.
