论文信息 - Alias-free, multiresolution sinusoidal modeling for polyphonic, wideband audio

Alias-free, multiresolution sinusoidal modeling for polyphonic, wideband audio

We describe an improved method of generating more accurate sinusoidal parameters (amplitude, frequency, phase) from a wideband polyphonic audio source in a multiresolution, non-aliased fashion. This significantly improves upon previous work of sinusoidal modeling that assumes a single-pitched monophonic source, such as speech or an individual musical instrument. In addition to a more general analysis, we can now perform high-quality transformations such as time-stretching and pitch-shifting on polyphonic audio with ease.

Julius O. Smith | Scott Levine | T. S. Verma

[1] M. Goodwin,et al. Time-frequency signal models for music analysis, transformation, and synthesis , 1996, Proceedings of Third International Symposium on Time-Frequency and Time-Scale Analysis (TFTS-96).

[2] Xavier Serra,et al. Musical Sound Modeling with Sinusoids plus Noise , 1997 .

[3] Louis Dunn Fielder,et al. ISO/IEC MPEG-2 Advanced Audio Coding , 1997 .

[4] Martin Vetterli,et al. Optimal time segmentation for signal modeling and compression , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5] B. Edler. Aliasing reduction in sub-bands of cascaded filter banks with decimation , 1992 .

[6] Julius O. Smith,et al. PARSHL: An Analysis/Synthesis Program for Non-Harmonic Sounds Based on a Sinusoidal Representation , 1987, ICMC.

[7] Edward H. Adelson,et al. The Laplacian Pyramid as a Compact Image Code , 1983, IEEE Trans. Commun..

[8] Kuldip K. Paliwal,et al. Speech Coding and Synthesis , 1995 .

[9] Teresa H. Y. Meng,et al. Transient Modeling Synthesis: a flexible analysis/synthesis tool for transient signals , 1998, ICMC.

[10] David V. Anderson. Speech analysis and coding using a multi-resolution sinusoidal transform , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[11] Udo Zölzer,et al. Multi-complementary filter bank , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[12] Abeer Alwan,et al. Spectral analysis of subband filtered signals , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[13] D. Thomson,et al. Spectrum estimation and harmonic analysis , 1982, Proceedings of the IEEE.

[14] J. Smith,et al. A Sound Decomposition System Based on a Deterministic plus Residual Model , 1990 .

[15] Daniel P. W. Ellis,et al. A Wavelet Based Sinusoid Model of Sound for Auditory Signal Separation , 1991, ICMC.

[16] Thomas F. Quatieri,et al. Speech analysis/Synthesis based on a sinusoidal representation , 1986, IEEE Trans. Acoust. Speech Signal Process..

[17] Ahmed H. Tewfik,et al. Low bit rate high quality audio coding with combined harmonic and wavelet representations , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.