Time-scale modification of music using a subband approach based on the bark scale

Time-domain time-scaling algorithms are efficient in comparison to their frequency-domain counterparts, but they rely upon the existence of a quasi-periodic signal to produce a high quality output. This requirement makes them unsuitable for use on multi-pitched signals such as polyphonic music. However, time-domain techniques applied on a subband basis can resolve the multi-pitch problem. We propose an improved subband implementation based upon the bark scale for the time-scale modification of music. The new subband approach is supported by psychoacoustic and music theory and subjectively through informal listening tests.

[1]  R. Plomp,et al.  Tonal consonance and critical bandwidth. , 1965, The Journal of the Acoustical Society of America.

[2]  William M. Hartmann,et al.  Psychoacoustics: Facts and Models , 2001 .

[3]  Werner Verhelst,et al.  On the Application of Automatic Waveform Editing for Time Warping Digital and Analog Recordings , 1994 .

[4]  David Dorran,et al.  An Efficient Audio Time-scale Modification Algorithm for use in a Subband Implementation , 2003 .

[5]  Werner Verhelst,et al.  An overlap-add technique based on waveform similarity (WSOLA) for high quality time-scale modification of speech , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6]  Roland K. C. Tan,et al.  A Time-Scale Modification Algorithm Based on the Subband Time-Domain Technique for Broad-Band Signal Applications , 2000 .

[7]  Thomas F. Quatieri,et al.  Shape invariant time-scale and pitch modification of speech , 1992, IEEE Trans. Signal Process..

[8]  Oscar C. Au,et al.  Fast SOLA-based time scale modification using Modified Envelope Matching , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[9]  A. Wilgus,et al.  High quality time-scale modification for speech , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[10]  Eugene Coyle,et al.  High quality time-scale modification of speech using a peak alignment overlap-add algorithm (PAOLA) , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[11]  J. Blauert,et al.  Group delay distortions in electroacoustical systems , 1978 .

[12]  Jean Laroche,et al.  Improved phase vocoder time-scale modification of audio , 1999, IEEE Trans. Speech Audio Process..

[13]  Hugo Fastl,et al.  Psychoacoustics: Facts and Models , 1990 .