PSYCHOACOUSTICALLY MOTIVATED NONUNIFORM COSINE MODULATED POLYPHASE FILTER BANK

Nonuniform cosine modulated filter bank is presented as valuable solution for perceptual sound processing systems. Proposed structure connects polyphase concept with idea of warping frequency by all-pass transformation. So it has uncommon flexibility of bandwidths shaping, relatively easy design and good performance. System is considered in context of criticalbands model approximation as front-end for audio coding and enhancement. Theoretical basics of fundamental ideas are reviewed and connecting them into final solution is shown. Issues of the design and details of the implementation are discussed, basing on examples of filter banks approximating well-known nonlinear Bark and ERB psychoacoustic scales. Perspectives of real-time dynamic tuning up the bandwidths are also considered.

[1]  Peter Jax,et al.  A novel psychoacoustically motivated audio enhancement algorithm preserving background noise characteristics , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[2]  Unto K. Laine,et al.  Frequency-warped signal processing for audio applications , 2000 .

[3]  Alexander A. Petrovsky,et al.  Speech enhancement system for hands-free telephone based on the psychoacoustically motivated filter bank with allpass frequency transformation # , 1999, EUROSPEECH.

[4]  A. Oppenheim,et al.  Computation of spectra with unequal resolution using the fast Fourier transform , 1971 .

[5]  M. Kappelan,et al.  Flexible nonuniform filter banks using allpass transformation of multiple order , 1996, 1996 8th European Signal Processing Conference (EUSIPCO 1996).

[6]  P. Yip,et al.  Discrete Cosine Transform: Algorithms, Advantages, Applications , 1990 .

[7]  Richard M. Schwartz,et al.  Enhancement of speech corrupted by acoustic noise , 1979, ICASSP.

[8]  John Mourjopoulos,et al.  Improving the intelligibility of noisy speech using an audible noise suppression technique , 1997, EUROSPEECH.

[9]  Marek Parfieniuk,et al.  FILTER BANK AND ITS APLLICATION TO SPEECH PROCESSING , 2000 .

[10]  P. Vaidyanathan Multirate Systems And Filter Banks , 1992 .

[11]  Julius O. Smith,et al.  Bark and ERB bilinear transforms , 1999, IEEE Trans. Speech Audio Process..

[12]  Gerhard Doblinger,et al.  Computationally efficient speech enhancement by spectral minima tracking in subbands , 1995, EUROSPEECH.

[13]  Fabrizio Argenti,et al.  Non-uniform filter banks based on a multi-prototype cosine modulation , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.