Paper information - A new fast algorithm for the unified forward and inverse MDCT/MDST computation

A new fast algorithm for the unified forward and inverse MDCT/MDST computation

The modified discrete cosine transform (MDCT) and modified discrete sine transform (MDST) are employed in subband/transform coding schemes as the analysis/synthesis filter banks based on the concept of time domain aliasing cancellation (TDAC). Princen, Bradley and Johnson defined two types of the MDCT, specifically for evenly stacked and oddly stacked analysis/synthesis systems. The MDCT is the basic processing component in international audio coding standards and commercial products for high-quality audio compression. Almost all existing audio coding systems have used complex-valued or real-valued FFT algorithms, and the DCT/DST of type IV (DCT-IV/DST-IV), for fast MDCT computation. A new fast and efficient algorithm for unified forward and inverse MDCT/MDST computation in the oddly stacked system is proposed. It is based on the DCT/DST of types II and III (DCT-II/DST-II, DCT-III/DST-III), and uses only real arithmetic. The corresponding generalized signal flow graph is regular and structurally simple, and it enables computation of the MDCT/MDST and their inverses for any N divisible by 4 (N being the length of the data sequence). Consequently, the new fast algorithm can be adopted for MDCT computation in current audio coding standards such as the MPEG family (MPEG-1, MPEG-2, MPEG-2 Advanced Audio Coding and MPEG-4 audio), and in commercial products (proprietary audio coding algorithms) such as the Sony MiniDisc/ATRAC/ATRAC2/SDDS digital audio coding systems, the AT&T Perceptual Audio Coder (PAC) or Lucent Technologies PAC/Enhanced PAC/Multichannel PAC, and the Dolby Labs AC-3 digital audio compression algorithm.
Besides having some interesting properties, the new fast algorithm provides an efficient implementation of the forward and inverse MDCT computation for layer III in MPEG audio coding, where the length of data blocks N ≠ 2^n. In particular, for the AC-3 algorithm, it is shown how both the proposed new MDCT/MDST algorithm and existing fast algorithms/computational architectures for the computation of discrete sinusoidal transforms of real data sequences, such as the DCT-IV/DST-IV, the generalized discrete Fourier transform of type IV (DFT-IV) and the generalized discrete Hartley transform of type IV (DHT-IV), can be used for fast alternate or simultaneous (on-line) MDCT/MDST computation by simple pre- and post-processing of data sequences.
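
To make the TDAC idea in the abstract concrete, the following is a minimal sketch of the oddly stacked MDCT and its inverse, evaluated directly from the textbook definition in O(N^2) operations. It is not the fast DCT-II/DCT-III based algorithm the paper proposes, whose details are not given in this record. The function names, the sine window and the block length N = 12 (divisible by 4 but not a power of two) are illustrative assumptions.

```python
import math


def mdct(block):
    """Direct O(N^2) oddly stacked MDCT: N real inputs -> N/2 coefficients."""
    n_len = len(block)
    half = n_len // 2
    return [
        sum(
            block[n] * math.cos(math.pi / half * (n + 0.5 + half / 2) * (k + 0.5))
            for n in range(n_len)
        )
        for k in range(half)
    ]


def imdct(coeffs):
    """Direct inverse MDCT: N/2 coefficients -> N time-aliased samples.

    The 2/(N/2) scaling is chosen to pair with a window satisfying
    w[n]^2 + w[n + N/2]^2 = 1 (assumption: the sine window below).
    """
    half = len(coeffs)
    return [
        (2.0 / half)
        * sum(
            coeffs[k] * math.cos(math.pi / half * (n + 0.5 + half / 2) * (k + 0.5))
            for k in range(half)
        )
        for n in range(2 * half)
    ]


def sine_window(n_len):
    """Sine window; it satisfies the Princen-Bradley power-complementarity condition."""
    return [math.sin(math.pi * (n + 0.5) / n_len) for n in range(n_len)]


def tdac_roundtrip(signal, n_len):
    """Window, MDCT, IMDCT, window again, then overlap-add with hop N/2.

    Assumes len(signal) is a multiple of N/2. The signal is zero-padded by
    N/2 on each side so that every sample is covered by two blocks.
    """
    hop = n_len // 2
    window = sine_window(n_len)
    padded = [0.0] * hop + list(signal) + [0.0] * hop
    out = [0.0] * len(padded)
    for start in range(0, len(padded) - n_len + 1, hop):
        frame = [padded[start + n] * window[n] for n in range(n_len)]
        rec = imdct(mdct(frame))  # a single block still contains aliasing
        for n in range(n_len):
            out[start + n] += rec[n] * window[n]  # aliasing cancels in the sum
    return out[hop:hop + len(signal)]


if __name__ == "__main__":
    sig = [math.sin(0.3 * i) + 0.1 * i for i in range(24)]
    rec = tdac_roundtrip(sig, 12)
    print(max(abs(a - b) for a, b in zip(sig, rec)))
```

A single block passed through `mdct` and `imdct` does not return the input, because N samples are mapped to only N/2 coefficients. The original samples are recovered only after the overlapping halves of adjacent blocks are added, which is the aliasing cancellation that the filter banks named in the abstract rely on.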

Vladimir Britanak | K. R. Rao
