Scalable interconnect networks for Discrete Cosine Transforms (DCT) for mobile and multimedia application

Scalable architectures were proposed for Discrete Cosine Transform (DCT). Number of processing elements (PE) can be reduced significantly using partial column structure for computing the DCT transform. This feature is very desirable for multimedia applications usage in handheld devices. As per transform computation, data reordering is required between stages (columns) where intermediate computed values are saved in memory-like temporary locations called FIFO's. A scalable interconnect network for both global and local data reordering and its implementation is presented in this paper. Scalability is based on transform size and desired number of processing elements (PE). The structure gives choice flexibility of throughput vs. complexity (cost and area.) of the overall system.

[1]  David Akopian Systematic approaches to parallel architectures for DSP algorithms , 1997, Signal Process..

[2]  David Akopian,et al.  Multi-port interconnection networks for radix-R algorithms , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[3]  Henrique S. Malvar,et al.  Signal processing with lapped transforms , 1992 .

[4]  David Akopian,et al.  Constant geometry algorithm for discrete cosine transform , 2000, IEEE Trans. Signal Process..

[5]  Ramesh A. Gopinath,et al.  Modulated filter banks and wavelets-a general unified theory , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[6]  George Miel,et al.  Constant Geometry Fast Fourier Transforms on Array Processors , 1993, IEEE Trans. Computers.

[7]  David Akopian,et al.  A family of scalable FFT architectures and an implementation of 1024-point radix-2 FFT for real-time communications , 2008, 2008 IEEE International Conference on Computer Design.

[8]  Earl E. Swartzlander,et al.  Modular pipeline fast fourier transform algorithm , 2003 .

[9]  Jaakko Astola,et al.  Scalable interconnection networks for partial column array processor architectures , 2000, 2000 IEEE International Symposium on Circuits and Systems. Emerging Technologies for the 21st Century. Proceedings (IEEE Cat No.00CH36353).

[10]  Jaakko Astola,et al.  Architecture-oriented regular algorithms for discrete sine and cosine transforms , 1996, Electronic Imaging.

[11]  J. Tukey,et al.  An algorithm for the machine calculation of complex Fourier series , 1965 .

[12]  Steve Gorman,et al.  Partial column FFT pipelines , 1995 .

[13]  David Akopian,et al.  Multistage interconnection networks for k/n rate Viterbi decoders , 1998, IEEE GLOBECOM 1998 (Cat. NO. 98CH36250).

[14]  P. Yip,et al.  Discrete Cosine Transform: Algorithms, Advantages, Applications , 1990 .