Optimized Design Implementation of Direct Memory-Based Hardware for Efficient Resource- Constraint Digital Signal Processing Systems

An advance approach for Direct-Memory-Based hardware for area-delay-power efficient systems for commonly encountered computation-intensive cores of digital signal processing (DSP) systems is presented by combining three techniques, where the memory-size is reduced to one-eighth at the cost of some increase in combinational circuit complexity for signed magnitude numbers. Each of these techniques results in the reduction of the memory size by a factor of two. It is shown that by efficiently combining sign-bit exclusion technique, a different form of anti-symmetric product coding (APC) and a modified odd-multiple-storage (OMS) scheme, we get an optimized direct-memory-based multiplication hardware for resource-constraint DSP systems which provides a reduction in memory size to one-eighth over conventional direct-memorybased hardware, at the cost of a marginal area overhead. The proposed design for small input sizes can be used for efficient implementation of high-precision multiplication by input operand decomposition. The proposed optimized design also offers almost 87.5% and 85% reductions in directmemory size for L=5 bits and L=6 bits signed-magnitude numbers respectively, over conventional direct-memory size. Keywords— Digital signal processing (DSP), direct-memorybased computing, very large scale integration (VLSI).

[1]  Pramod Kumar Meher,et al.  LUT Optimization for Memory-Based Computation , 2010, IEEE Transactions on Circuits and Systems II: Express Briefs.

[2]  Pramod Kumar Meher New look-up-table optimizations for memory-based multiplication , 2009, Proceedings of the 2009 12th International Symposium on Integrated Circuits.

[3]  P.K. Meher,et al.  Memory-based hardware for resource-constraint digital signal processing systems , 2007, 2007 6th International Conference on Information, Communications & Signal Processing.

[4]  Chein-Wei Jen,et al.  On the design automation of the memory-based VLSI architectures for FIR filters , 1993 .

[5]  Chein-Wei Jen,et al.  A memory-efficient realization of cyclic convolution and its application to discrete cosine transform , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[6]  Thanos Stouraitis,et al.  Systolic algorithms and a memory-based design approach for a unified architecture for the computation of DCT/DST/IDCT/IDST , 2005, IEEE Transactions on Circuits and Systems I: Regular Papers.

[7]  Chein-Wei Jen,et al.  The efficient memory-based VLSI array designs for DFT and DCT , 1992 .

[8]  Pramod Kumar Meher New approach to LUT implementation and accumulation for memory-based multiplication , 2009, 2009 IEEE International Symposium on Circuits and Systems.

[9]  Thanos Stouraitis,et al.  A systolic array architecture for the discrete sine transform , 2002, IEEE Trans. Signal Process..

[10]  Pramod Kumar Meher Systolic Designs for DCT Using a Low-Complexity Concurrent Convolutional Formulation , 2006, IEEE Transactions on Circuits and Systems for Video Technology.