Matrix Operations Design Tool for FPGA and VLSI Systems

Embedded systems used in real-time applications require low power, small area, and high computation speed. In digital signal processing, image processing, and communication applications, data often arrive continuously at a high rate. The arithmetic functions and matrix operations required vary greatly among applications, and RTL-based design and verification of even one of these functions can be time-consuming. Some high-level synthesis tools reduce design and verification time, but their results may not be optimal or suitable for low-power applications. The design tool proposed in this paper shortens design time by reducing the verification cycle. It offers a fast design and verification platform for important matrix operations, ranging from simple addition to more complex operations such as LU and QR factorization. The tool generates Verilog code and an accompanying testbench that can be realized in FPGA and VLSI systems, and it uses MATLAB-based verification and reporting.
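
To make the verification idea concrete, the following is a minimal sketch of a software reference ("golden") model for LU factorization, of the kind a testbench result could be compared against. It is an illustration only and not the tool's own code: the paper uses MATLAB for verification, whereas this sketch is written in Python, and the function names (lu_doolittle, matmul, max_abs_error) and the choice of the Doolittle scheme without pivoting are assumptions made here.

```python
def lu_doolittle(a):
    """Doolittle LU without pivoting: returns (L, U) with unit-diagonal L and L*U == a.

    Hypothetical reference model; assumes a square matrix with nonzero pivots.
    """
    n = len(a)
    L = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    U = [[0.0] * n for _ in range(n)]
    for i in range(n):
        # Row i of U, using the rows of U already computed.
        for j in range(i, n):
            U[i][j] = a[i][j] - sum(L[i][k] * U[k][j] for k in range(i))
        # Column i of L, dividing by the pivot U[i][i].
        for j in range(i + 1, n):
            if U[i][i] == 0.0:
                raise ZeroDivisionError("zero pivot; pivoting would be required")
            L[j][i] = (a[j][i] - sum(L[j][k] * U[k][i] for k in range(i))) / U[i][i]
    return L, U


def matmul(x, y):
    """Plain matrix product of two lists of lists."""
    return [[sum(x[i][k] * y[k][j] for k in range(len(y)))
             for j in range(len(y[0]))] for i in range(len(x))]


def max_abs_error(x, y):
    """Largest element-wise absolute difference between two matrices."""
    return max(abs(p - q) for rx, ry in zip(x, y) for p, q in zip(rx, ry))
```

In a flow like the one described, the factors produced by the hardware simulation would be multiplied back together and compared against the input with max_abs_error, using a tolerance chosen to match the word length of the hardware arithmetic.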
