FPGA Implementation of the Coupled Filtering Method and the Affine Warping Method

In ultrasound image analysis, speckle tracking methods are widely applied to study the elasticity of body tissue. However, “feature-motion decorrelation” remains a challenge for these methods. Recently, a coupled filtering method and an affine warping method were proposed to estimate strain values accurately when the tissue deformation is large. The major drawback of these methods is their high computational complexity: even a graphics processing unit (GPU)-based program requires a long time to finish the analysis. In this paper, we propose field-programmable gate array (FPGA)-based implementations of both methods for further acceleration. We discuss the capability of FPGAs in handling the different image processing components of these methods, propose a fast and memory-saving image warping approach, and reformulate the algorithms to build a highly efficient pipeline on the FPGA. The final implementations on a Xilinx Virtex-7 FPGA are at least 13 times faster than the GPU implementation on an NVIDIA GeForce GTX 580 graphics card.
