Evaluation of a Floating-Point Intensive Kernel on FPGA - A Case Study of Geodesic Distance Kernel

Heterogeneous platforms provide a promising solution for high-performance and energy-efficient computing applications. This paper presents our research on usage of heterogeneous platform for a floating-point intensive kernel. We first introduce the floating-point intensive kernel from the geographical information system. Then we analyze the FPGA designs generated by the Intel FPGA SDK for OpenCL, and evaluate the kernel performance and the floating-point error rate of the FPGA designs. Finally, we compare the performance and energy efficiency of the kernel implementations on the Arria 10 FPGA, Intel’s Xeon Phi Knights Landing CPU, and NVIDIA’s Kepler GPU. Our evaluation shows the energy efficiency of the single-precision kernel on the FPGA is 1.35X better than on the CPU and the GPU, while the energy efficiency of the double-precision kernel on the FPGA is 1.36X and 1.72X less than the CPU and GPU, respectively.

[1]  Mário P. Véstias,et al.  Trends of CPU, GPU and FPGA for high-performance computing , 2014, 2014 24th International Conference on Field Programmable Logic and Applications (FPL).

[2]  Wayne Luk,et al.  Is high level synthesis ready for business? A computational finance case study , 2014, 2014 International Conference on Field-Programmable Technology (FPT).

[3]  Kevin Skadron,et al.  Accelerating Compute-Intensive Applications with GPUs and FPGAs , 2008, 2008 Symposium on Application Specific Processors.

[4]  Dirk Koch,et al.  FPGAs for Software Programmers , 2016 .

[5]  Doris Chen,et al.  Fractal video compression in OpenCL: An evaluation of CPUs, GPUs, and FPGAs as acceleration platforms , 2013, 2013 18th Asia and South Pacific Design Automation Conference (ASP-DAC).

[6]  Franck Cappello,et al.  Evaluation of CHO Benchmarks on the Arria 10 FPGA using Intel FPGA SDK for OpenCL , 2017 .

[7]  Satoshi Matsuoka,et al.  Evaluating and Optimizing OpenCL Kernels for High Performance Computing with FPGAs , 2016, SC16: International Conference for High Performance Computing, Networking, Storage and Analysis.

[8]  Javier Navaridas,et al.  CHO: towards a benchmark suite for OpenCL FPGA accelerators , 2015, IWOCL.

[9]  Keith D. Underwood,et al.  FPGAs vs. CPUs: trends in peak floating-point performance , 2004, FPGA '04.

[10]  Miriam Leeser,et al.  OpenCL Floating Point Software on Heterogeneous Architectures – Portable or Not? , 2012 .

[11]  Avinash Sodani,et al.  Intel Xeon Phi Processor High Performance Programming: Knights Landing Edition 2nd Edition , 2016 .

[12]  Ronan Keryell,et al.  Optimizing OpenCL applications on Xilinx FPGA , 2016, IWOCL.