A Quantitative Analysis of the Memory Architecture of FPGA-SoCs

In recent years, so called FPGA-SoCs have been introduced by Intel (formerly Altera) and Xilinx. These devices combine multi-core processors with programmable logic. This paper analyzes the various memory and communication interconnects found in actual devices, particularly the Zynq-7020 and Zynq-7045 from Xilinx and the Cyclone V SE SoC from Intel. Issues such as different access patterns, cache coherence and full-duplex communication are analyzed, for both generic accesses as well as for a real workload from the field of video coding. Furthermore, the paper shows that by carefully choosing the memory interconnect networks as well as the software interface, high-speed memory access can be achieved for various scenarios.

[1]  Ben H. H. Juurlink,et al.  SIMD Acceleration for HEVC Decoding , 2015, IEEE Transactions on Circuits and Systems for Video Technology.

[2]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[3]  Wei Zhang,et al.  Analytical Delay Model for CPU-FPGA Data Paths in Programmable System-on-Chip FPGA , 2016, ARC.

[4]  Haohuan Fu,et al.  Eliminating the memory bottleneck: an FPGA-based solution for 3d reverse time migration , 2011, FPGA '11.

[5]  Toshiaki Miyazaki,et al.  Hardware Accelerator for BLAST , 2012, 2012 IEEE 6th International Symposium on Embedded Multicore SoCs.

[6]  Lukás Sekanina,et al.  Image filter evolution on the Xilinx Zynq Platform , 2013, 2013 NASA/ESA Conference on Adaptive Hardware and Systems (AHS-2013).

[7]  Simon W. Moore,et al.  Managing the FPGA memory wall: Custom computing or vector processing? , 2013, 2013 23rd International Conference on Field programmable Logic and Applications.

[8]  Valery Sklyarov,et al.  Analysis and Comparison of Attainable Hardware Acceleration in All Programmable Systems-on-Chip , 2015, 2015 Euromicro Conference on Digital System Design.

[9]  Luca Benini,et al.  Energy and performance exploration of accelerator coherency port using Xilinx ZYNQ , 2013 .