Using hardware parallelism for reducing power consumption in video streaming applications

Reconfigurable technology fits for real-time video streaming applications. It is considered as a promising solution due to the offered performance per watt compared to other technologies. Since FPGA evolved, several techniques at different design levels starting from the circuit-level up to the system-level were proposed to reduce the power consumption of the FPGA devices. In this paper, we present a flexible parallel hardware-based architecture in conjunction with frequency scaling as a technique for reducing power consumption in video streaming applications. In this work, we derived equations to ease the calculation for the level of parallelism and the maximum depth for the FIFOs used for clock domain crossing. Accordingly, a design space was formed including all the design alternatives for the application. The preferable design alternative is selected in aware of how much hardware it costs and what power reduction goal it can satisfy. We used Xilinx Zynq ZC706 evaluation board to implement two video streaming applications: Video downscaler (1:16) and AES encryption algorithm to verify our approach. The experimental results showed up to 19.6% power reduction for the video downscaler and up to 5.4% for the AES encryption.

[1]  David Black-Schaffer,et al.  The HIPEAC vision for advanced computing in horizon 2020 , 2013 .

[2]  Jean-Luc Dekeyser,et al.  A generic pixel distribution architecture for parallel video processing , 2014, 2014 International Conference on ReConFigurable Computing and FPGAs (ReConFig14).

[3]  Ahmed Amine Jerraya,et al.  Multiprocessor System-on-Chip (MPSoC) Technology , 2008, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[4]  Samuel H. Fuller,et al.  Computing Performance: Game Over or Next Level? , 2011, Computer.

[5]  Wolfgang Nebel,et al.  Low power design in deep submicron electronics , 1997 .

[6]  Ioannis Papaefstathiou,et al.  FPGA power consumption measurements and estimations under different implementation parameters , 2011, 2011 International Conference on Field-Programmable Technology.

[7]  Jason Helge Anderson,et al.  Clock gating architectures for FPGA power reduction , 2009, 2009 International Conference on Field Programmable Logic and Applications.

[8]  Azadeh Davoodi,et al.  FPGA Dynamic Power Minimization through Placement and Routing Constraints , 2006, EURASIP J. Embed. Syst..

[9]  Morris J. Dworkin,et al.  Recommendation for Block Cipher Modes of Operation: Methods and Techniques , 2001 .

[10]  Greg Brown,et al.  A performance and energy comparison of FPGAs, GPUs, and multicores for sliding-window applications , 2012, FPGA '12.

[11]  Patrick Degenaar,et al.  Parallelism to reduce power consumption on FPGA spatiotemporal image processing , 2008, 2008 IEEE International Symposium on Circuits and Systems.

[12]  Eric R. Fossum,et al.  CMOS image sensors: electronic camera on a chip , 1995, Proceedings of International Electron Devices Meeting.

[13]  Andrew B. Kahng The ITRS design technology and system drivers roadmap: Process and status , 2013, 2013 50th ACM/EDAC/IEEE Design Automation Conference (DAC).

[14]  José L. Núñez-Yáñez,et al.  Run-time power and performance scaling in 28 nm FPGAs , 2014, IET Comput. Digit. Tech..