CORDIC-Based Softmax Acceleration Method for Convolutional Neural Networks on FPGA

With the vigorous growth of computing power, Convolutional Neural Networks (CNNs) are developing rapidly, and new CNN architectures with more layers and better performance continue to appear. As a current research hotspot, the Field Programmable Gate Array (FPGA) has gradually become a preferred platform for deploying and accelerating CNNs. This paper studies an FPGA-based hardware acceleration method, implementing and simulating the Softmax layer of AlexNet in Vivado 2018.1. Exploiting the characteristics of the FPGA, the CORDIC algorithm is used to realize basic operations such as division and the exponential function with shift-and-add hardware, instead of consuming floating-point arithmetic resources. The paper proposes a method to shrink the CORDIC convergence domain and analyzes the errors introduced by quantizing the data to different bit widths and by fixed-point inputs. By reducing the bit width, the relative error of the exponential function in the Softmax layer is kept below 0.0146%, which satisfies the design requirements while saving resources. The method completes the calculation and classification of the Softmax layer in 66.5 cycles without fixed-point preprocessing of the layer data, which greatly improves the calculation speed of the Softmax layer.
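The exponential evaluation at the heart of this design can be illustrated with a software model of hyperbolic-mode CORDIC. The sketch below is not the paper's fixed-point implementation; it is a minimal floating-point reference, assuming the standard hyperbolic CORDIC iteration (with the usual repeated iterations at indices 4, 13, ...) and the textbook convergence domain of roughly |z| < 1.118, which is why the paper's convergence-domain shrinking is needed for larger inputs.

```python
import math

def cordic_exp(z, n_iter=16):
    """Approximate e^z with hyperbolic CORDIC rotations (rotation mode).

    Only valid inside the hyperbolic convergence domain (|z| < ~1.118);
    larger arguments require range reduction such as the paper's
    convergence-domain shrinking. Floating-point model, not fixed-point.
    """
    # Build the iteration index sequence: in hyperbolic mode, iterations
    # 4, 13, 40, ... (k_{j+1} = 3*k_j + 1) must be repeated to converge.
    idx, i, rep = [], 1, 4
    while len(idx) < n_iter:
        idx.append(i)
        if i == rep:
            idx.append(i)
            rep = 3 * rep + 1
        i += 1
    idx = idx[:n_iter]

    # CORDIC gain for these iterations: product of sqrt(1 - 2^-2i).
    # Pre-scaling x0 by 1/K folds the gain correction into the start value.
    K = 1.0
    for i in idx:
        K *= math.sqrt(1.0 - 2.0 ** (-2 * i))

    x, y = 1.0 / K, 0.0  # converges to x = cosh(z), y = sinh(z)
    for i in idx:
        d = 1.0 if z >= 0.0 else -1.0  # rotate toward zero residual angle
        x, y = x + d * y * 2.0 ** (-i), y + d * x * 2.0 ** (-i)
        z -= d * math.atanh(2.0 ** (-i))
    return x + y  # cosh(z) + sinh(z) = e^z
```

In hardware, the `2.0 ** (-i)` multiplications become barrel shifts and the `atanh(2^-i)` constants come from a small lookup table, so each iteration costs only adders and shifters; the achievable relative error then depends on the iteration count and the fixed-point bit width, which is the trade-off the paper's error analysis quantifies.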