Efficient Dilated-Winograd Convolutional Neural Networks

Dilated convolution is used to achieve wide receptive fields in computer vision algorithms such as image segmentation and denoising. Unlike the strided convolution, dilated convolution maintains the resolution of the output feature map same as the input feature map. Thus, the computational complexity can be increased to configure the convolutional neural network (CNN) architecture with the dilated convolutional layer. However, the complexity accordingly introduces additional computation delay and it is strongly required to have a proper way to lessen the computation delay of the dilated convolution. In this paper, we propose the dilated-Winograd convolution to reduce the computational complexity of the dilated convolution. By using the Winograd transform with a dilation rate, the number of pixels in the tile is effectively reduced. The proposed acceleration methods result in an average speedup of 2.043 and 1.456 with dilation rate of 2 and 4 compared to the state-of-the-art implementation.

[1]  Wangmeng Zuo,et al.  Learning Deep CNN Denoiser Prior for Image Restoration , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Andrew Lavin,et al.  Fast Algorithms for Convolutional Neural Networks , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Iasonas Kokkinos,et al.  Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs , 2014, ICLR.

[4]  Yann LeCun,et al.  Fast Training of Convolutional Networks through FFTs , 2013, ICLR.

[5]  Jia Xu,et al.  Fast Image Processing with Fully-Convolutional Networks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[6]  Qingxiang Wu,et al.  Image super-resolution using a dilated convolutional neural network , 2018, Neurocomputing.

[7]  Mark J. Shensa,et al.  The discrete wavelet transform: wedding the a trous and Mallat algorithms , 1992, IEEE Trans. Signal Process..

[8]  John Tran,et al.  cuDNN: Efficient Primitives for Deep Learning , 2014, ArXiv.

[9]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Vladlen Koltun,et al.  Multi-Scale Context Aggregation by Dilated Convolutions , 2015, ICLR.

[11]  Jeff Johnson,et al.  Fast Convolutional Nets With fbfft: A GPU Performance Evaluation , 2014, ICLR.

[12]  Thomas A. Funkhouser,et al.  Dilated Residual Networks , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).