Understanding the Performance of Small Convolution Operations for CNN on Intel Architecture
暂无分享,去创建一个
Alexander Heinecke | Greg Henry | Dhiraj Kalamkar | Evangelos Georganas | Anand Venkat | Dhiraj D. Kalamkar | Hans Pabst | G. Henry | E. Georganas | A. Heinecke | Hans Pabst | Anand Venkat | Kunal Banerjee | Narayanan | Sundaram | K. Banerjee
[1] Andrew Lavin,et al. Fast Algorithms for Convolutional Neural Networks , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[2] Alexander Heinecke,et al. LIBXSMM: Accelerating Small Matrix Multiplications by Runtime Code Generation , 2016, SC16: International Conference for High Performance Computing, Networking, Storage and Analysis.