Optimizing convolutional neural networks on embedded platforms with OpenCL
暂无分享,去创建一个
[1] Grigori Fursin,et al. Collective Knowledge: Towards R&D sustainability , 2016, 2016 Design, Automation & Test in Europe Conference & Exhibition (DATE).
[2] Grigori Fursin,et al. Collective Mind, Part II: Towards Performance- and Cost-Aware Software Engineering as a Natural Science , 2015, ArXiv.
[3] Song Han,et al. Deep Compression: Compressing Deep Neural Network with Pruning, Trained Quantization and Huffman Coding , 2015, ICLR.
[4] Fabian Tschopp,et al. Efficient convolutional neural networks for pixelwise classification on heterogeneous hardware systems , 2015, 2016 IEEE 13th International Symposium on Biomedical Imaging (ISBI).
[5] Sam Lindley,et al. Generating performance portable code using rewrite rules: from high-level functional expressions to high-performance OpenCL code , 2015, ICFP.
[6] Karl Rupp,et al. ViennaCL-A High Level Linear Algebra Library for GPUs and Multi-Core CPUs , 2010 .
[7] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[8] Trevor Darrell,et al. Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.
[9] Nicholas D. Lane,et al. DeepX: A Software Accelerator for Low-Power Deep Learning Inference on Mobile Devices , 2016, 2016 15th ACM/IEEE International Conference on Information Processing in Sensor Networks (IPSN).
[10] Anton Lokhmotov,et al. Optimising OpenCL kernels for the ARM Mali-T600 GPUs , 2014 .
[11] Elnar Hajiyev,et al. PENCIL: A Platform-Neutral Compute Intermediate Language for Accelerator Programming , 2015, 2015 International Conference on Parallel Architecture and Compilation (PACT).
[12] J. Xu. OpenCL – The Open Standard for Parallel Programming of Heterogeneous Systems , 2009 .