Fast multidimensional reduction and broadcast operations on GPU for machine learning
[1] Deniz Yuret. Knet: beginning deep learning with 100 lines of Julia, 2016.
[2] Caglar Senaras, et al. Deep Learning for Medical Image Analysis, 2018, Journal of Pathology Informatics.
[3] Mark J. Harris. CUDA: performance tips and tricks, 2007, SIGGRAPH '07.
[4] Christoph Meinel, et al. Deep Learning for Medical Image Analysis, 2018, Journal of Pathology Informatics.
[5] Yoshua Bengio, et al. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention, 2015, ICML.
[6] Fei-Fei Li, et al. Deep visual-semantic alignments for generating image descriptions, 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[7] Geoffrey E. Hinton, et al. ImageNet classification with deep convolutional neural networks, 2012, Commun. ACM.
[8] Barbara Chapman, et al. Using OpenMP: Portable Shared Memory Parallel Programming (Scientific and Engineering Computation), 2007.
[9] Kohei Ichikawa, et al. MPI_Reduce algorithm for OpenFlow-enabled network, 2015, 2015 15th International Symposium on Communications and Information Technologies (ISCIT).
[10] Alan Edelman, et al. Julia: A Fast Dynamic Language for Technical Computing, 2012, ArXiv.
[11] Tauno Kekäle, et al. Beautiful Code. Leading Programmers Explain How They Think, 2009.
[12] Jian Sun, et al. Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification, 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[13] Anand D. Sarwate, et al. A Unified Optimization Approach for Sparse Tensor Operations on GPUs, 2017, 2017 IEEE International Conference on Cluster Computing (CLUSTER).