暂无分享,去创建一个
Chao Liu | Jing Zhou | Jing Zhang | Mayank Daga | Daniel Lowell | Jehandad Khan | Ilya Perminov | Paul Fultz | Artem Tamazov | Michael Melesse | Murali Nandhimandalam | Kamil Nasyrov | Tejash Shah | Vasilii Filippov | Bragadeesh Natarajan | I. Perminov | Daniel Lowell | Mayank Daga | Chao Liu | Michael Melesse | Jehandad Khan | Paul Fultz | Artem Tamazov | Murali Nandhimandalam | Kamil Nasyrov | Tejash Shah | Vasilii Filippov | Jing Zhang | Jing Zhou | Bragadeesh Natarajan
[1] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.
[2] D. Scott Cyphers,et al. Intel® nGraphTM , 2018 .
[3] Roberto Cipolla,et al. Deep Roots: Improving CNN Efficiency with Hierarchical Filter Groups , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[4] Haichen Shen,et al. TVM: An Automated End-to-End Optimizing Compiler for Deep Learning , 2018 .
[5] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[6] Charles L. Lawson,et al. Basic Linear Algebra Subprograms for Fortran Usage , 1979, TOMS.
[7] Andrew W. Senior,et al. Long short-term memory recurrent neural network architectures for large scale acoustic modeling , 2014, INTERSPEECH.
[8] Frédo Durand,et al. Halide: a language and compiler for optimizing parallelism, locality, and recomputation in image processing pipelines , 2013, PLDI 2013.
[9] Geoffrey E. Hinton,et al. Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[10] Jacques Pienaar,et al. MLIR Primer: A Compiler Infrastructure for the End of Moore’s Law , 2019 .
[11] Albert Cohen,et al. Tensor Comprehensions: Framework-Agnostic High-Performance Machine Learning Abstractions , 2018, ArXiv.
[12] James Demmel,et al. ImageNet Training in Minutes , 2017, ICPP.
[13] Andrew Lavin,et al. Fast Algorithms for Convolutional Neural Networks , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[14] Bertrand A. Maher,et al. Glow: Graph Lowering Compiler Techniques for Neural Networks , 2018, ArXiv.
[15] Yoshua Bengio,et al. Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.
[16] Zheng Zhang,et al. MXNet: A Flexible and Efficient Machine Learning Library for Heterogeneous Distributed Systems , 2015, ArXiv.
[17] Steven G. Johnson,et al. FFTW: an adaptive software architecture for the FFT , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).
[18] Shoaib Kamil,et al. The tensor algebra compiler , 2017, Proc. ACM Program. Lang..
[19] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[20] Trevor Darrell,et al. Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.
[21] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[22] John J. Hopfield,et al. Neural networks and physical systems with emergent collective computational abilities , 1999 .
[23] Jürgen Schmidhuber,et al. Recurrent nets that time and count , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.
[24] H. Shimodaira,et al. Improving predictive inference under covariate shift by weighting the log-likelihood function , 2000 .
[25] Francesco Visin,et al. A guide to convolution arithmetic for deep learning , 2016, ArXiv.
[26] Yu Zhang,et al. Simple Recurrent Units for Highly Parallelizable Recurrence , 2017, EMNLP.
[27] Pascal Vincent,et al. Reducing Uncertainty in Undersampled MRI Reconstruction With Active Acquisition , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[28] Haichen Shen,et al. TVM: An Automated End-to-End Optimizing Compiler for Deep Learning , 2018, OSDI.
[29] Jack J. Dongarra,et al. Automatically Tuned Linear Algebra Software , 1998, Proceedings of the IEEE/ACM SC98 Conference.
[30] John Salvatier,et al. Theano: A Python framework for fast computation of mathematical expressions , 2016, ArXiv.
[31] Aaftab Munshi,et al. The OpenCL specification , 2009, 2009 IEEE Hot Chips 21 Symposium (HCS).
[32] François Chollet,et al. Xception: Deep Learning with Depthwise Separable Convolutions , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[33] Stéphane Mallat,et al. Rigid-Motion Scattering for Texture Classification , 2014, ArXiv.
[34] Yoshua Bengio,et al. Convolutional networks for images, speech, and time series , 1998 .
[35] Clément Farabet,et al. Implementing Neural Networks Efficiently , 2012, Neural Networks: Tricks of the Trade.
[36] Mikel Luján,et al. OoLALA: an object oriented analysis and design of numerical linear algebra , 2000, OOPSLA '00.
[37] Elizabeth R. Jessup,et al. Automating the generation of composed linear algebra kernels , 2009, Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis.
[38] John Tran,et al. cuDNN: Efficient Primitives for Deep Learning , 2014, ArXiv.
[39] Geoffrey E. Hinton,et al. Learning representations by back-propagating errors , 1986, Nature.
[40] Yuan Yu,et al. TensorFlow: A system for large-scale machine learning , 2016, OSDI.
[41] Wojciech Matusik,et al. Simit , 2016, ACM Trans. Graph..
[42] Bo Chen,et al. MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications , 2017, ArXiv.
[43] Dumitru Erhan,et al. Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).