Improving the speed of neural networks on CPUs
[1] Ivica Rogina, et al. The bucket box intersection (BBI) algorithm for fast approximative evaluation of diagonal mixture Gaussians, 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.
[2] Mark J. F. Gales, et al. Use of Gaussian selection in large vocabulary continuous speech recognition using HMMs, 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.
[3] Keechul Jung, et al. GPU implementation of neural networks, 2004, Pattern Recognit.
[4] N. Fujimoto, et al. Faster matrix-vector multiplication on GeForce 8800GTX, 2008, 2008 IEEE International Symposium on Parallel and Distributed Processing.
[5] Keechul Jung, et al. Neural Network Implementation Using CUDA and OpenMP, 2008, 2008 Digital Image Computing: Techniques and Applications.
[6] Volodymyr Mnih, et al. CUDAMat: a CUDA-based matrix class for Python, 2009.
[7] Rajat Raina, et al. Large-scale deep unsupervised learning using graphics processors, 2009, ICML '09.
[8] Pradeep Dubey, et al. Debunking the 100X GPU vs. CPU myth: an evaluation of throughput computing on CPU and GPU, 2010, ISCA.
[9] Navdeep Jaitly, et al. Application of Pretrained Deep Neural Networks to Large Vocabulary Speech Recognition, 2012, INTERSPEECH.
[10] Navdeep Jaitly, et al. Application of Pretrained Deep Neural Networks to Large Vocabulary Conversational Speech Recognition, 2012.