A Performance Evaluation of Distributed Deep Learning Frameworks on CPU Clusters Using Image Classification Workloads
暂无分享,去创建一个
[1] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[2] Chong Wang,et al. Deep Speech 2 : End-to-End Speech Recognition in English and Mandarin , 2015, ICML.
[3] Alexander J. Smola,et al. Scaling Distributed Machine Learning with the Parameter Server , 2014, OSDI.
[4] Shaohuai Shi,et al. Performance Modeling and Evaluation of Distributed Deep Learning Frameworks on GPUs , 2017, 2018 IEEE 16th Intl Conf on Dependable, Autonomic and Secure Computing, 16th Intl Conf on Pervasive Intelligence and Computing, 4th Intl Conf on Big Data Intelligence and Computing and Cyber Science and Technology Congress(DASC/PiCom/DataCom/CyberSciTech).
[5] Aaron Q. Li,et al. Parameter Server for Distributed Machine Learning , 2013 .
[6] J. Watts,et al. Interprocessor collective communication library (InterCom) , 1994, Proceedings of IEEE Scalable High Performance Computing Conference.
[7] Minsik Cho. BlueConnect: Novel Hierarchical All-Reduce on Multi-tired Network for Deep Learning , 2018 .
[8] Zheng Zhang,et al. MXNet: A Flexible and Efficient Machine Learning Library for Heterogeneous Distributed Systems , 2015, ArXiv.
[9] Sebastian Ruder,et al. An overview of gradient descent optimization algorithms , 2016, Vestnik komp'iuternykh i informatsionnykh tekhnologii.
[10] Takuya Akiba,et al. Chainer: A Deep Learning Framework for Accelerating the Research Cycle , 2019, KDD.
[11] Rajeev Thakur,et al. Optimization of Collective Communication Operations in MPICH , 2005, Int. J. High Perform. Comput. Appl..
[12] Nectarios Koziris,et al. General-Purpose vs. Specialized Data Analytics Systems: A Game of ML & SQL Thrones , 2019, 2019 IEEE International Conference on Big Data (Big Data).
[13] Alex Brooks,et al. Gluon: a communication-optimizing substrate for distributed heterogeneous graph analytics , 2018, PLDI.
[14] 2019 IEEE International Conference on Big Data (Big Data) , 2019 .
[15] He Ma,et al. Theano-MPI: A Theano-Based Distributed Training Framework , 2016, Euro-Par Workshops.
[16] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[17] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.
[18] Marc'Aurelio Ranzato,et al. Large Scale Distributed Deep Networks , 2012, NIPS.
[19] Xin Yuan,et al. Bandwidth optimal all-reduce algorithms for clusters of workstations , 2009, J. Parallel Distributed Comput..
[20] Junsong Yuan,et al. Multi-view Harmonized Bilinear Network for 3D Object Recognition , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[21] Natalia Gimelshein,et al. PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.
[22] Thomas Brox,et al. U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.
[23] Alex Krizhevsky,et al. Learning Multiple Layers of Features from Tiny Images , 2009 .
[24] Ruben Mayer,et al. Scalable Deep Learning on Distributed Infrastructures: Challenges, Techniques and Tools , 2019 .
[25] Amar Phanishayee,et al. Benchmarking and Analyzing Deep Neural Network Training , 2018, 2018 IEEE International Symposium on Workload Characterization (IISWC).
[26] Zhiyuan Li,et al. Group-Level Emotion Recognition using Deep Models with A Four-stream Hybrid Network , 2018, ICMI.
[27] I. Dhillon,et al. Taming Pretrained Transformers for Extreme Multi-label Text Classification , 2019, KDD.
[28] Nectarios Koziris,et al. Towards Faster Distributed Deep Learning Using Data Hashing Techniques , 2019, 2019 IEEE International Conference on Big Data (Big Data).
[29] Kunle Olukotun,et al. DAWNBench : An End-to-End Deep Learning Benchmark and Competition , 2017 .
[30] Yang Wang,et al. BigDL: A Distributed Deep Learning Framework for Big Data , 2018, SoCC.
[31] Gu-Yeon Wei,et al. Fathom: reference workloads for modern deep learning methods , 2016, 2016 IEEE International Symposium on Workload Characterization (IISWC).
[32] Seung-Jong Park,et al. Evaluation of Deep Learning Frameworks Over Different HPC Architectures , 2017, 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS).
[33] Guigang Zhang,et al. Deep Learning , 2016, Int. J. Semantic Comput..