A Graph Theoretic Framework of Recomputation Algorithms for Memory-Efficient Backpropagation
暂无分享,去创建一个
Takuya Akiba | Gentaro Watanabe | Masanori Koyama | Takuya Inoue | Mitsuru Kusumoto | Masanori Koyama | Takuya Akiba | G. Watanabe | M. Kusumoto | Takuya Inoue
[1] Jianxin Wu,et al. ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[2] Brian A. Davey,et al. An Introduction to Lattices and Order , 1989 .
[3] Hao Wu,et al. Mixed Precision Training , 2017, ICLR.
[4] Kenta Oono,et al. Chainer : a Next-Generation Open Source Framework for Deep Learning , 2015 .
[5] Natalia Gimelshein,et al. vDNN: Virtualized deep neural networks for scalable, memory-efficient neural network design , 2016, 2016 49th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO).
[6] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[7] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.
[8] Tianqi Chen,et al. Training Deep Nets with Sublinear Memory Cost , 2016, ArXiv.
[9] Andrew W. Appel,et al. Modern Compiler Implementation in Java , 1997 .
[10] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.
[11] Manish Purohit,et al. Efficient Rematerialization for Deep Networks , 2019, NeurIPS.
[12] Kilian Q. Weinberger,et al. Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[13] Dumitru Erhan,et al. Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[14] Takuya Akiba,et al. Chainer: A Deep Learning Framework for Accelerating the Research Cycle , 2019, KDD.
[15] Hai Jin,et al. Layer-Centric Memory Reuse and Data Migration for Extreme-Scale Deep Learning on Many-Core Architectures , 2018, ACM Trans. Archit. Code Optim..
[16] Yuning Jiang,et al. MegDet: A Large Mini-Batch Object Detector , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[17] Thomas Brox,et al. U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.
[18] Alex Graves,et al. Memory-Efficient Backpropagation Through Time , 2016, NIPS.
[19] Zenglin Xu,et al. Superneurons: dynamic GPU memory management for training deep neural networks , 2018, PPoPP.
[20] Xiaogang Wang,et al. Pyramid Scene Parsing Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[21] Kilian Q. Weinberger,et al. Memory-Efficient Implementation of DenseNets , 2017, ArXiv.