Pareto-Optimal Quantized ResNet Is Mostly 4-bit
暂无分享,去创建一个
Oleg Rybakov | Jonathan Malmaud | AmirAli Abdolrashidi | Shivani Agrawal | Chas Leichner | Lisa Wang | Lukasz Lew | J. Malmaud | Oleg Rybakov | Chas Leichner | Lukasz Lew | Shivani Agrawal | AmirAli Abdolrashidi | Lisa Wang
[1] Avi Mendelson,et al. NICE: Noise Injection and Clamping Estimation for Neural Network Quantization , 2018, Mathematics.
[2] Swagath Venkataramani,et al. PACT: Parameterized Clipping Activation for Quantized Neural Networks , 2018, ArXiv.
[3] Joe Lou,et al. Confounding Tradeoffs for Neural Network Quantization , 2021, ArXiv.
[4] Eunhui Kim,et al. Spatial Shift Point-Wise Quantization , 2020, IEEE Access.
[5] Zhijian Liu,et al. HAQ: Hardware-Aware Automated Quantization With Mixed Precision , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[6] Zhiru Zhang,et al. Improving Neural Network Quantization without Retraining using Outlier Channel Splitting , 2019, ICML.
[7] Fang Liu,et al. Effective and Fast: A Novel Sequential Single Path Search for Mixed-Precision Quantization , 2021, ArXiv.
[8] Song Han,et al. Deep Compression: Compressing Deep Neural Network with Pruning, Trained Quantization and Huffman Coding , 2015, ICLR.
[9] Quoc V. Le,et al. Neural Architecture Search with Reinforcement Learning , 2016, ICLR.
[10] A. Krizhevsky. Convolutional Deep Belief Networks on CIFAR-10 , 2010 .
[11] Patrick Judd,et al. Integer Quantization for Deep Learning Inference: Principles and Empirical Evaluation , 2020, ArXiv.
[12] Sergey Ioffe,et al. Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning , 2016, AAAI.
[13] Luca Antiga,et al. Automatic differentiation in PyTorch , 2017 .
[14] Alok Aggarwal,et al. Regularized Evolution for Image Classifier Architecture Search , 2018, AAAI.
[15] Kurt Keutzer,et al. HAWQ: Hessian AWare Quantization of Neural Networks With Mixed-Precision , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[16] Dharmendra S. Modha,et al. Discovering Low-Precision Networks Close to Full-Precision Networks for Efficient Embedded Inference , 2018, ArXiv.
[17] Frank Hutter,et al. Neural Architecture Search: A Survey , 2018, J. Mach. Learn. Res..
[18] Zhiru Zhang,et al. FracBNN: Accurate and FPGA-Efficient Binary Neural Networks with Fractional Activations , 2020, FPGA.
[19] William J. Dally,et al. VS-Quant: Per-vector Scaled Quantization for Accurate Low-Precision Neural Network Inference , 2021, MLSys.
[20] Bo Chen,et al. Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[21] Lei Liu,et al. Auto-tuning Neural Network Quantization Framework for Collaborative Inference Between the Cloud and Edge , 2018, ICANN.
[22] Kurt Keutzer,et al. ZeroQ: A Novel Zero Shot Quantization Framework , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[23] Albert Gural,et al. Trained Uniform Quantization for Accurate and Efficient Neural Network Inference on Fixed-Point Hardware , 2019, ArXiv.
[24] Ian D. Reid,et al. Towards Effective Low-Bitwidth Convolutional Neural Networks , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[25] Quoc V. Le,et al. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks , 2019, ICML.
[26] Bo Chen,et al. MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications , 2017, ArXiv.
[27] Jihun Oh,et al. Weight Equalizing Shift Scaler-Coupled Post-training Quantization , 2020, ArXiv.
[28] Ramesh C. Agarwal,et al. A three-dimensional approach to parallel matrix multiplication , 1995, IBM J. Res. Dev..
[29] Uri Weiser,et al. Robust Quantization: One Model to Rule Them All , 2020, NeurIPS.
[30] Michael S. Bernstein,et al. ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.
[31] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[32] Yuan Yu,et al. TensorFlow: A system for large-scale machine learning , 2016, OSDI.
[33] G. Hua,et al. LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks , 2018, ECCV.