nn-Meter: towards accurate latency prediction of deep-learning model inference on diverse edge devices
暂无分享,去创建一个
Yunxin Liu | Ningxin Zheng | Li Lyna Zhang | Yuqing Yang | Ting Cao | Shihao Han | Jianyu Wei | Yunxin Liu | L. Zhang | Yuqing Yang | Jianyu Wei | Ting Cao | S. Han | Ningxin Zheng
[1] Haichen Shen,et al. TVM: An Automated End-to-End Optimizing Compiler for Deep Learning , 2018, OSDI.
[2] Song Han,et al. AMC: AutoML for Model Compression and Acceleration on Mobile Devices , 2018, ECCV.
[3] Song Han,et al. Deep Compression: Compressing Deep Neural Network with Pruning, Trained Quantization and Huffman Coding , 2015, ICLR.
[4] Ming-Wei Chang,et al. Well-Read Students Learn Better: On the Importance of Pre-training Compact Models , 2019 .
[5] Xiangyu Zhang,et al. ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[6] Yiming Yang,et al. DARTS: Differentiable Architecture Search , 2018, ICLR.
[7] W. Schröder-Preikschat,et al. Precious: Resource-Demand Estimation for Embedded Neural Network Accelerators , 2020 .
[8] Yuandong Tian,et al. FBNet: Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[9] Michael Carbin,et al. Ithemal: Accurate, Portable and Fast Basic Block Throughput Estimation using Deep Neural Networks , 2018, ICML.
[10] Samuel J. Kaufman,et al. Learned TPU Cost Model for XLA Tensor Programs , 2019 .
[11] Leo Breiman,et al. Random Forests , 2001, Machine Learning.
[12] Yunxin Liu,et al. Fast Hardware-Aware Neural Architecture Search , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).
[13] Nicholas D. Lane,et al. BRP-NAS: Prediction-based NAS using GCNs , 2020, NeurIPS.
[14] Diana Marculescu,et al. NeuralPower: Predict and Deploy Energy-Efficient Convolutional Neural Networks , 2017, ArXiv.
[15] Tor M. Aamodt,et al. Accel-Sim: An Extensible Simulation Framework for Validated GPU Modeling , 2020, 2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture (ISCA).
[16] Ling Huang,et al. Predicting Execution Time of Computer Programs Using Sparse Polynomial Regression , 2010, NIPS.
[17] Frédo Durand,et al. Learning to optimize halide with tree search and random programs , 2019, ACM Trans. Graph..
[18] Xuanzhe Liu,et al. A First Look at Deep Learning Apps on Smartphones , 2018, WWW.
[19] Song Han,et al. ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware , 2018, ICLR.
[20] Tianqi Chen,et al. XGBoost: A Scalable Tree Boosting System , 2016, KDD.
[21] Niraj K. Jha,et al. ChamNet: Towards Efficient Network Design Through Platform-Aware Model Adaptation , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[22] Yi Yang,et al. NAS-Bench-201: Extending the Scope of Reproducible Neural Architecture Search , 2020, ICLR.
[23] Chuang Gan,et al. Once for All: Train One Network and Specialize it for Efficient Deployment , 2019, ICLR.
[24] Bo Chen,et al. MnasNet: Platform-Aware Neural Architecture Search for Mobile , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[25] Hanan Samet,et al. Pruning Filters for Efficient ConvNets , 2016, ICLR.