Lightweight Approximation of Softmax Layer for On-Device Inference

[1] Wai-Chi Fang, et al. A Customized Convolutional Neural Network Design Using Improved Softmax Layer for Real-time Human Emotion Recognition, 2019, 2019 IEEE International Conference on Artificial Intelligence Circuits and Systems (AICAS).

[2] Yasuhiro Fujiwara, et al. Sigsoftmax: Reanalysis of the Softmax Bottleneck, 2018, NeurIPS.

[3] Yunhui Guo, et al. A Survey on Methods and Theories of Quantized Neural Networks, 2018, ArXiv.

[4] Lacra Pavel, et al. On the Properties of the Softmax Function with Application in Game Theory and Reinforcement Learning, 2017, ArXiv.

[5] Natalia Gimelshein, et al. Online normalizer calculation for softmax, 2018, ArXiv.

[6] Xiaofei Wang, et al. Convergence of Edge Computing and Deep Learning: A Comprehensive Survey, 2019, IEEE Communications Surveys & Tutorials.

[7] Ke Wang, et al. AI Benchmark: Running Deep Neural Networks on Android Smartphones, 2018, ECCV Workshops.

[8] Quanyuan Feng, et al. A High Speed SoftMax VLSI Architecture Based on Basic-Split, 2018, 2018 14th IEEE International Conference on Solid-State and Integrated Circuit Technology (ICSICT).

[9] Ming-Wei Chang, et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding, 2019, NAACL.

[10] Tao Zhang, et al. A Survey of Model Compression and Acceleration for Deep Neural Networks, 2017, ArXiv.

[11] Weisong Shi, et al. OpenEI: An Open Framework for Edge Intelligence, 2019, 2019 IEEE 39th International Conference on Distributed Computing Systems (ICDCS).

[12] Lukasz Kaiser, et al. Attention is All you Need, 2017, NIPS.

[13] Silvio Savarese, et al. Generalized Intersection Over Union: A Metric and a Loss for Bounding Box Regression, 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[14] Anuj Pathania, et al. Neural Network Inference on Mobile SoCs, 2020, IEEE Design & Test.

[15] Bin Zhao, et al. Hardware-Aware Softmax Approximation for Deep Neural Networks, 2018, ACCV.

[16] Ruigang Yang, et al. IoU Loss for 2D/3D Object Detection, 2019, 2019 International Conference on 3D Vision (3DV).

[17] Yue Zhang, et al. Efficient FPGA Implementation of Softmax Function for DNN Applications, 2018, 2018 12th IEEE International Conference on Anti-counterfeiting, Security, and Identification (ASID).