DLFloat: A 16-b Floating Point Format Designed for Deep Learning Training and Inference

The resilience of Deep Learning (DL) training and inference workloads to low-precision computations, coupled with the demand for power- and area-efficient hardware accelerators for these workloads, has led to the emergence of 16-bit floating-point formats as the precision of choice for DL hardware accelerators. This paper describes our optimized 16-bit format, which has 6 exponent bits and 9 fraction bits, derived from a study of the range of values encountered in DL applications. We demonstrate that our format preserves the accuracy of DL networks, and we compare its ease of use for DL against IEEE-754 half-precision (5 exponent bits and 10 fraction bits) and bfloat16 (8 exponent bits and 7 fraction bits). Further, our format eliminates subnormals and simplifies rounding modes and the handling of corner cases. This streamlines the floating-point unit logic and enables the realization of a compact, power-efficient computation engine.
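
To make the 1-6-9 bit layout concrete, the following minimal Python sketch rounds a double-precision value to the nearest value representable in such a format. It is an illustration only, not the paper's implementation: the exponent bias of 31, flush-to-zero below the smallest normal, and saturation on overflow are assumptions made for this sketch, and the function name round_to_dlfloat is hypothetical.

    import math

    # Illustrative 1-6-9 format: 1 sign bit, 6 exponent bits, 9 fraction bits,
    # no subnormals. Bias, flush-to-zero, and saturation behavior are assumed.
    EXP_BITS = 6
    FRAC_BITS = 9
    BIAS = (1 << (EXP_BITS - 1)) - 1      # 31 (assumed IEEE-style bias)
    EXP_MAX = (1 << EXP_BITS) - 1         # 63 (assumed reserved for specials)

    def round_to_dlfloat(x: float) -> float:
        """Round x to the nearest representable 1-6-9 value (nearest-even)."""
        if math.isnan(x) or math.isinf(x):
            return x                       # pass specials through for simplicity
        if x == 0.0:
            return 0.0
        sign = -1.0 if x < 0 else 1.0
        m, e = math.frexp(abs(x))          # abs(x) = m * 2**e, with m in [0.5, 1)
        m, e = m * 2.0, e - 1              # renormalize so m is in [1, 2)
        largest = (2.0 - 2.0 ** -FRAC_BITS) * 2.0 ** (EXP_MAX - 1 - BIAS)
        if e + BIAS < 1:                   # below the smallest normal: flush to zero
            return 0.0
        if e + BIAS >= EXP_MAX:            # overflow: saturate to the largest normal
            return sign * largest
        frac = round((m - 1.0) * (1 << FRAC_BITS))   # 9-bit fraction, half-to-even
        if frac == (1 << FRAC_BITS):       # rounding carried into the exponent
            frac, e = 0, e + 1
            if e + BIAS >= EXP_MAX:
                return sign * largest
        return sign * (1.0 + frac / (1 << FRAC_BITS)) * 2.0 ** e

    # Example: 0.1 has no exact 9-bit-fraction representation.
    print(round_to_dlfloat(0.1))           # ~0.0999755859375

Removing subnormals, as the abstract describes, is what allows the flush-to-zero branch above to replace the extra normalization path a full IEEE-754 unit would need.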
