Compressing Deep Neural Networks with Probabilistic Data Structures

This paper presents a lossy weight encoding method that complements conventional compression techniques such as weight pruning and clustering. The encoding is based on the Bloomier filter, a probabilistic data structure that saves space at the cost of introducing random errors into the stored weights. By exploiting the ability of DNNs to tolerate these imperfections, and by retraining around them, the proposed technique compresses DNN weights by up to 496× (a 1.51× improvement over the state of the art) without sacrificing model accuracy.
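
To make the underlying mechanism concrete, below is a minimal Python sketch of a Bloomier filter in the classic construction of Chazelle et al. (2004): members of the map always decode exactly, while absent keys occasionally alias to a valid value, which is the source of the random weight errors described above. The class, its parameters (`slots_per_key`, `cell_bits`), and the toy demo are illustrative assumptions, not the paper's actual implementation.

```python
import hashlib
from collections import defaultdict

class BloomierFilter:
    """Minimal Bloomier filter: an approximate key -> value map.

    Inserted keys always return their exact value; other keys usually
    decode to an invalid code, but a small fraction alias to a valid
    value (a false positive). Values must fit in `cell_bits` bits.
    """

    def __init__(self, items, slots_per_key=3, cell_bits=12, max_seeds=64):
        self.k = slots_per_key
        self.t = cell_bits
        self.m = max(self.k + 1, int(1.3 * len(items)) + 1)  # table size
        for seed in range(max_seeds):       # retry hashing until the
            self.seed = seed                # greedy peeling succeeds
            order = self._peel(list(items))
            if order is not None:
                break
        else:
            raise RuntimeError("construction failed; grow the table")
        self.table = [0] * self.m
        # Fill in reverse peel order: each key's private slot is untouched
        # by keys assigned earlier, so previous equations stay satisfied.
        for key, slot in reversed(order):
            acc = items[key] ^ self._mask(key)
            for s in self._slots(key):
                if s != slot:
                    acc ^= self.table[s]
            self.table[slot] = acc

    def _hash(self, tag, key):
        h = hashlib.blake2b(f"{self.seed}|{tag}|{key}".encode()).digest()
        return int.from_bytes(h[:8], "big")

    def _slots(self, key):
        # k distinct table positions for this key
        slots, i = [], 0
        while len(slots) < self.k:
            s = self._hash(i, key) % self.m
            if s not in slots:
                slots.append(s)
            i += 1
        return slots

    def _mask(self, key):
        return self._hash("M", key) & ((1 << self.t) - 1)

    def _peel(self, keys):
        # Greedily peel keys that own a slot no other remaining key touches.
        users = defaultdict(set)
        for key in keys:
            for s in self._slots(key):
                users[s].add(key)
        order, remaining = [], set(keys)
        while remaining:
            peeled = [k2 for k2 in remaining
                      if any(users[s] == {k2} for s in self._slots(k2))]
            if not peeled:
                return None                 # no singleton slot left
            for k2 in peeled:
                slot = next(s for s in self._slots(k2) if users[s] == {k2})
                order.append((k2, slot))
                remaining.discard(k2)
                for s in self._slots(k2):
                    users[s].discard(k2)
        return order

    def get(self, key):
        acc = self._mask(key)
        for s in self._slots(key):
            acc ^= self.table[s]
        return acc & ((1 << self.t) - 1)
```

A hypothetical usage in the spirit of the paper: store only the surviving (nonzero) weights of a pruned layer as small cluster indices, leaving pruned positions absent. An absent index usually decodes to an invalid code (treated as a zero weight), but roughly (2^a - 1)/2^t of the time it aliases to one of the 2^a - 1 valid codes, producing the random errors that retraining absorbs.

```python
import random

random.seed(0)
# 500 surviving weights out of 10,000 positions, 4-bit cluster indices (1..15)
stored = {i: random.randint(1, 15) for i in random.sample(range(10000), 500)}
bf = BloomierFilter(stored)

assert all(bf.get(i) == v for i, v in stored.items())    # members are exact
absent = [i for i in range(10000) if i not in stored]
errors = sum(1 for i in absent if 1 <= bf.get(i) <= 15)
print(f"false positives: {errors}/{len(absent)}")        # ~15/4096 ≈ 0.4%
```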