论文信息 - Study of the Impact of Standard Image Compression Techniques on Performance of Image Classification with a Convolutional Neural Network

Study of the Impact of Standard Image Compression Techniques on Performance of Image Classification with a Convolutional Neural Network

In this study, we have measured the impact of image compression on the classification performance of Convolutional Neural Networks (CNNs). By using a pre-trained CNN to classify compressed images, we have shown that on average, an image can be compressed by a factor 7, 16, 40 for a JPEG, JPEG200 and an HEVC encoder, respectively, while still maintaining a correct classification by the CNN. This study also showed that pretrained AlexNet CNN was making use of JPEG artifacts learned during the training phase to perform classification. To further study the impact of compression on CNN-based classification, a large set of encoding parameters was explored: color-space, resolution, Quantization Parameter (QP). Main conclusions of this study are that color is essential for classification with AlexNet CNN, and that classification is resilient to image downscaling. Finally, we have studied the correlation between classification performance of a CNN and image quality measured with two objective metrics, namely the Peak Signal to Noise Ratio (PSNR) and the Structural SIMilarity (SSIM). We have found that the SSIM metrics was more appropriate to measure the degradation of an image with regards the CNN performance.

[1] Dean Pomerleau,et al. Efficient Training of Artificial Neural Networks for Autonomous Navigation , 1991, Neural Computation.

[2] Richard F. Haines,et al. The effects of video compression on acceptability of images for monitoring life sciences experiments , 1992 .

[3] Touradj Ebrahimi,et al. The JPEG 2000 still image compression standard , 2001, IEEE Signal Process. Mag..

[4] Y. LeCun,et al. Learning methods for generic object recognition with invariance to pose and lighting , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[5] Saraju P. Mohanty,et al. SBPG: A secure better portable graphics compression architecture for high speed trusted image communication in the IoT , 2016, 2016 17th International Conference on Thermal, Mechanical and Multi-Physics Simulation and Experiments in Microelectronics and Microsystems (EuroSimE).

[6] Seyed-Mohsen Moosavi-Dezfooli,et al. DeepFool: A Simple and Accurate Method to Fool Deep Neural Networks , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7] Huai Li,et al. Artificial convolution neural network for medical image pattern recognition , 1995, Neural Networks.

[8] Joan Bruna,et al. Intriguing properties of neural networks , 2013, ICLR.

[9] Michael S. Bernstein,et al. ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[10] Gregory K. Wallace,et al. The JPEG still picture compression standard , 1992 .

[11] Lawrence D. Jackel,et al. Handwritten Digit Recognition with a Back-Propagation Network , 1989, NIPS.

[12] Forrest N. Iandola,et al. SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <1MB model size , 2016, ArXiv.

[13] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[14] Trevor Darrell,et al. Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[15] Guigang Zhang,et al. Deep Learning , 2016, Int. J. Semantic Comput..