Human Segmentation Based on Compressed Deep Convolutional Neural Network

Most semantic segmentation models based on deep convolutional neural network (CNN) typically require a large number of weight parameters, high hardware resources for storage and computation. Moreover, redesigning a compact network suffers from some training problems, such as under-fitting. A human segmentation algorithm is proposed based on compressed deep CNN to optimize the convolution layers and filters. PSPNet-50 is fine-tuned on the human segmentation dataset to obtain the human segmentation model with higher accuracy. Then the convolutional-layer level pruning and corresponding structure optimization are performed so that the parameters of the model are substantially reduced. Finally, the two-stage global filter-level pruning strategy is used. Compared with the method of layer by layer pruning and retraining, our strategy not only reduces parameters of the model and saves the time of retraining, but also keeps the high IoU (Intersection over Union) accuracy. In addition, by adding auxiliary losses in the network during training CNN, the supervised training of the network is improved, and IoU is further increased. Compared to the model before compression, the sufficient experiments show that the parameter number, computation cost, memory consumption, and parameter storage are decreased by 1/7.5, 5.6/6.6, 0.7/1, 6.5/7.5, respectively, while the segmentation speed is accelerated by 2.4 times, and IoU on test set reaches 93.2%.

[1]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[2]  Lu Leng,et al.  PalmHash Code vs. PalmPhasor Code , 2013, Neurocomputing.

[3]  Xiangyu Zhang,et al.  Channel Pruning for Accelerating Very Deep Neural Networks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[4]  Song Han,et al.  Learning both Weights and Connections for Efficient Neural Network , 2015, NIPS.

[5]  Hanan Samet,et al.  Pruning Filters for Efficient ConvNets , 2016, ICLR.

[6]  Fan Zhang,et al.  Slim and Efficient Neural Network Design for Resource-Constrained SAR Target Recognition , 2018, Remote. Sens..

[7]  Lin Xu,et al.  Incremental Network Quantization: Towards Lossless CNNs with Low-Precision Weights , 2017, ICLR.

[8]  Sylvain Paris,et al.  Automatic Portrait Segmentation for Image Stylization , 2016, Comput. Graph. Forum.

[9]  Bolei Zhou,et al.  Semantic Understanding of Scenes Through the ADE20K Dataset , 2016, International Journal of Computer Vision.

[10]  Forrest N. Iandola,et al.  SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <1MB model size , 2016, ArXiv.

[11]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[13]  Xiaogang Wang,et al.  Pyramid Scene Parsing Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Andrew Beng Jin Teoh,et al.  Alignment-free row-co-occurrence cancelable palmprint Fuzzy Vault , 2015, Pattern Recognit..

[15]  Russell Reed,et al.  Pruning algorithms-a survey , 1993, IEEE Trans. Neural Networks.

[16]  Greg Mori,et al.  CLIP-Q: Deep Network Compression Learning by In-parallel Pruning-Quantization , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[17]  Guangzhong Cao,et al.  Automated Product Boundary Defect Detection Based on Image Moment Feature Anomaly , 2019, IEEE Access.

[18]  Khairan D. Rajab,et al.  New Associative Classification Method Based on Rule Pruning for Classification of Datasets , 2019, IEEE Access.

[19]  Vladlen Koltun,et al.  Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials , 2011, NIPS.

[20]  Marie-Pierre Jolly,et al.  Interactive graph cuts for optimal boundary & region segmentation of objects in N-D images , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[21]  Ming Li,et al.  Dual-source discrimination power analysis for multi-instance contactless palmprint recognition , 2015, Multimedia Tools and Applications.

[22]  Iasonas Kokkinos,et al.  DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Guopu Zhu,et al.  Boundary-based image segmentation using binary level set method , 2007 .

[24]  Huimin Lu,et al.  Multi-scale deep context convolutional neural networks for semantic segmentation , 2017, World Wide Web.

[25]  Mark Sandler,et al.  MobileNetV2: Inverted Residuals and Linear Bottlenecks , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[26]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Fan Zhang,et al.  A lossless lightweight CNN design for SAR target recognition , 2020 .

[28]  Song Han,et al.  Deep Compression: Compressing Deep Neural Network with Pruning, Trained Quantization and Huffman Coding , 2015, ICLR.

[29]  Jianxin Wu,et al.  ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[30]  Zhiqiang Shen,et al.  Learning Efficient Convolutional Networks through Network Slimming , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[31]  Leo Grady,et al.  Random Walks for Image Segmentation , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32]  Qingming Huang,et al.  Relay Backpropagation for Effective Learning of Deep Convolutional Neural Networks , 2015, ECCV.

[33]  Larry S. Davis,et al.  NISP: Pruning Networks Using Neuron Importance Score Propagation , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[34]  Andrew Blake,et al.  "GrabCut" , 2004, ACM Trans. Graph..

[35]  Luc Van Gool,et al.  The Pascal Visual Object Classes Challenge: A Retrospective , 2014, International Journal of Computer Vision.

[36]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[37]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[38]  Guillermo Sapiro,et al.  Geodesic Matting: A Framework for Fast Interactive Image and Video Segmentation and Matting , 2009, International Journal of Computer Vision.

[39]  Ming Yang,et al.  Compressing Deep Convolutional Networks using Vector Quantization , 2014, ArXiv.

[40]  George Papandreou,et al.  Rethinking Atrous Convolution for Semantic Image Segmentation , 2017, ArXiv.

[41]  Harry Shum,et al.  Paint selection , 2009, ACM Trans. Graph..

[42]  Tieniu Tan,et al.  Early Hierarchical Contexts Learned by Convolutional Networks for Image Segmentation , 2014, 2014 22nd International Conference on Pattern Recognition.