Human Segmentation with Deep Contour-Aware Network

Human detection and segmenting are important computer vision problems with applications in indexing, surveillance, 3D reconstruction and action recognition. The figure-ground segmentation of humans in images captured in real-world environment is a challenge problem due to a variety of viewpoints, articulated skeletal structure, complex backgrounds, varying body proportions and clothing, etc. In this paper, we proposed a new approach to human segmentation in still images based on Deep Contour-Aware Network (DCAN), which is a unified multi-task deep learning framework combining the complementary object and contour information simultaneously for better segmentation performance. Experimental results on a large-scale human dataset indicates our human segmentation method can achieve a marginally better segmentation accuracy than the state of the art works.

[1]  Liang Wang,et al.  1000fps human segmentation with deep convolutional neural networks , 2015, 2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR).

[2]  José García Rodríguez,et al.  A Review on Deep Learning Techniques Applied to Semantic Segmentation , 2017, ArXiv.

[3]  Jitendra Malik,et al.  Iterative Instance Segmentation , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Hao Jiang,et al.  Detangling People: Individuating Multiple Close People and Their Body Parts via Region Assembly , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Xiaoou Tang,et al.  Facial Landmark Detection by Deep Multi-task Learning , 2014, ECCV.

[6]  Arati Dandavate,et al.  Semantic Texton Forests for Image Categorization and Segmentation , 2018, IJARCCE.

[7]  Cristian Sminchisescu,et al.  CPMC: Automatic Object Segmentation Using Constrained Parametric Min-Cuts , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Yanli Li,et al.  Combining Shape and Appearance for Automatic Pedestrian Segmentation , 2011, 2011 IEEE 23rd International Conference on Tools with Artificial Intelligence.

[9]  Antonio Criminisi,et al.  TextonBoost for Image Understanding: Multi-Class Object Recognition and Segmentation by Jointly Modeling Texture, Layout, and Context , 2007, International Journal of Computer Vision.

[10]  Philip H. S. Torr,et al.  Holistic, Instance-Level Human Parsing , 2017, BMVC.

[11]  Jerome Karabel,et al.  Unequal Opportunity: Student Access to the University of California , 2003 .

[12]  Tieniu Tan,et al.  Early Hierarchical Contexts Learned by Convolutional Networks for Image Segmentation , 2014, 2014 22nd International Conference on Pattern Recognition.

[13]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[15]  Cristian Sminchisescu,et al.  Semantic Segmentation with Second-Order Pooling , 2012, ECCV.

[16]  Hao Chen,et al.  DCAN: Deep contour‐aware networks for object instance segmentation from histology images , 2017, Medical Image Anal..

[17]  Zhuowen Tu,et al.  Auto-Context and Its Application to High-Level Vision Tasks and 3D Brain Image Segmentation , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Nitish Srivastava,et al.  Improving neural networks by preventing co-adaptation of feature detectors , 2012, ArXiv.

[19]  R. Zemel,et al.  Multiscale conditional random fields for image labeling , 2004, CVPR 2004.

[20]  Jitendra Malik,et al.  Simultaneous Detection and Segmentation , 2014, ECCV.

[21]  Rob Fergus,et al.  Predicting Depth, Surface Normals and Semantic Labels with a Common Multi-scale Convolutional Architecture , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[22]  Vladlen Koltun,et al.  Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials , 2011, NIPS.

[23]  Jian Sun,et al.  Instance-Aware Semantic Segmentation via Multi-task Network Cascades , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24]  Jitendra Malik,et al.  Hypercolumns for object segmentation and fine-grained localization , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[26]  Jitendra Malik,et al.  An empirical approach to grouping and segmentation , 2002 .

[27]  Iasonas Kokkinos,et al.  Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs , 2014, ICLR.

[28]  Yu Liu,et al.  On the Exploration of Convolutional Fusion Networks for Visual Recognition , 2016, MMM.

[29]  Miguel Á. Carreira-Perpiñán,et al.  Multiscale conditional random fields for image labeling , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[30]  Hao Chen,et al.  Automatic Localization and Identification of Vertebrae in Spine CT via a Joint Learning Model with Deep Neural Networks , 2015, MICCAI.

[31]  Iasonas Kokkinos,et al.  DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32]  Serge J. Belongie,et al.  Pose2Instance: Harnessing Keypoints for Person Instance Segmentation , 2017, ArXiv.