Thermal Infrared Pedestrian Segmentation Based on Conditional GAN

A novel thermal infrared pedestrian segmentation algorithm based on a conditional generative adversarial network (IPS-cGAN) is proposed for intelligent vehicular applications. The convolutional backbone of the generator is an improved U-Net with residual blocks, which makes better use of regional semantic information. Moreover, a cross-entropy segmentation loss is introduced as the condition for the generator. SandwichNet, a novel convolutional network with symmetrical input, is proposed as the discriminator that distinguishes real from fake segmented images. Within this cGAN framework, good segmentation performance is achieved for pedestrians in thermal infrared images. Compared with several supervised and unsupervised segmentation algorithms, the proposed algorithm achieves higher accuracy and better robustness, especially in complex scenes.
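
The way a cross-entropy segmentation term can be combined with an adversarial term in a generator objective may be easier to see in a small numerical sketch. The code below is only an illustration under stated assumptions, not the paper's implementation: the function names, the additive form of the loss, and the `seg_weight` balancing factor are all hypothetical, and the discriminator scores are passed in as plain arrays rather than produced by SandwichNet.

```python
import numpy as np


def binary_cross_entropy(pred, target, eps=1e-7):
    """Mean binary cross entropy between probabilities `pred` and labels `target`."""
    pred = np.clip(pred, eps, 1.0 - eps)  # avoid log(0)
    return float(-np.mean(target * np.log(pred)
                          + (1.0 - target) * np.log(1.0 - pred)))


def generator_loss(d_score_fake, pred_mask, true_mask, seg_weight=1.0):
    """Hypothetical generator objective: adversarial term + weighted segmentation term.

    d_score_fake: discriminator probabilities that generated masks are real.
    pred_mask:    per-pixel pedestrian probabilities from the generator.
    true_mask:    ground-truth binary mask (1 = pedestrian).
    seg_weight:   assumed balancing factor, not taken from the paper.
    """
    # The generator is rewarded when the discriminator scores its output as real (1).
    adversarial = binary_cross_entropy(d_score_fake, np.ones_like(d_score_fake))
    # Pixel-wise cross entropy ties the output to the ground-truth segmentation.
    segmentation = binary_cross_entropy(pred_mask, true_mask)
    return adversarial + seg_weight * segmentation


def discriminator_loss(d_score_real, d_score_fake):
    """Standard GAN discriminator objective: real masks -> 1, generated masks -> 0."""
    real = binary_cross_entropy(d_score_real, np.ones_like(d_score_real))
    fake = binary_cross_entropy(d_score_fake, np.zeros_like(d_score_fake))
    return real + fake


if __name__ == "__main__":
    true_mask = np.array([[0.0, 1.0], [1.0, 0.0]])
    close_mask = np.array([[0.1, 0.9], [0.8, 0.2]])  # near the ground truth
    far_mask = 1.0 - close_mask                      # mostly wrong
    d_fake = np.array([0.5])                         # undecided discriminator
    print(generator_loss(d_fake, close_mask, true_mask))
    print(generator_loss(d_fake, far_mask, true_mask))
```

With the discriminator score held fixed, the mask closer to the ground truth yields the lower generator loss, which is the effect the segmentation term is meant to have in this sketch.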
