Modified Perceptual Cycle Generative Adversarial Network-Based Image Enhancement for Improving Accuracy of Low Light Image Segmentation

In recent years, semantic segmentation has become increasingly important because autonomous vehicles and artificial intelligence (AI)-based robots are being researched extensively and require methods for accurately recognizing objects. Previous state-of-the-art segmentation methods have proven effective on databases captured during daytime. However, in extremely low light or nighttime environments, the shape and color information of objects is greatly diminished or lost because of insufficient external light, which makes it difficult to train the segmentation network and significantly degrades performance. In our previous work, segmentation performance in low light environments was improved using an enhancement-based segmentation method. However, because only per-pixel loss functions were used to train the enhancement network, low light images could not be restored precisely and the improvement in segmentation performance was limited. To overcome these drawbacks, we propose a low light image segmentation method based on a modified perceptual cycle generative adversarial network (CycleGAN). Our network performs perceptual image enhancement, which significantly improves segmentation performance. Unlike the existing perceptual loss, our loss is the Euclidean distance between feature maps extracted from the pretrained segmentation network. In our experiments, we used low light databases generated from two widely used open road scene databases, the Cambridge-driving Labeled Video Database (CamVid) and the Karlsruhe Institute of Technology and Toyota Technological Institute at Chicago (KITTI) database, and confirmed that the proposed method achieves better segmentation performance in extremely low light environments than existing state-of-the-art methods.
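The idea of a feature-space loss described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: `extract_features` is a hypothetical stand-in for a frozen, pretrained segmentation network (here just average pooling and horizontal gradients), and summing an unnormalized Euclidean distance over feature maps is an assumption, since the abstract does not state the layers, weighting, or normalization used.

```python
import math


def extract_features(image):
    """Hypothetical stand-in for a frozen, pretrained segmentation encoder.

    `image` is a 2D list of floats. Returns two toy feature maps:
    2x2 average pooling and horizontal gradients. A real system would
    take intermediate activations of the segmentation network instead.
    """
    h, w = len(image), len(image[0])
    pooled = [
        [
            (image[r][c] + image[r][c + 1] + image[r + 1][c] + image[r + 1][c + 1]) / 4.0
            for c in range(0, w - 1, 2)
        ]
        for r in range(0, h - 1, 2)
    ]
    grad = [[image[r][c + 1] - image[r][c] for c in range(w - 1)] for r in range(h)]
    return [pooled, grad]


def feature_euclidean_loss(enhanced, reference):
    """Sum of Euclidean distances between corresponding feature maps.

    Assumption: each feature map contributes its plain L2 distance,
    with no per-layer weights or size normalization.
    """
    total = 0.0
    for feat_e, feat_r in zip(extract_features(enhanced), extract_features(reference)):
        squared = sum(
            (a - b) ** 2
            for row_e, row_r in zip(feat_e, feat_r)
            for a, b in zip(row_e, row_r)
        )
        total += math.sqrt(squared)
    return total


# Usage: compare a reference image with a darkened copy of itself.
reference = [
    [0.2, 0.8, 0.4, 0.6],
    [0.9, 0.1, 0.5, 0.3],
    [0.3, 0.7, 0.6, 0.2],
    [0.5, 0.4, 0.1, 0.9],
]
darkened = [[0.1 * value for value in row] for row in reference]

loss_same = feature_euclidean_loss(reference, reference)
loss_dark = feature_euclidean_loss(darkened, reference)
print(loss_same, loss_dark)
```

In a training setup, a term of this kind would be added to the enhancement network's objective so that the enhanced image is pushed toward the reference in the feature space the segmentation network relies on, rather than only in pixel space.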
