Deep Image Segmentation by Quality Inference

Traditionally, convolutional neural networks are trained for semantic segmentation by having an image given as input and the segmented mask as output. In this work, we propose a neural network trained by being given an image and mask pair, with the output being the quality of that pairing. The segmentation is then created afterwards through backpropagation on the mask. This allows enriching training with semi-supervised synthetic variations on the ground-truth. The proposed iterative segmentation technique allows improving an existing segmentation or creating one from scratch. We compare the performance of the proposed methodology with state-of-the-art deep architectures for image segmentation and achieve competitive results, being able to improve their segmentations.

[1]  Patrice Y. Simard,et al.  Best practices for convolutional neural networks applied to visual document analysis , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[2]  Peter C. Fishburn,et al.  Utility theory for decision making , 1970 .

[3]  Jianping Fan,et al.  Automatic image segmentation by integrating color-edge extraction and seeded region growing , 2001, IEEE Trans. Image Process..

[4]  Roberto Cipolla,et al.  SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Carolina Chang,et al.  Teeth/Palate and Interdental Segmentation Using Artificial Neural Networks , 2012, ANNPR.

[6]  Jaime S. Cardoso,et al.  Temporal Segmentation of Digital Colposcopies , 2015, IbPRIA.

[7]  Jaime S. Cardoso,et al.  Transfer Learning with Partial Observability Applied to Cervical Cancer Screening , 2017, IbPRIA.

[8]  Jaime S. Cardoso,et al.  Towards an intelligent medical system for the aesthetic evaluation of breast cancer conservative treatment , 2007, Artif. Intell. Medicine.

[9]  James Philbin,et al.  FaceNet: A unified embedding for face recognition and clustering , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Iasonas Kokkinos,et al.  DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Jaime S. Cardoso,et al.  Learning and ensembling lexicographic preference trees with multiple kernels , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[12]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[13]  C. List,et al.  Social Choice Theory and Deliberative Democracy: A Reconciliation , 2002, British Journal of Political Science.

[14]  Ana F. Sequeira,et al.  MobBIO: A multimodal database captured with a portable handheld device , 2014, 2014 International Conference on Computer Vision Theory and Applications (VISAPP).

[15]  Demetri Terzopoulos,et al.  Snakes: Active contour models , 2004, International Journal of Computer Vision.

[16]  Luís Rosado,et al.  Principal Axes-Based Asymmetry Assessment Methodology for Skin Lesion Image Analysis , 2014, ISVC.

[17]  D. Ellsberg Classic and Current Notions of “Measurable Utility” , 1954 .

[18]  Noel C. F. Codella,et al.  Skin lesion analysis toward melanoma detection: A challenge at the 2017 International symposium on biomedical imaging (ISBI), hosted by the international skin imaging collaboration (ISIC) , 2016, 2018 IEEE 15th International Symposium on Biomedical Imaging (ISBI 2018).

[19]  Vibhav Vineet,et al.  Conditional Random Fields as Recurrent Neural Networks , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[20]  Pedro M. Ferreira,et al.  PH2 - A dermoscopic image database for research and benchmarking , 2013, 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[21]  Rivka Oxman,et al.  The thinking eye: visual re-cognition in design emergence , 2002 .

[22]  Leon van der Torre,et al.  Utilitarian Desires , 2002, Autonomous Agents and Multi-Agent Systems.

[23]  Jonathon Shlens,et al.  Explaining and Harnessing Adversarial Examples , 2014, ICLR.

[24]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[25]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.