LC-GAN: Image-to-image Translation Based on Generative Adversarial Network for Endoscopic Images

Intelligent vision is appealing in computer-assisted and robotic surgeries. Vision-based analysis with deep learning usually requires large labeled datasets, but manual data labeling is expensive and time-consuming in medical problems. We investigate a novel cross-domain strategy to reduce the need for manual data labeling by proposing an image-to-image translation model, live-cadaver GAN (LC-GAN), based on generative adversarial networks (GANs). We consider a situation in which a labeled cadaveric surgery dataset is available while the task is instrument segmentation on an unlabeled live surgery dataset. We train LC-GAN to learn the mappings between the cadaveric and live images. For live image segmentation, we first translate the live images to fake-cadaveric images with LC-GAN and then perform segmentation on the fake-cadaveric images with models trained on the real cadaveric dataset. The proposed method makes full use of the labeled cadaveric dataset for live image segmentation without the need to label the live dataset. LC-GAN has two generators with different architectures that leverage the deep feature representation learned from the cadaveric-image-based segmentation task. Moreover, we propose a structural similarity loss and a segmentation consistency loss to improve the semantic consistency during translation. Our model achieves better image-to-image translation and leads to improved segmentation performance in the proposed cross-domain segmentation task.
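The abstract names a structural similarity loss but does not give its form. As a rough illustration only, the sketch below computes a single-window (global) variant of the standard SSIM index and turns it into a loss as 1 - SSIM. The function names, the global-statistics simplification, the flat-list image representation, and the constants k1 = 0.01 and k2 = 0.03 are assumptions made for this example, not the paper's formulation, which may use local windows, multiple scales, or a different weighting.

```python
from typing import Sequence


def ssim_global(x: Sequence[float], y: Sequence[float],
                data_range: float = 1.0,
                k1: float = 0.01, k2: float = 0.03) -> float:
    """Single-window SSIM over two flattened grayscale images.

    Assumption: statistics are taken over the whole image instead of
    sliding local windows, which keeps the sketch short.
    """
    if len(x) != len(y) or len(x) == 0:
        raise ValueError("images must be non-empty and the same size")

    n = len(x)
    # Stabilising constants from the standard SSIM definition.
    c1 = (k1 * data_range) ** 2
    c2 = (k2 * data_range) ** 2

    mu_x = sum(x) / n
    mu_y = sum(y) / n
    var_x = sum((a - mu_x) ** 2 for a in x) / n
    var_y = sum((b - mu_y) ** 2 for b in y) / n
    cov_xy = sum((a - mu_x) * (b - mu_y) for a, b in zip(x, y)) / n

    numerator = (2 * mu_x * mu_y + c1) * (2 * cov_xy + c2)
    denominator = (mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2)
    return numerator / denominator


def structural_similarity_loss(source: Sequence[float],
                               translated: Sequence[float],
                               data_range: float = 1.0) -> float:
    """Hypothetical loss: small when the translated image keeps the
    structure of the source image, larger when structure is lost."""
    return 1.0 - ssim_global(source, translated, data_range)


if __name__ == "__main__":
    image = [i / 63 for i in range(64)]      # toy 8x8 intensity ramp
    inverted = [1.0 - v for v in image]      # structure reversed
    print(structural_similarity_loss(image, image))     # close to 0
    print(structural_similarity_loss(image, inverted))  # clearly larger
```

In a training setting, a term of this kind would presumably be computed between an input image and its translation with a differentiable tensor library, so that the generator is penalised when translation alters instrument and tissue structure.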
