Reducing Annotating Load: Active Learning with Synthetic Images in Surgical Instrument Segmentation

Accurate instrument segmentation in endoscopic vision of robot-assisted surgery is challenging due to reflection on the instruments and frequent contacts with tissue. Deep neural networks (DNN) show competitive performance and are in favor in recent years. However, DNN’s hunger for labeled data poses a huge workload of annotation. Motivated by alleviating this workload, we propose a general embeddable method to decrease the usage of labeled real images, using active generated synthetic images. In each active learning iteration, the most informative unlabeled images are first queried by active learning and then labeled. Next, synthetic images are generated based on these selected images. The instruments and backgrounds are cropped out and randomly combined with each other with blending and fusion near the boundary. The effectiveness of the proposed method is validated on 2 sinus surgery datasets and 1 intraabdominal surgery dataset. The results indicate a considerable improvement in performance, especially when the budget for annotation is small. The effectiveness of different types of synthetic images, blending methods, and external background are also studied. All the code is open-sourced at: https://github.com/HaonanPeng/active syn generator.

[1]  Blake Hannaford,et al.  Towards Better Surgical Instrument Segmentation in Endoscopic Vision: Multi-Angle Feature Aggregation and Contour Supervision , 2020, IEEE Robotics and Automation Letters.

[2]  Alexandr A. Kalinin,et al.  Medical Image Segmentation Using Deep Neural Networks with Pre-trained Encoders , 2020 .

[3]  Allan Hanbury,et al.  Metrics for evaluating 3D medical image segmentation: analysis, selection, and tool , 2015, BMC Medical Imaging.

[4]  Khalid Raza,et al.  Medical Image Generation Using Generative Adversarial Networks: A Review , 2021, Health Informatics.

[5]  Yang Lei,et al.  A review on medical imaging synthesis using deep learning and its clinical applications , 2020, Journal of applied clinical medical physics.

[6]  Ismail Ben Ayed,et al.  Deep Active Learning for Joint Classification & Segmentation with Weak Annotator , 2020, 2021 IEEE Winter Conference on Applications of Computer Vision (WACV).

[7]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[8]  Hongliang Ren,et al.  Learning Where to Look While Tracking Instruments in Robot-assisted Surgery , 2019, MICCAI.

[9]  Nima Tajbakhsh,et al.  Embracing Imperfect Datasets: A Review of Deep Learning Solutions for Medical Image Segmentation , 2019, Medical Image Anal..

[10]  Xavier Giró-i-Nieto,et al.  Cost-Effective Active Learning for Melanoma Segmentation , 2017, NIPS 2017.

[11]  George Papandreou,et al.  Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation , 2018, ECCV.

[12]  Bernhard Kainz,et al.  A Survey on Active Learning and Human-in-the-Loop Deep Learning for Medical Image Analysis , 2019, Medical Image Anal..

[13]  Pietro Perona,et al.  Entropy-based active learning for object recognition , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[14]  Dana Angluin,et al.  Queries and concept learning , 1988, Machine Learning.

[15]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[16]  Bo Chen,et al.  MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications , 2017, ArXiv.

[17]  Lyle H. Ungar,et al.  Machine Learning manuscript No. (will be inserted by the editor) Active Learning for Logistic Regression: , 2007 .

[18]  Alexander Rakhlin,et al.  Automatic Instrument Segmentation in Robot-Assisted Surgery Using Deep Learning , 2018, bioRxiv.

[19]  Lucas J. van Vliet,et al.  Recursive implementation of the Gaussian filter , 1995, Signal Process..

[20]  Quoc V. Le,et al.  Simple Copy-Paste is a Strong Data Augmentation Method for Instance Segmentation , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Lena Maier-Hein,et al.  2017 Robotic Instrument Segmentation Challenge , 2019, ArXiv.

[22]  O. Abe,et al.  Deep Learning Approach for Generating MRA Images From 3D Quantitative Synthetic MRI Without Additional Scans. , 2020, Investigative radiology.

[23]  Brian S. Peters,et al.  Review of emerging surgical robotic technology , 2018, Surgical Endoscopy.

[24]  Yvan Saeys,et al.  Cost-Efficient Segmentation of Electron Microscopy Images Using Active Learning , 2019, BNAIC/BENELEARN.

[25]  Juan Lavista Ferres,et al.  Reducing bias and increasing utility by federated generative modeling of medical images using a centralized adversary , 2021, GoodIT.

[26]  Lin Yang,et al.  Suggestive Annotation: A Deep Active Learning Framework for Biomedical Image Segmentation , 2017, MICCAI.

[27]  Arash J. Sayari,et al.  Review of robotic-assisted surgery: what the future looks like through a spine oncology lens. , 2019, Annals of translational medicine.

[28]  Wojciech Zaremba,et al.  Domain randomization for transferring deep neural networks from simulation to the real world , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[29]  Danail Stoyanov,et al.  Robotic Instrument Segmentation With Image-to-Image Translation , 2021, IEEE Robotics and Automation Letters.

[30]  Namkug Kim,et al.  Active learning for accuracy enhancement of semantic segmentation with CNN-corrected label curations: Evaluation on kidney segmentation in abdominal CT , 2020, Scientific Reports.

[31]  Josien P. W. Pluim,et al.  Not‐so‐supervised: A survey of semi‐supervised, multi‐instance, and transfer learning in medical image analysis , 2018, Medical Image Anal..

[32]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[33]  Zoubin Ghahramani,et al.  Bayesian Active Learning for Classification and Preference Learning , 2011, ArXiv.

[34]  B. Hannaford,et al.  LC-GAN: Image-to-image Translation Based on Generative Adversarial Network for Endoscopic Images , 2020, 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[35]  Zoubin Ghahramani,et al.  Deep Bayesian Active Learning with Image Data , 2017, ICML.

[36]  Wouter M. Kouw,et al.  A Review of Domain Adaptation without Target Labels , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[37]  Martial Hebert,et al.  Cut, Paste and Learn: Surprisingly Easy Synthesis for Instance Detection , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[38]  Matthew A. Brown,et al.  Learning to Segment via Cut-and-Paste , 2018, ECCV.

[39]  Tae Keun Yoo,et al.  A generative adversarial network approach to predicting postoperative appearance after orbital decompression surgery for thyroid eye disease , 2020, Comput. Biol. Medicine.