VR-Caps: A Virtual Environment for Capsule Endoscopy

Current capsule endoscopes and next-generation robotic capsules for diagnosis and treatment of gastrointestinal diseases are complex cyber-physical platforms that must orchestrate complex software and hardware functions. The desired tasks for these systems include visual localization, depth estimation, 3D mapping, disease detection and segmentation, automated navigation, active control, path realization and optional therapeutic modules such as targeted drug delivery and biopsy sampling. Data-driven algorithms promise to enable many advanced functionalities for capsule endoscopes, but real-world data is challenging to obtain. Physically-realistic simulations providing synthetic data have emerged as a solution to the development of data-driven algorithms. In this work, we present a comprehensive simulation platform for capsule endoscopy operations and introduce VR-Caps, a virtual active capsule environment that simulates a range of normal and abnormal tissue conditions (e.g., inflated, dry, wet etc.) and varied organ types, capsule endoscope designs (e.g., mono, stereo, dual and 360°camera), and the type, number, strength, and placement of internal and external magnetic sources that enable active locomotion. VR-Caps makes it possible to both independently or jointly develop, optimize, and test medical imaging and analysis software for the current and next-generation endoscopic capsule systems. To validate this approach, we train state-of-the-art deep neural networks to accomplish various medical image analysis tasks using simulated data from VR-Caps and evaluate the performance of these models on real medical data. Results demonstrate the usefulness and effectiveness of the proposed virtual platform in developing algorithms that quantify fractional coverage, camera trajectory, 3D map reconstruction, and disease classification.

[1]  C. Dooley,et al.  Variability of migrating motor complex in humans , 1992, Digestive Diseases and Sciences.

[2]  Kazuhiro Kosuge,et al.  An Analytical Solution for a Redundant Manipulator with Seven Degrees of Freedom , 2012 .

[3]  Jan Tack,et al.  The migrating motor complex: control mechanisms and its role in health and disease , 2012, Nature Reviews Gastroenterology &Hepatology.

[4]  Ian D. Reid,et al.  Unsupervised Learning of Monocular Depth Estimation and Visual Odometry with Deep Feature Reconstruction , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[5]  Stephen M. Moore,et al.  The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository , 2013, Journal of Digital Imaging.

[6]  Richard J. Chen,et al.  Polyp segmentation and classification using predicted depth from monocular endoscopy , 2019, Medical Imaging.

[7]  Leonidas J. Guibas,et al.  Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[8]  Kun Dong Wang,et al.  Research on measurement and modeling of the gastro intestine's frictional characteristics , 2009 .

[9]  Richard J. Chen,et al.  Deep Adversarial Training for Multi-Organ Nuclei Segmentation in Histopathology Images , 2018, IEEE Transactions on Medical Imaging.

[10]  Nilanjan Dey,et al.  Wireless Capsule Gastrointestinal Endoscopy: Direction-of-Arrival Estimation Based Localization Survey , 2017, IEEE Reviews in Biomedical Engineering.

[11]  Vladimir J. Lumelsky,et al.  A terrain-covering algorithm for an AUV , 1996, Auton. Robots.

[12]  Tae-Kyeong Lee,et al.  Integrated On-Line Localization, Mapping and Coverage Algorithm of Unknown Environments for Robotic Vacuum Cleaners Based on Minimal Sensing , 2011, Adv. Robotics.

[13]  Faisal Mahmood,et al.  Unsupervised Reverse Domain Adaptation for Synthetic Medical Images via Adversarial Training , 2017, IEEE Transactions on Medical Imaging.

[14]  Lakmal Seneviratne,et al.  A Proposed Clinical Evaluation of a Simulation Environment for Magnetically-Driven Active Endoscopic Capsules. , 2019, Advances in experimental medicine and biology.

[15]  Helder Araujo,et al.  Quantitative Evaluation of Endoscopic SLAM Methods: EndoSLAM Dataset , 2020, ArXiv.

[16]  Metin Sitti,et al.  Design and Rolling Locomotion of a Magnetically Actuated Soft Capsule Endoscope , 2012, IEEE Transactions on Robotics.

[17]  Sung Won Park,et al.  Risk factors of missed colorectal lesions after colonoscopy , 2017, Medicine.

[18]  Ramesh Jain,et al.  Hookworm Detection in Wireless Capsule Endoscopy Images With Deep Learning , 2018, IEEE Transactions on Image Processing.

[19]  Yasin Almalioglu,et al.  SelfVIO: Self-Supervised Deep Monocular Visual-Inertial Odometry and Depth Estimation , 2019, ArXiv.

[20]  Matthew A. Brown,et al.  Automatic Panoramic Image Stitching using Invariant Features , 2007, International Journal of Computer Vision.

[21]  Masatoshi Okutomi,et al.  3D Reconstruction of Whole Stomach from Endoscope Video Using Structure-from-Motion , 2019, 2019 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[22]  Yongmin Zhong,et al.  Deformable Models for Surgical Simulation: A Survey , 2019, IEEE Reviews in Biomedical Engineering.

[23]  Shuhei Nomura,et al.  Automatic anatomical classification of esophagogastroduodenoscopy images using deep convolutional neural networks , 2018, Scientific Reports.

[24]  Toku Takahashi Interdigestive migrating motor complex -its mechanism and clinical importance , 2014, Journal of smooth muscle research = Nihon Heikatsukin Gakkai kikanshi.

[25]  Argentina Leite,et al.  A Deep Learning Approach for Red Lesions Detection in Video Capsule Endoscopies , 2018, ICIAR.

[26]  Paolo Dario,et al.  Vision-based haptic feedback for capsule endoscopy navigation: a proof of concept , 2016 .

[27]  Chunhua Shen,et al.  Unsupervised Scale-consistent Depth and Ego-motion Learning from Monocular Video , 2019, NeurIPS.

[28]  N. Coleman,et al.  Colorectal cancer screening: prospects for molecular stool analysis , 2005, Nature Reviews Cancer.

[29]  P. Dario,et al.  Capsule Endoscopy: From Current Achievements to Open Challenges , 2011, IEEE Reviews in Biomedical Engineering.

[30]  Richard J. Chen,et al.  SLAM Endoscopy enhanced by adversarial depth prediction , 2019, ArXiv.

[31]  M. Chiao,et al.  A magnetically controlled MEMS device for drug delivery: design, fabrication, and testing. , 2011, Lab on a chip.

[32]  Stefan Leutenegger,et al.  ElasticFusion: Real-time dense SLAM and light source estimation , 2016, Int. J. Robotics Res..

[33]  P. Dario,et al.  Magnetic air capsule robotic system: proof of concept of a novel approach for painless colonoscopy , 2012, Surgical Endoscopy.

[34]  Howie Choset,et al.  Coverage for robotics – A survey of recent results , 2001, Annals of Mathematics and Artificial Intelligence.

[35]  Faisal Mahmood,et al.  Deep learning and conditional random fields‐based depth estimation and topographical reconstruction from conventional endoscopy , 2017, Medical Image Anal..

[36]  C. Prinz,et al.  The effect of sedation on the quality of upper gastrointestinal endoscopy: an investigator-blinded, randomized study comparing propofol with midazolam. , 2007, Endoscopy.

[37]  Christian Duriez,et al.  SOFA: A Multi-Model Framework for Interactive Physical Simulation , 2012 .

[38]  Youngbae Hwang,et al.  Recent Development of Computer Vision Technology to Improve Capsule Endoscopy , 2019, Clinical endoscopy.

[39]  Hongyi Li,et al.  Modeling of Velocity-dependent Frictional Resistance of a Capsule Robot Inside an Intestine , 2012, Tribology Letters.

[40]  Takeo Kanade,et al.  Learning scene-specific pedestrian detectors without real data , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  Faisal Mahmood,et al.  2D Discrete Fourier Transform with simultaneous edge artifact removal for real-time applications , 2015, 2015 International Conference on Field Programmable Technology (FPT).

[42]  Zhengyou Zhang,et al.  A Flexible New Technique for Camera Calibration , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[43]  Sen Wang,et al.  DeepVO: Towards end-to-end visual odometry with deep Recurrent Convolutional Neural Networks , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[44]  Faisal Mahmood,et al.  Deep learning-based depth estimation from a synthetic endoscopy image training set , 2018, Medical Imaging.

[45]  Marwan Mattar,et al.  Unity: A General Platform for Intelligent Agents , 2018, ArXiv.

[46]  Alec Radford,et al.  Proximal Policy Optimization Algorithms , 2017, ArXiv.

[47]  Yasin Almalioglu,et al.  EndoL2H: Deep Super-Resolution for Capsule Endoscopy , 2020, IEEE Transactions on Medical Imaging.

[48]  Metin Sitti,et al.  Biopsy using a Magnetic Capsule Endoscope Carrying, Releasing, and Retrieving Untethered Microgrippers , 2014, IEEE Transactions on Biomedical Engineering.

[49]  Alan L. Yuille,et al.  UnrealCV: Connecting Computer Vision to Unreal Engine , 2016, ECCV Workshops.

[50]  Michael Riegler,et al.  KVASIR: A Multi-Class Image Dataset for Computer Aided Gastrointestinal Disease Detection , 2017, MMSys.

[51]  Kate Saenko,et al.  Learning Deep Object Detectors from 3D Models , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[52]  J. Feher 8.3 – Intestinal and Colonic Chemoreception and Motility , 2017 .

[53]  Helder Araújo,et al.  Deep EndoVO: A recurrent convolutional neural network (RCNN) based visual odometry approach for endoscopic capsule robots , 2017, Neurocomputing.

[54]  Howie Choset,et al.  Path Planning for Robotic Demining: Robust Sensor-Based Coverage of Unstructured Environments and Probabilistic Methods , 2003, Int. J. Robotics Res..

[55]  Natália D. Aredes,et al.  Radiology Data from The Cancer Genome Atlas Stomach Adenocarcinoma [TCGA-STAD] collection , 2016 .