SLAM Endoscopy enhanced by adversarial depth prediction

Medical endoscopy remains a challenging application for simultaneous localization and mapping (SLAM) because image features are sparse and size constraints prevent direct depth sensing. We present a SLAM approach that incorporates depth predictions made by an adversarially trained convolutional neural network (CNN) applied to monocular endoscopy images. The depth network is first trained on synthetic images of a simple colon model and then fine-tuned on domain-randomized, photorealistic images rendered from computed tomography measurements of human colons. Each image is paired with an error-free depth map for supervised adversarial learning. Monocular RGB images are then fused with the corresponding depth predictions, enabling dense reconstruction and mosaicking as an endoscope is advanced through the gastrointestinal tract. Our preliminary results demonstrate that incorporating monocular depth estimation into a SLAM architecture can enable dense reconstruction of endoscopic scenes.
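To make the fusion step concrete, the sketch below shows one common way to pair an RGB frame with a per-pixel depth map: back-projecting each pixel into a colored 3-D point through a pinhole camera model. It is an illustration under stated assumptions, not the method used in this work. The function name `backproject_rgbd`, the intrinsics `fx`, `fy`, `cx`, `cy`, and the constant placeholder depth map are all hypothetical, and a full SLAM system would additionally track camera pose and merge points across frames.

```python
import numpy as np

def backproject_rgbd(rgb, depth, fx, fy, cx, cy):
    """Lift an RGB image and its depth map to a colored 3-D point cloud.

    Assumes an ideal pinhole camera (no lens distortion), which real
    endoscopes generally violate; intrinsics here are hypothetical.
    """
    h, w = depth.shape
    # Pixel coordinate grids: u runs along columns, v along rows.
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    z = depth
    x = (u - cx) * z / fx
    y = (v - cy) * z / fy
    points = np.stack([x, y, z], axis=-1).reshape(-1, 3)
    colors = rgb.reshape(-1, 3)
    # Discard pixels without a positive depth value.
    valid = points[:, 2] > 0
    return points[valid], colors[valid]

# Placeholder inputs standing in for an endoscopy frame and the
# depth map a trained network would predict for it.
rgb = np.zeros((4, 6, 3), dtype=np.uint8)
depth = np.full((4, 6), 2.0)
points, colors = backproject_rgbd(rgb, depth, fx=100.0, fy=100.0, cx=3.0, cy=2.0)
```

Each returned point carries the color of the pixel it came from, so the resulting cloud could be handed to a dense mapping back end once a camera pose for the frame is available.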

[7]  Gamini Dissanayake,et al.  MIS-SLAM: Real-Time Large-Scale Dense Deformable SLAM System in Minimal Invasive Surgery Based on Heterogeneous Computing , 2018, IEEE Robotics and Automation Letters.

[8]  Thomas Rösch,et al.  Expert opinions and scientific evidence for colonoscopy key performance indicators , 2016, Gut.

[9]  P. Cotton,et al.  Practical Gastrointestinal Endoscopy , 2000 .

[10]  Douglas K Rex,et al.  Who is the best colonoscopist? , 2007, Gastrointestinal endoscopy.

[11]  Sandra Sudarsky,et al.  Deep learning with cinematic rendering: fine-tuning deep neural networks using photorealistic medical images , 2018, Physics in medicine and biology.

[12]  Stefan Leutenegger,et al.  ElasticFusion: Real-time dense SLAM and light source estimation , 2016, Int. J. Robotics Res..

[13]  Paul Collins,et al.  Colonoscopic withdrawal times and adenoma detection during screening colonoscopy , 2007 .

[14]  Alan L. Yuille,et al.  Rethinking Monocular Depth Estimation with Adversarial Training , 2018, ArXiv.

[15]  P. Bossuyt,et al.  Polyp Miss Rate Determined by Tandem Colonoscopy: A Systematic Review , 2006, The American Journal of Gastroenterology.

[16]  Alexandre Hostettler,et al.  Live Tracking and Dense Reconstruction for Handheld Monocular Endoscopy , 2019, IEEE Transactions on Medical Imaging.