Generative Localization With Uncertainty Estimation Through Video-CT Data for Bronchoscopic Biopsy

Robot-assisted endobronchial intervention requires accurate localization based on both intra- and pre-operative data. Most existing methods achieve this by registering 2D videos with 3D CT models according to a defined similarity metric with local features. Instead, we formulate the bronchoscopic localization as a learning-based global localisation using deep neural networks. The proposed network consists of two generative architectures and one auxiliary learning component. The cycle generative architecture bridges the domain variance between the real bronchoscopic videos and virtual views derived from pre-operative CT data so that the proposed approach can be trained through a large number of generated virtual images but deployed through real images. The auxiliary learning architecture leverages complementary relative pose regression to constrain the search space, ensuring consistent global pose predictions. Most importantly, the uncertainty of each global pose is obtained through variational inference by sampling within the learned underlying probability distribution. Detailed validation results demonstrate the localization accuracy with reasonable uncertainty achieved and its potential clinical value. A demonstration video demo can be found on the website https://youtu.be/ci9LMY49aF8.

[1]  Roberto Cipolla,et al.  PoseNet: A Convolutional Network for Real-Time 6-DOF Camera Relocalization , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[2]  Li Sun,et al.  A fully end-to-end deep learning approach for real-time simultaneous 3D reconstruction and material recognition , 2017, 2017 18th International Conference on Advanced Robotics (ICAR).

[3]  William E. Higgins,et al.  Construction of a multimodal CT-video chest model , 2014, Medical Imaging.

[4]  Guang-Zhong Yang,et al.  Robust camera localisation with depth reconstruction for bronchoscopic navigation , 2015, International Journal of Computer Assisted Radiology and Surgery.

[5]  Li Sun,et al.  Learning Monocular Visual Odometry with Dense 3D Mapping from Dense 3D Flow , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[6]  Guido Gerig,et al.  User-guided 3D active contour segmentation of anatomical structures: Significantly improved efficiency and reliability , 2006, NeuroImage.

[7]  Wolfram Burgard,et al.  Deep Auxiliary Learning for Visual Localization and Odometry , 2018, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[8]  Debora Gil,et al.  Towards a Videobronchoscopy Localization System from Airway Centre Tracking , 2017, VISIGRAPP.

[9]  Faisal Mahmood,et al.  Unsupervised Reverse Domain Adaptation for Synthetic Medical Images via Adversarial Training , 2017, IEEE Transactions on Medical Imaging.

[10]  Debora Gil,et al.  On-Line Lumen Centre Detection in Gastrointestinal and Respiratory Endoscopy , 2013, CLIP.

[11]  拓海 杉山,et al.  “Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks”の学習報告 , 2017 .

[12]  Toshimitsu Kaneko,et al.  Deep monocular 3D reconstruction for assisted navigation in bronchoscopy , 2017, International Journal of Computer Assisted Radiology and Surgery.

[13]  Guang-Zhong Yang,et al.  Patient-specific bronchoscope simulation with pq-space-based 2D/3D registration , 2004, Computer aided surgery : official journal of the International Society for Computer Aided Surgery.

[14]  Guang-Zhong Yang,et al.  Context-Aware Depth and Pose Estimation for Bronchoscopic Navigation , 2019, IEEE Robotics and Automation Letters.

[15]  Li Sun,et al.  Dense RGB-D Semantic Mapping with Pixel-Voxel Neural Network , 2017, Sensors.

[16]  Eric J. Seibel,et al.  In Vivo Validation of a Hybrid Tracking System for Navigation of an Ultrathin Bronchoscope Within Peripheral Airways , 2010, IEEE Transactions on Biomedical Engineering.

[17]  Guang-Zhong Yang,et al.  BRANCH: Bifurcation Recognition for Airway Navigation based on struCtural cHaracteristics , 2017, MICCAI.

[18]  Raymond Y. K. Lau,et al.  Least Squares Generative Adversarial Networks , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[19]  Mark Sandler,et al.  MobileNetV2: Inverted Residuals and Linear Bottlenecks , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[20]  David Eng,et al.  OffsetNet: Deep Learning for Localization in the Lung using Rendered Images , 2018, 2019 International Conference on Robotics and Automation (ICRA).

[21]  Li Sun,et al.  Learning Kalman Network: A deep monocular visual odometry for on-road driving , 2019, Robotics Auton. Syst..

[22]  Kensaku Mori,et al.  A Discriminative Structural Similarity Measure and its Application to Video-Volume Registration for Endoscope Three-Dimensional Motion Tracking , 2014, IEEE Transactions on Medical Imaging.