Autonomous Driving in the Lung using Deep Learning for Localization

Lung cancer is the leading cause of cancer-related death worldwide, and early diagnosis is critical to improving patient outcomes. To diagnose cancer, a highly trained pulmonologist must navigate a flexible bronchoscope deep into the branched structure of the lung for biopsy. The biopsy fails to sample the target tissue in 26-33% of cases largely because of poor registration with the preoperative CT map. To improve intraoperative registration, we develop two deep learning approaches to localize the bronchoscope in the preoperative CT map based on the bronchoscopic video in real-time, called AirwayNet and BifurcationNet. The networks are trained entirely on simulated images derived from the patient-specific CT. When evaluated on recorded bronchoscopy videos in a phantom lung, AirwayNet outperforms other deep learning localization algorithms with an area under the precision-recall curve of 0.97. Using AirwayNet, we demonstrate autonomous driving in the phantom lung based on video feedback alone. The robot reaches four targets in the left and right lungs in 95% of the trials. On recorded videos in eight human cadaver lungs, AirwayNet achieves areas under the precision-recall curve ranging from 0.82 to 0.997.

[1]  William E. Higgins,et al.  Combined video tracking and image-video registration for continuous bronchoscopic guidance , 2008, International Journal of Computer Assisted Radiology and Surgery.

[2]  Toshimitsu Kaneko,et al.  Deep monocular 3D reconstruction for assisted navigation in bronchoscopy , 2017, International Journal of Computer Assisted Radiology and Surgery.

[3]  William E. Higgins,et al.  Interactive CT-Video Registration for the Continuous Guidance of Bronchoscopy , 2013, IEEE Transactions on Medical Imaging.

[4]  Rosa M. Estrada-Y.-Martin,et al.  Diagnostic Yield and Complications of Bronchoscopy for Peripheral Lung Lesions. Results of the AQuIRE Registry. , 2016, American journal of respiratory and critical care medicine.

[5]  Antoni Rosell,et al.  Motion planning for the Virtual Bronchoscopy , 2012, 2012 IEEE International Conference on Robotics and Automation.

[6]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Yihong Wu,et al.  Image-based camera localization: an overview , 2016, Visual Computing for Industry, Biomedicine, and Art.

[8]  Andreas Geiger,et al.  Are we ready for autonomous driving? The KITTI vision benchmark suite , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  W. Quarz [Transthoracic needle biopsy of the lung]. , 1970, Pneumonologie. Pneumonology.

[10]  Hedyeh Rafii-Tari,et al.  Current and Emerging Robot-Assisted Endovascular Catheterization Technologies: A Review , 2013, Annals of Biomedical Engineering.

[11]  Jake Sganga,et al.  Orientation estimation of a continuum manipulator in a phantom lung , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[12]  L. Tanoue,et al.  Lung cancer: epidemiology, etiology, and prevention. , 2011, Clinics in chest medicine.

[13]  Jakub W. Pachocki,et al.  Learning dexterous in-hand manipulation , 2018, Int. J. Robotics Res..

[14]  David Eng,et al.  OffsetNet: Deep Learning for Localization in the Lung using Rendered Images , 2018, 2019 International Conference on Robotics and Automation (ICRA).

[15]  F. Lindseth,et al.  Navigated Bronchoscopy: A Technical Review , 2014, Journal of bronchology & interventional pulmonology.

[16]  Xiaowei Zhou,et al.  Sparseness Meets Deepness: 3D Human Pose Estimation from Monocular Video , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  D. Mollura,et al.  Segmentation and Image Analysis of Abnormal Lungs at CT: Current Approaches, Challenges, and Future Trends. , 2015, Radiographics : a review publication of the Radiological Society of North America, Inc.

[18]  G. Fagogenis,et al.  Autonomous robotic intracardiac catheter navigation using haptic vision , 2019, Science Robotics.

[19]  Kensaku Mori,et al.  A Discriminative Structural Similarity Measure and its Application to Video-Volume Registration for Endoscope Three-Dimensional Motion Tracking , 2014, IEEE Transactions on Medical Imaging.

[20]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[21]  Wojciech Zaremba,et al.  Domain randomization for transferring deep neural networks from simulation to the real world , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[22]  Debora Gil,et al.  Towards a Videobronchoscopy Localization System from Airway Centre Tracking , 2017, VISIGRAPP.

[23]  William E. Higgins,et al.  Construction of a multimodal CT-video chest model , 2014, Medical Imaging.

[24]  Erlend Fagertun Hofstad,et al.  Automatic registration of CT images to patient during the initial phase of bronchoscopy: a clinical pilot study. , 2014, Medical physics.

[25]  Guang-Zhong Yang,et al.  BRANCH: Bifurcation Recognition for Airway Navigation based on struCtural cHaracteristics , 2017, MICCAI.