Multimodal Image Registration with Deep Context Reinforcement Learning

Automatic and robust registration between real-time patient imaging and pre-operative data (e.g. CT and MRI) is crucial for computer-aided interventions and AR-based navigation guidance. In this paper, we present a novel approach to automatically align range image of the patient with pre-operative CT images. Unlike existing approaches based on the surface similarity optimization process, our algorithm leverages the contextual information of medical images to resolve data ambiguities and improve robustness. The proposed algorithm is derived from deep reinforcement learning algorithm that automatically learns to extract optimal feature representation to reduce the appearance discrepancy between these two modalities. Quantitative evaluations on 1788 pairs of CT and depth images from real clinical setting demonstrate that the proposed method achieves the state-of-the-art performance.

[1]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[2]  Nikos Komodakis,et al.  A Deep Metric for Multimodal Registration , 2016, MICCAI.

[3]  Yaozong Gao,et al.  Learning-Based Multimodal Image Registration for Prostate Cancer Radiation Therapy , 2016, MICCAI.

[4]  Vivek Kumar Singh,et al.  Estimating a Patient Surface Model for Optimizing the Medical Scanning Workflow , 2014, MICCAI.

[5]  R. Bellman A Markovian Decision Process , 1957 .

[6]  Tom Schaul,et al.  Dueling Network Architectures for Deep Reinforcement Learning , 2015, ICML.

[7]  Dorin Comaniciu,et al.  An Artificial Agent for Anatomical Landmark Detection in Medical Images , 2016, MICCAI.

[8]  Nassir Navab,et al.  Learning Optimization Updates for Multimodal Registration , 2016, MICCAI.

[9]  Dorin Comaniciu,et al.  An Artificial Agent for Robust Image Registration , 2016, AAAI.

[10]  Hao Li,et al.  Depth Sensor-Based Realtime Tumor Tracking for Accurate Radiation Therapy , 2014, Eurographics.

[11]  Sergey Levine,et al.  Learning Hand-Eye Coordination for Robotic Grasping with Large-Scale Data Collection , 2016, ISER.

[12]  Wei Cai,et al.  A Kinect(™) camera based navigation system for percutaneous abdominal puncture. , 2016, Physics in medicine and biology.

[13]  Nassir Navab,et al.  Patient MoCap: Human Pose Estimation Under Blanket Occlusion for Hospital Monitoring Applications , 2016, MICCAI.

[14]  Sven Haase,et al.  Multi-modal surface registration for markerless initial patient setup in radiation therapy using microsoft's Kinect sensor , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[15]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[16]  R. Nachabe,et al.  Surgical Navigation Technology Based on Augmented Reality and Integrated 3D Intraoperative Imaging , 2016, Spine.

[17]  William M. Wells,et al.  Feature-Based Alignment of Volumetric Multi-modal Images , 2013, IPMI.