Deformable structure from motion by fusing visual and inertial measurement data

Accurate recovery of the 3D structure of a deforming surgical environment during minimally invasive surgery is important for intra-operative guidance. One key component of reliable reconstruction is accurate camera pose estimation, which is challenging for monocular cameras due to the paucity of reliable salient features, coupled with narrow baseline during surgical navigation. With recent advances in miniaturized MEMS sensors, the combination of inertial and vision sensing can provide increased robustness for camera pose estimation particularly for scenes involving tissue deformation. The aim of this work is to propose a robust framework for intra-operative free-form deformation recovery based on structure-from-motion. A novel adaptive Unscented Kalman Filter (UKF) parameterization scheme is proposed to fuse vision information with data from an Inertial Measurement Unit (IMU). The method is built on a compact scene representation scheme suitable for both surgical episode identification and instrument-tissue motion modelling. Detailed validation with both synthetic and phantom data is performed and results derived justify the potential clinical value of the technique.

[1]  Agostino Martinelli,et al.  Vision and IMU Data Fusion: Closed-Form Solutions for Attitude, Speed, Absolute Scale, and Bias Determination , 2012, IEEE Transactions on Robotics.

[2]  Guang-Zhong Yang,et al.  Content-Based Surgical Workflow Representation Using Probabilistic Motion Modeling , 2010, MIAR.

[3]  Gerd Hirzinger,et al.  An approach to ulta-tightly coupled data fusion for handheld input devices in robotic surgery , 2011, 2011 IEEE International Conference on Robotics and Automation.

[4]  Markus Vincze,et al.  Simultaneous Motion and Structure Estimation by Fusion of Inertial and Vision Data , 2007, Int. J. Robotics Res..

[5]  Stergios I. Roumeliotis,et al.  Two Efficient Solutions for Visual Odometry Using Directional Correspondence , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Michael Figl,et al.  Non-rigid Reconstruction of the Beating Heart Surface for Minimally Invasive Cardiac Surgery , 2009, MICCAI.

[7]  Roland Siegwart,et al.  Robust embedded egomotion estimation , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[8]  Huosheng Hu,et al.  Integration of Vision and Inertial Sensors for 3D Arm Motion Tracking in Home-based Rehabilitation , 2007, Int. J. Robotics Res..

[9]  Roland Siegwart,et al.  Robust Real-Time Visual Odometry with a Single Camera and an IMU , 2011, BMVC.

[10]  Guang-Zhong Yang,et al.  Affine-invariant anisotropic detector for soft tissue tracking in minimally invasive surgery , 2009, 2009 IEEE International Symposium on Biomedical Imaging: From Nano to Macro.

[11]  Roland Siegwart,et al.  Fusion of IMU and Vision for Absolute Scale Estimation in Monocular SLAM , 2011, J. Intell. Robotic Syst..

[12]  Alonzo Kelly,et al.  A new approach to vision-aided inertial navigation , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[13]  Guang-Zhong Yang,et al.  Motion Compensated SLAM for Image Guided Surgery , 2010, MICCAI.

[14]  Éric Marchand,et al.  Improving monocular plane-based SLAM with inertial measures , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[15]  David Nistér,et al.  An efficient solution to the five-point relative pose problem , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[16]  Marc Pollefeys,et al.  A Minimal Case Solution to the Calibrated Relative Pose Problem for the Case of Two Known Orientation Angles , 2010, ECCV.

[17]  Roland Siegwart,et al.  Collaborative stereo , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[18]  Jorge Dias,et al.  Vision and Inertial Sensor Cooperation Using Gravity as a Vertical Reference , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  Guang-Zhong Yang,et al.  Tissue Deformation Recovery with Gaussian Mixture Model Based Structure from Motion , 2011, AE-CAI.

[20]  Daniel Mirota,et al.  Toward Video-Based Navigation for Endoscopic Endonasal Skull Base Surgery. , 2009, Medical image computing and computer-assisted intervention : MICCAI ... International Conference on Medical Image Computing and Computer-Assisted Intervention.

[21]  Vincent Lepetit,et al.  Accurate Non-Iterative O(n) Solution to the PnP Problem , 2007, 2007 IEEE 11th International Conference on Computer Vision.