Single-Shot Pose Estimation of Surgical Robot Instruments’ Shafts from Monocular Endoscopic Images

Surgical robots are used to perform minimally invasive surgery and alleviate much of the burden imposed on surgeons. Our group has developed a surgical robot to aid in the removal of tumors at the base of the skull via access through the nostrils. To avoid injuring the patients, a collision-avoidance algorithm that depends on having an accurate model for the poses of the instruments’ shafts is used. Given that the model’s parameters can change over time owing to interactions between instruments and other disturbances, the online estimation of the poses of the instrument’s shaft is essential. In this work, we propose a new method to estimate the pose of the surgical instruments’ shafts using a monocular endoscope. Our method is based on the use of an automatically annotated training dataset and an improved pose-estimation deep-learning architecture. In preliminary experiments, we show that our method can surpass state of the art vision-based marker-less pose estimation techniques (providing an error decrease of 55% in position estimation, 64% in pitch, and 69% in yaw) by using artificial images.

[1]  Luc Van Gool,et al.  The 2005 PASCAL Visual Object Classes Challenge , 2005, MLCW.

[2]  Sergey Ioffe,et al.  Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning , 2016, AAAI.

[3]  Zoltan-Csaba Marton,et al.  Implicit 3D Orientation Learning for 6D Object Detection from RGB Images , 2018, ECCV.

[4]  Sébastien Ourselin,et al.  Image Based Surgical Instrument Pose Estimation with Multi-class Labelling and Optical Flow , 2015, MICCAI.

[5]  Guang-Zhong Yang,et al.  Real-Time 3D Tracking of Articulated Tools for Robotic Surgery , 2016, MICCAI.

[6]  Fanny Ficuciello,et al.  Vision-Based Dynamic Virtual Fixtures for Tools Collision Avoidance in Robotic Surgery , 2020, IEEE Robotics and Automation Letters.

[7]  Hongliang Ren,et al.  Tubular Enhanced Geodesic Active Contours for continuum robot detection using 3D ultrasound , 2012, 2012 IEEE International Conference on Robotics and Automation.

[8]  Peter Kazanzides,et al.  A Unified Framework for the Teleoperation of Surgical Robots in Constrained Workspaces , 2019, 2019 International Conference on Robotics and Automation (ICRA).

[9]  Shahram Payandeh,et al.  Visual Tracking of Laparoscopic Instruments , 2014 .

[10]  Ran Hao,et al.  Vision-Based Surgical Tool Pose Estimation for the da Vinci® Robotic Surgical System , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[11]  Austin Reiter,et al.  Appearance learning for 3D tracking of robotic surgical tools , 2014, Int. J. Robotics Res..

[12]  Hongliang Ren,et al.  Real-Time 6DOF Pose Estimation of Endoscopic Instruments Using Printable Markers , 2019, IEEE Sensors Journal.

[13]  Mamoru Mitsuishi,et al.  Robust Visual Tracking of Robotic Forceps Under a Microscope Using Kinematic Data Fusion , 2014, IEEE/ASME Transactions on Mechatronics.

[14]  Mamoru Mitsuishi,et al.  Dynamic Active Constraints for Surgical Robots Using Vector-Field Inequalities , 2019, IEEE Transactions on Robotics.

[15]  Mamoru Mitsuishi,et al.  SmartArm: Integration and validation of a versatile surgical robotic system for constrained workspaces , 2020, The international journal of medical robotics + computer assisted surgery : MRCAS.

[16]  Takuya Maekawa,et al.  Preliminary Evaluation of a Framework for Overhead Skeleton Tracking in Factory Environments using Kinect , 2017, iWOAR.

[17]  Pierre E. Dupont,et al.  Real-time adaptive kinematic model estimation of concentric tube robots , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[18]  D. Stoyanov,et al.  3-D Pose Estimation of Articulated Instruments in Robotic Minimally Invasive Surgery , 2018, IEEE Transactions on Medical Imaging.

[19]  Wei Liu,et al.  SSD: Single Shot MultiBox Detector , 2015, ECCV.

[20]  Nassir Navab,et al.  SSD-6D: Making RGB-Based 3D Detection and 6D Pose Estimation Great Again , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).