Multilateral surgical pattern cutting in 2D orthotropic gauze with deep reinforcement learning policies for tensioning

In the Fundamentals of Laparoscopic Surgery (FLS) standard medical training regimen, the Pattern Cutting task requires residents to demonstrate proficiency by maneuvering two tools, surgical scissors and tissue gripper, to accurately cut a circular pattern on surgical gauze suspended at the corners. Accuracy of cutting depends on tensioning, wherein the gripper pinches a point on the gauze in R3 and pulls to induce and maintain tension in the material as cutting proceeds. An automated tensioning policy maps the current state of the gauze to output a direction of pulling as an action. The optimal tensioning policy depends on both the choice of pinch point and cutting trajectory. We explore the problem of learning a tensioning policy conditioned on specific cutting trajectories. Every timestep, we allow the gripper to react to the deformation of the gauze and progress of the cutting trajectory with a translation unit vector along an allowable set of directions. As deformation is difficult to analytically model and explicitly observe, we leverage deep reinforcement learning with direct policy search methods to learn tensioning policies using a finite-element simulator and then transfer them to a physical system. We compare the Deep RL tensioning policies with fixed and analytic (opposing the error vector with a fixed pinch point) policies on a set of 17 open and closed curved contours in simulation and 4 patterns in physical experiments with the da Vinci Research Kit (dVRK). Our simulation results suggest that learning to tension with Deep RL can significantly improve performance and robustness to noise and external forces.

[1]  Sarah F. Frisken Using Linked Volumes to Model Object Collisions, Deformation, Cutting, Carving, and Joining , 1999, IEEE Trans. Vis. Comput. Graph..

[2]  S. F. Frisken-Gibson Using linked volumes to model object collisions, deformation, cutting, carving, and joining , 1999 .

[3]  Han-Wen Nienhuys,et al.  A Surgery Simulation Supporting Cuts and Finite Element Deformation , 2001, MICCAI.

[4]  Hui Zhang,et al.  On cutting and dissection of virtual deformable objects , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[5]  R. Veldkamp,et al.  Laparoscopic surgery versus open surgery for colon cancer : short-term outcomes of a randomised trial , 2022 .

[6]  Pieter Abbeel,et al.  Using inaccurate models in reinforcement learning , 2006, ICML.

[7]  Daniel J Scott,et al.  Design of a Proficiency-Based Skills Training Curriculum for the Fundamentals of Laparoscopic Surgery , 2007, Surgical innovation.

[8]  Kenneth Y. Goldberg,et al.  Motion Planning in Medicine: Optimization and Simulation Algorithms for Image-Guided Procedures , 2008, Springer Tracts in Advanced Robotics.

[9]  Darius Burschka,et al.  Automation of Manual Tasks for Minimally Invasive Surgery , 2008, Fourth International Conference on Autonomic and Autonomous Systems (ICAS'08).

[10]  Allison M. Okamura,et al.  Modeling the Forces of Cutting With Scissors , 2008, IEEE Transactions on Biomedical Engineering.

[11]  Russell H. Taylor,et al.  Medical robotics in computer-integrated surgery , 2003, IEEE Trans. Robotics Autom..

[12]  James F. O'Brien,et al.  Interactive simulation of surgical needle insertion and steering , 2009, ACM Trans. Graph..

[13]  Brett Browning,et al.  A survey of robot learning from demonstration , 2009, Robotics Auton. Syst..

[14]  Henk Nijmeijer,et al.  Robot Programming by Demonstration , 2010, SIMPAR.

[15]  Pieter Abbeel,et al.  Superhuman performance of surgical tasks by robots using iterative learning from human-guided demonstrations , 2010, 2010 IEEE International Conference on Robotics and Automation.

[16]  Gregory D. Hager,et al.  Motion generation of robotic surgical tasks: Learning from expert demonstrations , 2010, 2010 Annual International Conference of the IEEE Engineering in Medicine and Biology.

[17]  Dejan Pangercic,et al.  Robotic roommates making pancakes , 2011, 2011 11th IEEE-RAS International Conference on Humanoid Robots.

[18]  Stefano Carpin,et al.  Combining imitation and reinforcement learning to fold deformable planar objects , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[19]  K. M. Deliparaschos,et al.  Evolution of autonomous and semi‐autonomous robotic surgical systems: a review of the literature , 2011, The international journal of medical robotics + computer assisted surgery : MRCAS.

[20]  Ryan A. Beasley Medical Robots: Current Systems and Research Directions , 2012, J. Robotics.

[21]  P. Jiménez,et al.  Survey on model-based manipulation planning of deformable objects , 2012 .

[22]  Jan Peters,et al.  Reinforcement learning in robotics: A survey , 2013, Int. J. Robotics Res..

[23]  J. Kaouk,et al.  Fundamental skills of robotic surgery: a multi-institutional randomized controlled trial for validation of a simulation-based curriculum. , 2013, Urology.

[24]  Ankush Gupta,et al.  A case study of trajectory transfer through non-rigid registration for a simplified suturing scenario , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[25]  Peter Kazanzides,et al.  An open-source research kit for the da Vinci® Surgical System , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[26]  Mamoru Mitsuishi,et al.  Online Trajectory Planning in Dynamic Environments for Surgical Task Automation , 2014, Robotics: Science and Systems.

[27]  Sergey Levine,et al.  Policy Learning with Continuous Memory States for Partially Observed Robotic Control , 2015, ArXiv.

[28]  Jonathan P. How,et al.  Real-World Reinforcement Learning via Multifidelity Simulators , 2015, IEEE Transactions on Robotics.

[29]  Honglak Lee,et al.  Deep learning for detecting robotic grasps , 2013, Int. J. Robotics Res..

[30]  Sergey Levine,et al.  Trust Region Policy Optimization , 2015, ICML.

[31]  Pieter Abbeel,et al.  Learning by observation for surgical subtasks: Multilateral cutting of 3D viscoelastic and 2D Orthotropic Tissue Phantoms , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[32]  Pieter Abbeel,et al.  Benchmarking Deep Reinforcement Learning for Continuous Control , 2016, ICML.

[33]  Stuart M. Gale,et al.  Patterning of tensile fabric structures with a discrete element model using dynamic relaxation , 2016 .

[34]  Brijen Thananjeyan,et al.  SWIRL: A SequentialWindowed Inverse Reinforcement Learning Algorithm for Robot Tasks With Delayed Rewards , 2016, Workshop on the Algorithmic Foundations of Robotics.

[35]  Jonathan P. How,et al.  Autonomous drifting using simulation-aided reinforcement learning , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[36]  Sergey Levine,et al.  End-to-End Training of Deep Visuomotor Policies , 2015, J. Mach. Learn. Res..

[37]  Kenneth Y. Goldberg,et al.  Automating multi-throw multilateral surgical suturing with a mechanical needle guide and sequential convex optimization , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[38]  Kenneth Y. Goldberg,et al.  Tumor localization using automated palpation with Gaussian Process Adaptive Sampling , 2016, 2016 IEEE International Conference on Automation Science and Engineering (CASE).

[39]  Brijen Thananjeyan,et al.  SWIRL: A sequential windowed inverse reinforcement learning algorithm for robot tasks with delayed rewards , 2018, Int. J. Robotics Res..