Toward Synergic Learning for Autonomous Manipulation of Deformable Tissues via Surgical Robots: An Approximate Q-Learning Approach

In this paper, we present a synergic learning algorithm to address the task of indirect manipulation of an unknown deformable tissue. Tissue manipulation is a common yet challenging task in various surgical interventions, which makes it a good candidate for robotic automation. We propose using a linear approximate Q-learning method in which human knowledge contributes to selecting useful yet simple features of tissue manipulation while the algorithm learns to take optimal actions and accomplish the task. The algorithm is implemented and evaluated on a simulation using the OpenCV and CHAI3D libraries. Successful simulation results for four different configurations which are based on realistic tissue manipulation scenarios are presented. Results indicate that with a careful selection of relatively simple and intuitive features, the developed Q-learning algorithm can successfully learn an optimal policy without any prior knowledge of tissue dynamics or camera intrinsic/extrinsic calibration parameters.

[1]  Eduardo F. Morales,et al.  An Introduction to Reinforcement Learning , 2011 .

[2]  Ji Ma,et al.  Autonomous suturing via surgical robot: An algorithm for optimal selection of needle diameter, shape, and path , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[3]  Zerui Wang,et al.  Toward Semi-autonomous Cryoablation of Kidney Tumors via Model-Independent Deformable Tissue Manipulation Technique , 2018, Annals of Biomedical Engineering.

[4]  Mehran Armand,et al.  On the Use of a Continuum Manipulator and a Bendable Medical Screw for Minimally Invasive Interventions in Orthopedic Surgery , 2019, IEEE Transactions on Medical Robotics and Bionics.

[5]  Zerui Wang,et al.  A Robust Data-Driven Approach for Online Learning and Manipulation of Unmodeled 3-D Heterogeneous Compliant Objects , 2018, IEEE Robotics and Automation Letters.

[6]  Punit Pandey,et al.  Approximate Q-Learning: An Introduction , 2010, 2010 Second International Conference on Machine Learning and Computing.

[7]  Sergey Levine,et al.  End-to-End Training of Deep Visuomotor Policies , 2015, J. Mach. Learn. Res..

[8]  Russell H. Taylor,et al.  Spatial Motion Constraints for Robot Assisted Suturing Using Virtual Fixtures , 2005, MICCAI.

[9]  M. Pauline Baker,et al.  Computer graphics with OpenGL , 1986 .

[10]  L. Sentis,et al.  The CHAI Libraries , 2003 .

[11]  Kenneth Y. Goldberg,et al.  Automating multi-throw multilateral surgical suturing with a mechanical needle guide and sequential convex optimization , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[12]  Blake Hannaford,et al.  Raven-II: An Open Platform for Surgical Robotics Research , 2013, IEEE Transactions on Biomedical Engineering.

[13]  Leslie N. Smith,et al.  Cyclical Learning Rates for Training Neural Networks , 2015, 2017 IEEE Winter Conference on Applications of Computer Vision (WACV).

[14]  Ben J. A. Kröse,et al.  Learning from delayed rewards , 1995, Robotics Auton. Syst..

[15]  Iulian Iordachita,et al.  SCADE: Simultaneous Sensor Calibration and Deformation Estimation of FBG-Equipped Unmodeled Continuum Manipulators , 2020, IEEE Transactions on Robotics.

[16]  Ji Ma,et al.  Learning Soft Tissue Dynamics in Image Space for Automated Bimanual Tissue Manipulation with Surgical Robots , 2019, ArXiv.

[17]  P. Jiménez,et al.  Survey on model-based manipulation planning of deformable objects , 2012 .

[18]  Yasemin Altun,et al.  Relative Entropy Policy Search , 2010 .

[19]  Yunhui Liu,et al.  Automatic 3-D Manipulation of Soft Objects by Robotic Arms With an Adaptive Deformation Model , 2016, IEEE Transactions on Robotics.

[20]  P. Brett,et al.  An autonomous surgical robot for drilling a cochleostomy: preliminary porcine trial , 2008, Clinical otolaryngology : official journal of ENT-UK ; official journal of Netherlands Society for Oto-Rhino-Laryngology & Cervico-Facial Surgery.

[21]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[22]  Pieter Abbeel,et al.  Learning by observation for surgical subtasks: Multilateral cutting of 3D viscoelastic and 2D Orthotropic Tissue Phantoms , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[23]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[24]  Shinichi Hirai,et al.  Indirect simultaneous positioning of deformable objects with multi-pinching fingers based on an uncertain model , 2000, Robotica.

[25]  Zerui Wang,et al.  Autonomous Data-Driven Manipulation of Unknown Anisotropic Deformable Tissues Using Unmodelled Continuum Manipulators , 2019, IEEE Robotics and Automation Letters.

[26]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[27]  Jacob Rosen,et al.  Autonomous Tissue Manipulation via Surgical Robot Using Learning Based Model Predictive Control , 2019, 2019 International Conference on Robotics and Automation (ICRA).

[28]  Sergey Levine,et al.  Continuous Deep Q-Learning with Model-based Acceleration , 2016, ICML.

[29]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30]  Jan Peters,et al.  Reinforcement learning in robotics: A survey , 2013, Int. J. Robotics Res..

[31]  T. J. Rivlin An Introduction to the Approximation of Functions , 2003 .

[32]  Ryan S. Decker,et al.  Supervised autonomous robotic soft tissue surgery , 2016, Science Translational Medicine.

[33]  Jan Peters,et al.  A Survey on Policy Search for Robotics , 2013, Found. Trends Robotics.

[34]  Alex Graves,et al.  Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.