Style Transfer Enabled Sim2Real Framework for Efficient Learning of Robotic Ultrasound Image Analysis Using Simulated Data

Robotic ultrasound (US) systems have shown great potential to make US examinations easier and more accurate. Recently, various machine learning techniques have been proposed to realize automatic US image interpretation for robotic US acquisition tasks. However, obtaining large amounts of real US imaging data for training is usually expensive or even unfeasible in some clinical applications. An alternative is to build a simulator to generate synthetic US data for training, but the differences between simulated and real US images may result in poor model performance. This work presents a Sim2Real framework to efficiently learn robotic US image analysis tasks based only on simulated data for real-world deployment. A style transfer module is proposed based on unsupervised contrastive learning and used as a preprocessing step to convert the real US images into the simulation style. Thereafter, a task-relevant model is designed to combine CNNs with vision transformers to generate the task-dependent prediction with improved generalization ability. We demonstrate the effectiveness of our method in an image regression task to predict the probe position based on US images in robotic transesophageal echocardiography (TEE). Our results show that using only simulated US data and a small amount of unlabelled real data for training, our method can achieve comparable performance to semi-supervised and fully supervised learning methods. Moreover, the effectiveness of our previously proposed CT-based US image simulation method is also indirectly confirmed.

[1]  Ziqi Zhao,et al.  Closed-Loop Magnetic Manipulation for Robotic Transesophageal Echocardiography , 2023, IEEE Transactions on Robotics.

[2]  Keyu Li,et al.  Enabling Augmented Segmentation and Registration in Ultrasound-Guided Spinal Surgery via Realistic Ultrasound Synthesis from Diagnostic CT Volume , 2023, arXiv.org.

[3]  Mohammad Farid Azampour,et al.  CACTUSS: Common Anatomical CT-US Space for US examinations , 2022, MICCAI.

[4]  Ziqi Zhao,et al.  External and Internal Sensor Fusion Based Localization Strategy for 6-DOF Pose Estimation of a Magnetic Capsule Robot , 2022, IEEE Robotics and Automation Letters.

[5]  N. Navab,et al.  VesNet-RL: Simulation-Based Reinforcement Learning for Real-World US Probe Navigation , 2022, IEEE Robotics and Automation Letters.

[6]  D. Shen,et al.  Transformers in Medical Image Analysis: A Review , 2022, Intelligent Medicine.

[7]  Max Q.-H. Meng,et al.  Image-Guided Navigation of a Robotic Ultrasound Probe for Autonomous Spinal Sonography Using a Shadow-Aware Dual-Agent Framework , 2021, IEEE Transactions on Medical Robotics and Bionics.

[8]  M. Meng,et al.  A Virtual Scanning Framework for Robotic Spinal Sonography with Automatic Real-time Recognition of Standard Views , 2021, 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC).

[9]  Max Q.-H. Meng,et al.  An Overview of Systems and Techniques for Autonomous Robotic Ultrasound Acquisitions , 2021, IEEE Transactions on Medical Robotics and Bionics.

[10]  Max Q.-H. Meng,et al.  Autonomous Navigation of an Ultrasound Probe Towards Standard Scan Planes with Deep Reinforcement Learning , 2021, 2021 IEEE International Conference on Robotics and Automation (ICRA).

[11]  Hongen Liao,et al.  Autonomic Robotic Ultrasound Imaging System Based on Reinforcement Learning , 2021, IEEE Transactions on Biomedical Engineering.

[12]  D. Tao,et al.  A Survey on Vision Transformer , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  N. Navab,et al.  Autonomous Robotic Screening of Tubular Structures Based Only on Real-Time Ultrasound Imaging Feedback , 2020, IEEE Transactions on Industrial Electronics.

[14]  S. Gelly,et al.  An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale , 2020, ICLR.

[15]  Alexei A. Efros,et al.  Contrastive Learning for Unpaired Image-to-Image Translation , 2020, ECCV.

[16]  Nassir Navab,et al.  Ultrasound-Guided Robotic Navigation with Deep Reinforcement Learning , 2020, 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[17]  Dong Ni,et al.  Deep Learning in Medical Ultrasound Analysis: A Review , 2019, Engineering.

[18]  Fausto Milletari,et al.  Straight to the point: reinforcement learning for user guidance in ultrasound , 2019, SUSI/PIPPI@MICCAI.

[19]  Oriol Vinyals,et al.  Representation Learning with Contrastive Predictive Coding , 2018, ArXiv.

[20]  Qinghua Huang,et al.  Fully Automatic Three-Dimensional Ultrasound Imaging Based on Conventional B-Scan , 2018, IEEE Transactions on Biomedical Circuits and Systems.

[21]  Kaspar Althoefer,et al.  Robotic Ultrasound: View Planning, Tracking, and Automatic Acquisition of Transesophageal Echocardiography , 2016, IEEE Robotics & Automation Magazine.

[22]  Thomas Neff,et al.  Towards MRI-Based Autonomous Robotic US Acquisitions: A First Feasibility Study , 2016, IEEE Transactions on Medical Imaging.

[23]  Adam Piórkowski,et al.  The Transesophageal Echocardiography Simulator Based on Computed Tomography Images , 2013, IEEE Transactions on Biomedical Engineering.

[24]  Atsuo Takanishi,et al.  Implementation of an automatic scanning and detection algorithm for the carotid artery by an assisted-robotic measurement system , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[25]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[26]  H. Xiong,et al.  RL-TEE: Autonomous Probe Guidance for Transesophageal Echocardiography Based on Attention-Augmented Deep Reinforcement Learning , 2024, IEEE Transactions on Automation Science and Engineering.