论文信息 - Instrument Tracking via Online Learning in Retinal Microsurgery

Instrument Tracking via Online Learning in Retinal Microsurgery

Robust visual tracking of instruments is an important task in retinal microsurgery. In this context, the instruments are subject to a large variety of appearance changes due to illumination and other changes during a procedure, which makes the task very challenging. Most existing methods require collecting a sufficient amount of labelled data and yet perform poorly in handling appearance changes that are unseen in training data. To address these problems, we propose a new approach for robust instrument tracking. Specifically, we adopt an online learning technique that collects appearance samples of instruments on the fly and gradually learns a target-specific detector. Online learning enables the detector to reinforce its model and become more robust over time. The performance of the proposed method has been evaluated on a fully annotated dataset of retinal instruments in in-vivo retinal microsurgery and on a laparoscopy image sequence. In all experimental results, our proposed tracking approach shows superior performance compared to several other state-of-the-art approaches.

Junzhou Huang | Chen Chen | Xiaolei Huang | Yeqing Li

[1] Darius Burschka,et al. Navigating inner space: 3-D assistance for minimally invasive surgery , 2005, Robotics Auton. Syst..

[2] Junzhou Huang,et al. Robust tracking using local sparse appearance model and K-selection , 2011, CVPR 2011.

[3] Thomas Deselaers,et al. ClassCut for Unsupervised Class Segmentation , 2010, ECCV.

[4] Gregory D. Hager,et al. Articulated object tracking by rendering consistent appearance parts , 2009, 2009 IEEE International Conference on Robotics and Automation.

[5] Selim Benhimane,et al. Homography-based 2D Visual Tracking and Servoing , 2007, Int. J. Robotics Res..

[6] Russell H. Taylor,et al. Data-Driven Visual Tracking in Retinal Microsurgery , 2012, MICCAI.

[7] Zdenek Kalal,et al. Tracking-Learning-Detection , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8] Hervé Delingette,et al. Medical Image Computing and Computer-Assisted Intervention – MICCAI 2012 , 2012, Lecture Notes in Computer Science.

[9] Russell H. Taylor,et al. Visual Tracking of Surgical Tools for Proximity Detection in Retinal Surgery , 2011, IPCAI.

[10] Vincent Lepetit,et al. Fast Keypoint Recognition in Ten Lines of Code , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[11] Mark Jenkinson,et al. Non-local Shape Descriptor: A New Similarity Metric for Deformable Multi-modal Registration , 2011, MICCAI.

[12] Russell H. Taylor,et al. Unified Detection and Tracking of Instruments during Retinal Microsurgery , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13] Russell H. Taylor,et al. Unified Detection and Tracking in Retinal Microsurgery , 2011, MICCAI.

[14] Jiri Matas,et al. Forward-Backward Error: Automatic Detection of Tracking Failures , 2010, 2010 20th International Conference on Pattern Recognition.

[15] Horst Bischof,et al. Real-Time Tracking via On-line Boosting , 2006, BMVC.

[16] Russell H. Taylor,et al. Information Processing in Computer-Assisted Interventions - Second International Conference, IPCAI 2011, Berlin, Germany, June 22, 2011. Proceedings , 2011, IPCAI.

[17] Paul A. Viola,et al. Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[18] Ming-Hsuan Yang,et al. Visual tracking with online Multiple Instance Learning , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[19] Junzhou Huang,et al. Robust and Fast Collaborative Tracking with Two Stage Sparse Optimization , 2010, ECCV.

[20] Mark R. Pickering,et al. A new multi-modal similarity measure for fast gradient-based 2D-3D image registration , 2009, 2009 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[21] Simon Baker,et al. Lucas-Kanade 20 Years On: A Unifying Framework , 2004, International Journal of Computer Vision.