Accurate and robust feature description and dense point-wise matching based on feature fusion for endoscopic images

Despite rapid advances in augmented reality (AR) and mixed reality (MR) for minimally invasive surgery (MIS), monocular 2D/3D reconstruction remains technically challenging in AR/MR-guided surgical navigation. Soft tissue surfaces are typically smooth and moist, with sparse texture, specular reflections, and frequent deformation. As a result, conventional image processing methods often yield only sparse feature points, which in turn lead to incorrect matches. To address this, we present an accurate and robust description and matching method for dense feature points in endoscopic videos. The method first extracts contours from the low-rank image sequences produced by adaptive robust principal component analysis (RPCA) decomposition. We then propose a multi-scale dense geometric feature description approach that simultaneously extracts dense feature descriptors of the contours in the original Euclidean coordinate space, the accompanying 3D color coordinate space, and the derived curvature-gradient coordinate space. Finally, we devise a new algorithm for both global and local point-wise matching based on feature fusion. For global matching, we employ the fast Fourier transform (FFT) to reduce the dimension of the dense feature descriptors. For local feature point matching, to improve robustness and accuracy, we cluster multiple contour points into "super-points" based on their dense feature descriptors and spatio-temporal continuity. Comprehensive experimental results confirm that our approach overcomes the influence of specular highlights and robustly describes contours in image sequences of soft tissue surfaces. Compared with state-of-the-art feature point description and matching methods, our framework shows advantages in both robustness and accuracy for dense point-wise matching, even under severe soft tissue deformation. We expect the approach to have high potential for 2D/3D reconstruction in endoscopy.
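
To illustrate how a Fourier transform can shorten a dense contour descriptor for global matching, the following is a minimal sketch and not the method's actual implementation. It assumes that one descriptor channel is sampled along a closed contour and that only the magnitudes of the lowest non-DC frequencies are kept; the abstract does not state which coefficients are retained. The names `dft` and `fourier_signature` are hypothetical, and a plain DFT from the standard library stands in for an FFT routine (same result, only slower).

```python
import cmath
import math


def dft(values):
    """Naive discrete Fourier transform (stand-in for an FFT routine)."""
    n = len(values)
    return [
        sum(values[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def fourier_signature(channel, keep):
    """Reduce a per-contour-point descriptor channel to `keep` numbers.

    Assumption (not from the source): keep the magnitudes of the `keep`
    lowest non-DC frequencies. Magnitudes do not change when the contour's
    starting point is shifted, which suits closed contours.
    """
    if not 0 < keep < len(channel):
        raise ValueError("keep must be between 1 and len(channel) - 1")
    coeffs = dft(channel)
    return [abs(c) for c in coeffs[1:keep + 1]]


# Hypothetical descriptor channel sampled at 16 contour points.
n_points = 16
channel = [
    math.sin(2 * math.pi * i / n_points) + 0.5 * math.cos(4 * math.pi * i / n_points)
    for i in range(n_points)
]

signature = fourier_signature(channel, keep=4)                     # 16 values -> 4 values
shifted_signature = fourier_signature(channel[5:] + channel[:5], keep=4)
```

In this sketch, two contours could be compared globally by a distance between their short signatures, and because `signature` and `shifted_signature` agree up to rounding, the comparison does not depend on where contour sampling starts.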
