An artificial neural network architecture for non-parametric visual odometry in wireless capsule endoscopy

Wireless capsule endoscopy is a non-invasive screening procedure of the gastrointestinal (GI) tract performed with an ingestible capsule endoscope (CE) of the size of a large vitamin pill. Such endoscopes are equipped with a usually low-frame-rate color camera which enables the visualization of the GI lumen and the detection of pathologies. The localization of the commercially available CEs is performed in the 3D abdominal space using radio-frequency (RF) triangulation from external sensor arrays, in combination with transit time estimation. State-of-the-art approaches, such as magnetic localization, which have been experimentally proved more accurate than the RF approach, are still at an early stage. Recently, we have demonstrated that CE localization is feasible using solely visual cues and geometric models. However, such approaches depend on camera parameters, many of which are unknown. In this paper the authors propose a novel non-parametric visual odometry (VO) approach to CE localization based on a feed-forward neural network architecture. The effectiveness of this approach in comparison to state-of-the-art geometric VO approaches is validated using a robotic-assisted in vitro experimental setup.

[1]  Kaveh Pahlavan,et al.  Geometric estimation of intestinal contraction for motion tracking of video capsule endoscope , 2014, Medical Imaging.

[2]  Peter Xiaoping Liu,et al.  Image distortion correction for wireless capsule endoscope , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[3]  Weihua Li,et al.  A Review of Localization Systems for Robotic Endoscopic Capsules , 2012, IEEE Transactions on Biomedical Engineering.

[4]  Sohaib Khan,et al.  Camera calibration and three-dimensional world reconstruction of stereo-vision using neural networks , 2001, Int. J. Syst. Sci..

[5]  Andreas Geiger,et al.  Object scene flow for autonomous vehicles , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Weihua Li,et al.  An Effective Localization Method for Robotic Endoscopic Capsules Using Multiple Positron Emission Markers , 2014, IEEE Transactions on Robotics.

[7]  Roland Memisevic,et al.  Learning Visual Odometry with a Convolutional Network , 2015, VISAPP.

[8]  H. C. Longuet-Higgins,et al.  A computer algorithm for reconstructing a scene from two projections , 1981, Nature.

[9]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[10]  Friedrich Fraundorfer,et al.  Visual Odometry Part I: The First 30 Years and Fundamentals , 2022 .

[11]  Evaggelos Spyrou,et al.  Capsule endoscope localization based on visual features , 2013, 13th IEEE International Conference on BioInformatics and BioEngineering.

[12]  Dimitris K. Iakovidis,et al.  Video-based measurements for wireless capsule endoscope tracking , 2014 .

[13]  Carl G. Looney,et al.  Pattern recognition using neural networks: theory and algorithms for engineers and scientists , 1997 .

[14]  Kaveh Pahlavan,et al.  Comparative Performance Evaluation of RF Localization for Wireless Capsule Endoscopy Applications , 2014, International Journal of Wireless Information Networks.

[15]  Anupam Singh,et al.  Continuing challenges in the diagnosis and management of obscure gastrointestinal bleeding. , 2014, World journal of gastrointestinal pathophysiology.

[16]  H. Kita,et al.  Double-balloon endoscopy for the diagnosis and treatment of small intestinal disease. , 2006, Best practice & research. Clinical gastroenterology.

[17]  Zhengyou Zhang,et al.  Flexible camera calibration by viewing a plane from unknown orientations , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[18]  James R. Bergen,et al.  Visual odometry , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[19]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[20]  Evaggelos Spyrou,et al.  Comparative assessment of feature extraction methods for visual odometry in wireless capsule endoscopy , 2015, Comput. Biol. Medicine.

[21]  Carlo Tomasi,et al.  Good features to track , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[22]  Dimitris K. Iakovidis,et al.  Robotic validation of visual odometry for wireless capsule endoscopy , 2016, 2016 IEEE International Conference on Imaging Systems and Techniques (IST).

[23]  Д А Кондратьева PILLCAM CAPSULE ENDOSCOPY , 2016 .

[24]  Levin J. Sliker,et al.  Flexible and capsule endoscopy for screening, diagnosis and treatment , 2014, Expert review of medical devices.

[25]  Kurt Hornik,et al.  Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks , 1990, Neural Networks.

[26]  D. Iakovidis,et al.  Software for enhanced video capsule endoscopy: challenges for essential progress , 2015, Nature Reviews Gastroenterology &Hepatology.

[27]  Juho Kannala,et al.  A generic camera model and calibration method for conventional, wide-angle, and fish-eye lenses , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  Alexandros Karargyris,et al.  Optimizing lesion detection in small-bowel capsule endoscopy: from present problems to future solutions , 2015, Expert review of gastroenterology & hepatology.

[29]  Erkan Besdok,et al.  3D Vision by Using Calibration Pattern with Inertial Sensor and RBF Neural Networks , 2009, Sensors.

[30]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.