From stereo image sequences to smooth and robust surface models using temporal information and Bilateral postprocessing

Reconstruction of surface models from camera images has many applications in robotics such as surface registration or object recognition. In this paper, we describe a workflow in which we extract depth information from stereo image sequences to generate a surface model. We present our solutions to correspondence analysis, disparity correction and refinement, as well as 3D reconstruction, point cloud smoothing and meshing. One important feature of the correspondence analysis that we evaluate in detail is the use of temporal information. Another emphasis is on correcting and smoothing the disparity images as well as the reconstructed point cloud without losing too much detail. We, hence, introduce our application of the Bilateral filter on disparity images and our usage of least squares smoothing. The components of the workflow were evaluated using three image sources: Endoscopic images from the daVinci® telemanipulator; images from a stereo camera integrated in the ARMAR III humanoid robot; synthetic data. Depending on the image resolution and the application, the workflow reconstructs surface models in real-time. We show that by using temporal information we obtain more accurate and robust correspondences. Additionally, the Bilateral filter was especially useful in refining the correspondences extracted from endoscopic images as well as the synthetic data sets, whereas the least squares method showed good results in smoothing the point cloud of ARMAR III images. Overall, the presented approach achieves good results for different camera settings and image types, especially with respect to the real-time requirement.

[1]  Stefano Mattoccia,et al.  Accurate and Efficient Cost Aggregation Strategy for Stereo Correspondence Based on Approximated Joint Bilateral Filtering , 2009, ACCV.

[2]  Neil A. Dodgson,et al.  Real-Time Spatiotemporal Stereo Matching Using the Dual-Cross-Bilateral Grid , 2010, ECCV.

[3]  Ève Coste-Manière,et al.  Towards endoscopic augmented reality for robotically assisted minimally invasive cardiac surgery , 2001, Proceedings International Workshop on Medical Imaging and Augmented Reality.

[4]  Tamim Asfour,et al.  ARMAR-III: An Integrated Humanoid Platform for Sensory-Motor Control , 2006, 2006 6th IEEE-RAS International Conference on Humanoid Robots.

[5]  Oliver Schreer,et al.  Stereo analysis by hybrid recursive matching for real-time immersive video conferencing , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[6]  Frédo Durand,et al.  A Fast Approximation of the Bilateral Filter Using a Signal Processing Approach , 2006, International Journal of Computer Vision.

[7]  Sebastian Bodenstedt,et al.  Real-time surface reconstruction from stereo endoscopic images for intraoperative registration , 2011, Medical Imaging.

[8]  John Kenneth Salisbury,et al.  The Intuitive/sup TM/ telesurgery system: overview and application , 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065).

[9]  Danail Stoyanov,et al.  A practical approach towards accurate dense 3D depth recovery for robotic laparoscopic surgery , 2005, Computer aided surgery : official journal of the International Society for Computer Aided Surgery.

[10]  Szymon Rusinkiewicz,et al.  Spacetime stereo: a unifying framework for depth from triangulation , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Russell H. Taylor,et al.  Video to CT Registration for Image Overlay on Solid Organs , 2008 .

[12]  Rüdiger Dillmann,et al.  Towards shape-based visual object categorization for humanoid robots , 2011, 2011 IEEE International Conference on Robotics and Automation.

[13]  Andrew E. Johnson,et al.  Spin-Images: A Representation for 3-D Surface Matching , 1997 .

[14]  Changming Sun,et al.  An energy minimisation approach to stereo-temporal dense reconstruction , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[15]  Tony DeRose,et al.  Surface reconstruction from unorganized points , 1992, SIGGRAPH.

[16]  Bernard Chazelle,et al.  Shape distributions , 2002, TOGS.