New Techniques for 3D Modelling...and for doing without

Object recognition, visual robot guidance, and several other vision applications require models of objects or scenes. Computer vision has a tradition of building these models from inherent object characteristics. The problem is that such characteristics are difficult to extract. Recently, a pure view-based object recognition approach was proposed, that is surprisingly performant. It is based on a model that is extracted directly from raw image data. Limitations of both strands raise the question whether there is room for middle ground solutions, that combine the strengths but avoid the weaknesses. Two examples are discussed, where in each case the only input required are images, but where nevertheless substantial feature extraction and analysis are involved. These are non-Euclidean 3D reconstruction from multiple, uncalibrated views and scene description based on local, affinely invariant surface patches that can be extracted from single views. Both models are useful for robot vision tasks such as visual navigation.

[1]  Paul A. Beardsley,et al.  Euclidean Structure from Uncalibrated Images , 1994, BMVC.

[2]  Jitendra Malik,et al.  Modeling and Rendering Architecture from Photographs: A hybrid geometry- and image-based approach , 1996, SIGGRAPH.

[3]  Cordelia Schmid,et al.  Local Grayvalue Invariants for Image Retrieval , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Anders Heyden,et al.  Euclidean reconstruction from image sequences with varying and unknown focal length and principal point , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[5]  Theo Moons,et al.  A Guided Tour Through Multiview Relations , 1998, SMILE.

[6]  Reinhard Koch,et al.  Self-Calibration and Metric Reconstruction Inspite of Varying and Unknown Intrinsic Camera Parameters , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[7]  Luc Van Gool,et al.  Euclidean 3D Reconstruction from Image Sequences with Variable Focal Lenghts , 1996, ECCV.

[8]  Luc Van Gool,et al.  Affine Reconstruction from Perspective Image Pairs with a Relative Object-Camera Translation in Between , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  L. Gool,et al.  Color-Based Moment Invariants for Viewpoint and Illumination Independent Recognition of Planar Color Patterns , 1999 .

[10]  Reinhard Koch,et al.  Matching of affinely invariant regions for visual servoing , 1999, Proceedings 1999 IEEE International Conference on Robotics and Automation (Cat. No.99CH36288C).

[11]  Richard Szeliski,et al.  Layered depth images , 1998, SIGGRAPH.

[12]  Bill Triggs,et al.  Autocalibration and the absolute quadric , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[13]  Andrew Zisserman,et al.  Wide baseline stereo matching , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).