A scale selection principle for estimating image deformations

A basic functionality of a vision system concerns the ability to compute deformation elds between dierent images of the same physical structure. This article advocates the need for incorporating an explicit mechanism for scale selection in this context, in algorithms for computing descriptors such as optic flow and for performing stereo matching. A basic reason why such a mechanism is essential is the fact that in a coarse-to-ne propagation of disparity or flow information, it is not necessarily the case that the most accurate estimates are obtained at the nest scales. The existence of interfering structures at ne scales may make it impossible to accurately match the image data at ne scales. A systematic methodology for approaching this problem is proposed, by estimating the uncertainty in the computed flow estimate at each scale, and then selecting deformation estimates from the scales that minimize the (suitably normalized) uncertainty over scales. A specic implementation of this idea is presented for a region based dierential flow estimation scheme. It is shown that the integrated scale selection and flow estimation algorithm has the qualitative properties of leading to the selection of coarser scales for larger size image structures and increasing noise level, whereas it leads to the selection of ner scales in the neighbourhood of flow eld discontinuities. The latter property may serve as an indicator when detecting flow eld discontinuities and occlusions. I would like to thank D. Betsis and G. Orban for valuable discussions as well as J. Koenderink and A. van Doorn for providing the torso image in gure 8. This work was partially performed under the Esprit-BRA project InSight and the Esprit-NSF collaboration Diusion. The support from the Swedish Research Council for Engineering Sciences, TFR, is gratefully acknowledged.

[1]  Tony Lindeberg,et al.  Shape-adapted smoothing in estimation of 3-D shape cues from affine deformations of local 2-D brightness structure , 1997, Image Vis. Comput..

[2]  Max A. Viergever,et al.  Scale and the differential structure of images , 1992, Image Vis. Comput..

[3]  Jitendra Malik,et al.  Computational framework for determining stereo correspondence from a set of linear spatial filters , 1992, Image Vis. Comput..

[4]  Fredrik Johansson,et al.  Automatic matching of brain images and brain atlases using multi-scale fusion algorithms , 1997 .

[5]  Andrea J. van Doorn,et al.  Invariant Properties of the Motion Parallax Field due to the Movement of Rigid Bodies Relative to an Observer , 1975 .

[6]  T. Lindeberg Direct Estimation of Affine Deformations of Brightness Patterns Using Visual Front-End O , 1995 .

[7]  Hans-Hellmut Nagel,et al.  Optical Flow Estimation: Advances and Comparisons , 1994, ECCV.

[8]  Tony Lindeberg,et al.  Direct estimation of affine image deformations using visual front-end operations with automatic scale selection , 1995, Proceedings of IEEE International Conference on Computer Vision.

[9]  Berthold K. P. Horn,et al.  Determining Optical Flow , 1981, Other Conferences.

[10]  Luc Van Gool,et al.  Determination of Optical Flow and its Discontinuities using Non-Linear Diffusion , 1994, ECCV.

[11]  Tony Lindeberg,et al.  Scale-Space Theory in Computer Vision , 1993, Lecture Notes in Computer Science.

[12]  Tony Lindeberg,et al.  Shape-Adapted Smoothing in Estimation of 3-D Depth Cues from Affine Distortions of Local 2-D Brightness Structure , 1994, ECCV.

[13]  Berthold K. P. Horn,et al.  Shape from shading , 1989 .

[14]  P. Anandan,et al.  Hierarchical Model-Based Motion Estimation , 1992, ECCV.

[15]  Takeo Kanade,et al.  A Stereo Matching Algorithm with an Adaptive Window: Theory and Experiment , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  T. Lindeberg,et al.  Shape-adapted smoothing in estimation of 3-D depth cues from affine distortions of local 2-D structure , 1997 .

[17]  C. Erkelens,et al.  Stability of Binocular Depth Perception with Moving Head and Eyes , 1996, Vision Research.

[18]  Linda G. Shapiro,et al.  Computer and Robot Vision , 1991 .

[19]  Tony Lindeberg,et al.  Shape from texture from a multi-scale perspective , 1993, 1993 (4th) International Conference on Computer Vision.

[20]  Alex Pentland,et al.  Photometric motion , 1991, [1990] Proceedings Third International Conference on Computer Vision.

[21]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[22]  Tony Lindeberg,et al.  Scale selection for differential operators , 1994 .

[23]  Steven S. Beauchemin,et al.  The computation of optical flow , 1995, CSUR.

[24]  P. Lions,et al.  Axioms and fundamental equations of image processing , 1993 .

[25]  Andrew P. Witkin,et al.  Scale-Space Filtering , 1983, IJCAI.

[26]  J. Koenderink,et al.  Surface perception in pictures , 1992, Perception & psychophysics.

[27]  Tony Lindeberg,et al.  On Automatic Selection of Temporal Scales in Time-Causal Scale-Space , 1997, AFPAC.

[28]  J. Garding,et al.  Shape from texture and contour by weak isotropy , 1990, [1990] Proceedings. 10th International Conference on Pattern Recognition.

[29]  Olivier D. Faugeras,et al.  Computing differential properties of 3-D shapes from stereoscopic images without 3-D models , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[30]  J J Koenderink,et al.  Affine structure from motion. , 1991, Journal of the Optical Society of America. A, Optics and image science.

[31]  G. Orban,et al.  Shape and Spatial Distribution of Receptive Fields and Antagonistic Motion Surrounds in the Middle Temporal Area (V5) of the Macaque , 1995, The European journal of neuroscience.

[32]  Tony Lindeberg,et al.  Junction detection with automatic selection of detection scales and localization scales , 1994, Proceedings of 1st International Conference on Image Processing.

[33]  Jitendra Malik,et al.  A differential method for computing local shape-from-texture for planar and curved surfaces , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[34]  J. Koenderink,et al.  Extraction of motion parallax structure in the visual system I , 2004, Biological Cybernetics.

[35]  M. Nielsen,et al.  The Intrinsic Structure of the Optic Flow Field , 1994 .

[36]  R. Manmatha Measuring the Affine Transform Using Gaussian Filters , 1994, ECCV.

[37]  Alan L. Yuille,et al.  Scaling Theorems for Zero Crossings , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[38]  Jitendra Malik,et al.  A differential method for computing local shape-from-texture for planar and curved surfaces , 1993, Proceedings of 27th Asilomar Conference on Signals, Systems and Computers.

[39]  Roberto Cipolla,et al.  Extracting the Affine Transformation from Texture Moments , 1994, ECCV.

[40]  Massimo Tistarelli,et al.  Multiple Constraints to Compute Optical Flow , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[41]  Tony Lindeberg,et al.  Effective Scale: A Natural Unit for Measuring Scale-Space Lifetime , 1993, IEEE Trans. Pattern Anal. Mach. Intell..