Resolution-appropriate shape representation

We present a new type of "snake" in which the dimensionality of the shapes is scaled appropriately for the resolution of the images in which the shapes are embedded. We define shapes as an ordered list of control points and compute the principal components of the shapes in a prior training set. Our energy function is based upon the Mahalanobis distance of a given shape from the mean shape and on the Mahalanobis distance of the image attributes from image attribute values extracted from the training set. We show that the derivative of this energy function with respect to the modal weights is reduced as the image resolution is reduced, and that the derivative of the energy scales with the variance associated with each mode. We exploit this property to determine the subset of the modes which are relevant at a particular level of image resolution thereby reducing the dimensionality of the shapes. We implement a coarse-to-fine search procedure in the image and shape domains simultaneously, and demonstrate this procedure on the identification of anatomic structures in Computed Tomography images.

[1]  Lawrence Sirovich,et al.  Application of the Karhunen-Loeve Procedure for the Characterization of Human Faces , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Roberto Marcondes Cesar Junior,et al.  Towards effective planar shape representation with multiscale digital curvature analysis based on signal processing techniques , 1996, Pattern Recognit..

[3]  L Sirovich,et al.  Low-dimensional procedure for the characterization of human faces. , 1987, Journal of the Optical Society of America. A, Optics and image science.

[4]  Tomaso Poggio,et al.  Image Representations for Visual Learning , 1996, Science.

[5]  Alex Pentland,et al.  Closed-form solutions for physically-based shape modeling and recognition , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[6]  Jean-Pierre Antoine,et al.  Multiscale shape analysis using the continuous wavelet transform , 1996, Proceedings of 3rd IEEE International Conference on Image Processing.

[7]  Timothy F. Cootes,et al.  Building and using flexible models using grey-level information , 1993, ICCV 1993.

[8]  Gerald J. Kutcher,et al.  Variation in prostate position: Quantitation and implications for three-dimensional conformal radiation therapy , 1993 .

[9]  Nicholas Ayache,et al.  Fast segmentation, tracking, and analysis of deformable objects , 1993, 1993 (4th) International Conference on Computer Vision.

[10]  Stéphane Mallat,et al.  Characterization of Signals from Multiscale Edges , 2011, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Alex Pentland,et al.  A modal framework for correspondence and description , 1993, 1993 (4th) International Conference on Computer Vision.

[12]  Hiroshi Murase,et al.  Learning, positioning, and tracking visual appearance , 1994, Proceedings of the 1994 IEEE International Conference on Robotics and Automation.

[13]  C. Taylor,et al.  Active shape models - 'Smart Snakes'. , 1992 .

[14]  Timothy F. Cootes,et al.  Building and using flexible models incorporating grey-level information , 1993, 1993 (4th) International Conference on Computer Vision.

[15]  Farzin Mokhtarian,et al.  Scale-Based Description and Recognition of Planar Curves and Two-Dimensional Shapes , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Laurent D. Cohen,et al.  On active contour models and balloons , 1991, CVGIP Image Underst..

[17]  Timothy F. Cootes,et al.  Active Shape Models: Evaluation of a Multi-Resolution Method for Improving Image Search , 1994, BMVC.

[18]  P. J. Burt,et al.  The Pyramid as a Structure for Efficient Computation , 1984 .