Content-based Image Sequence Representation ?

In this chapter we overview methods that represent video se- quences in terms of their content. These methods dier from those devel- oped for MPEG/H.26X coding standards in that sequences are described in terms of extended images instead of collections of frames. We describe how these extended images, e.g., mosaics, are generated by the basically same principle: the incremental composition of visual photometric, geo- metric, and multi-view information into one or more extended images. Dierent outputs, e.g., from single 2-D mosaics to full 3-D mosaics, are obtained depending on the quality and quantity of photometric, geomet- ric, and multi-view information. In particular, we detail a framework well suited to the representation of scenes with independently moving objects. We address the two following important cases: i) the moving objects can be represented by 2-D silhouettes (generative video approach) ; or ii) the camera motion is such that the moving object must be described by their 3-D shape (recovered through rank 1 surface-based factorization). A ba- sic pre-processing step in content-based image sequence representation is to extract and track the relevant background and foreground objects. This is achieved by 2-D shape segmentation for which there is a wealth of methods and approaches. The chapter includes a brief description of active contour methods for image segmentation.

[1]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  José M. F. Moura,et al.  Video compression via constructs , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[3]  Demetri Terzopoulos,et al.  Topologically adaptable snakes , 1995, Proceedings of IEEE International Conference on Computer Vision.

[4]  Jean-Michel Morel,et al.  Variational methods in image segmentation , 1995 .

[5]  Takeo Kanade,et al.  A Multibody Factorization Method for Independently Moving Objects , 1998, International Journal of Computer Vision.

[6]  Anthony J. Yezzi,et al.  A geometric snake model for segmentation of medical imagery , 1997, IEEE Transactions on Medical Imaging.

[7]  Jerry L. Prince,et al.  Snakes, shapes, and gradient vector flow , 1998, IEEE Trans. Image Process..

[8]  Masahide Kaneko,et al.  Coding of facial image sequence based on a 3-D model of the head and motion detection , 1991, J. Vis. Commun. Image Represent..

[9]  Takeo Kanade,et al.  A Paraperspective Factorization Method for Shape and Motion Recovery , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  A. Murat Tekalp,et al.  Simultaneous motion estimation and segmentation , 1997, IEEE Trans. Image Process..

[11]  Marc Levoy,et al.  Light field rendering , 1996, SIGGRAPH.

[12]  Takeo Kanade,et al.  Virtualized Reality: Constructing Virtual Worlds from Real Scenes , 1997, IEEE Multim..

[13]  José M. F. Moura,et al.  Video representation with three-dimensional entities , 1998, IEEE J. Sel. Areas Commun..

[14]  Radu S. Jasinschi,et al.  Apparent 3-D camera velocity-extraction and applications , 2000, IEEE Trans. Circuits Syst. Video Technol..

[15]  Richard Szeliski,et al.  The lumigraph , 1996, SIGGRAPH.

[16]  James S. Duncan,et al.  Deformable boundary finding in medical images by integrating gradient and region information , 1996, IEEE Trans. Medical Imaging.

[17]  James S. Duncan,et al.  Game-Theoretic Integration for Image Segmentation , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Takeo Kanade,et al.  A sequential factorization method for recovering shape and motion from image streams , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  Michal Irani,et al.  Representation of scenes from collections of images , 1995, Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95).

[20]  Takeo Kanade,et al.  A factorization method for affine structure from line correspondences , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[21]  Guillermo Sapiro,et al.  Geodesic Active Contours , 1995, International Journal of Computer Vision.

[22]  J. Sethian,et al.  FRONTS PROPAGATING WITH CURVATURE DEPENDENT SPEED: ALGORITHMS BASED ON HAMILTON-JACOB1 FORMULATIONS , 2003 .

[23]  Kiyoharu Aizawa,et al.  Model-based analysis synthesis image coding (MBASIC) system for a person's face , 1989, Signal Process. Image Commun..

[24]  Wiro J. Niessen Geometric partial differential equations and image analysis [Book Reviews] , 2001, IEEE Transactions on Medical Imaging.

[25]  P. Anandan,et al.  Efficient representations of video sequences and their applications , 1996, Signal Process. Image Commun..

[26]  José M. F. Moura,et al.  Rank 1 Weighted Factorization for 3D Structure Recovery: Algorithms and Performance Analysis , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[27]  Edward H. Adelson,et al.  Representing moving images with layers , 1994, IEEE Trans. Image Process..

[28]  Ruzena Bajcsy,et al.  Segmentation of range images as the search for geometric parametric models , 1995, International Journal of Computer Vision.

[29]  Rama Chellappa,et al.  Estimating the Kinematics and Structure of a Rigid Object from a Sequence of Monocular Images , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[30]  Sanjoy K. Mitter,et al.  Hierarchical Image Segmentation—Part I: Detection of Regular Curves in a Vector Graph , 1998, International Journal of Computer Vision.

[31]  Takeo Kanade,et al.  Shape and motion from image streams under orthography: a factorization method , 1992, International Journal of Computer Vision.

[32]  Alex Pentland,et al.  Recursive Estimation of Motion, Structure, and Focal Length , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[33]  Norman J. Zabusky,et al.  Regularization of contour dynamical algorithms. I. tangential regularization , 1983 .

[34]  G. Borgefors Distance transformations in arbitrary dimensions , 1984 .

[35]  Thomas S. Huang,et al.  Uniqueness and Estimation of Three-Dimensional Motion Parameters of Rigid Objects with Curved Surfaces , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[36]  Walter Bender,et al.  Salient video stills: content and context preserved , 1993, MULTIMEDIA '93.

[37]  Edward H. Adelson,et al.  A multiresolution spline with application to image mosaics , 1983, TOGS.

[38]  D Marr,et al.  Theory of edge detection , 1979, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[39]  J. Sethian Numerical algorithms for propagating interfaces: Hamilton-Jacobi equations and conservation laws , 1990 .

[40]  S. Brendle,et al.  Calculus of Variations , 1927, Nature.

[41]  Demetri Terzopoulos,et al.  Snakes: Active contour models , 2004, International Journal of Computer Vision.

[42]  D. Mumford,et al.  Optimal approximations by piecewise smooth functions and associated variational problems , 1989 .

[43]  Jitendra Malik,et al.  Modeling and Rendering Architecture from Photographs: A hybrid geometry- and image-based approach , 1996, SIGGRAPH.

[44]  Takeo Kanade,et al.  A Paraperspective Factorization Method for Shape and Motion Recovery , 1994, ECCV.

[45]  Irvin Sobel,et al.  Neighborhood coding of binary images for fast contour following and general binary array processing , 1978 .

[46]  José M. F. Moura,et al.  Content-based video sequence representation , 1995, Proceedings., International Conference on Image Processing.

[47]  J. Sethian Curvature and the evolution of fronts , 1985 .

[48]  José M. F. Moura,et al.  Stochastic active contour for cardiac MR image segmentation , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[49]  Michael S. Landy,et al.  Computational models of visual processing , 1991 .

[50]  Theodosios Pavlidis,et al.  Integrating Region Growing and Edge Detection , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[51]  Richard I. Hartley,et al.  In Defense of the Eight-Point Algorithm , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[52]  B. Brunt The calculus of variations , 2003 .

[53]  Andrew Lippman,et al.  Movie-maps: An application of the optical videodisc to computer graphics , 1980, SIGGRAPH '80.

[54]  Jerry L. Prince,et al.  Gradient vector flow: a new external force for snakes , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[55]  Demetri Terzopoulos,et al.  Constraints on Deformable Models: Recovering 3D Shape and Nonrigid Motion , 1988, Artif. Intell..

[56]  Sanjoy K. Mitter,et al.  Beyond the Uniqueness Assumption: Ambiguity Representation and Redundancy Elimination in the Computation of a Covering Sample of Salient Contour Cycles , 1999, Comput. Vis. Image Underst..

[57]  José M. F. Moura,et al.  Three-dimensional modeling from two-dimensional video , 2001, IEEE Trans. Image Process..

[58]  Demetri Terzopoulos,et al.  T-snakes: Topology adaptive snakes , 2000, Medical Image Anal..

[59]  Demetri Terzopoulos,et al.  Energy Constraints on Deformable Models: Recovering Shape and Non-Rigid Motion , 1987, AAAI.

[60]  Azriel Rosenfeld,et al.  Picture Processing and Psychopictorics , 1970 .

[61]  José M. F. Moura,et al.  Nonlinear editing by Generative Video , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[62]  P. Lions,et al.  Axioms and fundamental equations of image processing , 1993 .

[63]  H. C. Longuet-Higgins,et al.  A computer algorithm for reconstructing a scene from two projections , 1981, Nature.

[64]  Tomaso A. Poggio,et al.  On Edge Detection , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[65]  Kristin J. Dana,et al.  Real-time scene stabilization and mosaic construction , 1994, Proceedings of 1994 IEEE Workshop on Applications of Computer Vision.

[66]  S. Osher Riemann Solvers, the Entropy Condition, and Difference , 1984 .

[67]  Rémi Ronfard,et al.  Region-based strategies for active contour models , 1994, International Journal of Computer Vision.

[68]  J. M. Gerzso,et al.  Computer graphics and interactive techniques: 15th-17th July 1974. Boulder, Colorado, USA. Sponsored by the University of Colorado Computing Centre and ACM/SIGGRAPH , 1975, Comput. Aided Des..

[69]  Laurent D. Cohen,et al.  On active contour models and balloons , 1991, CVGIP Image Underst..

[70]  James A. Sethian,et al.  Level Set Methods and Fast Marching Methods , 1999 .

[71]  James A. Sethian,et al.  Level Set Methods and Fast Marching Methods/ J. A. Sethian , 1999 .

[72]  Theodosios Pavlidis,et al.  Integrating region growing and edge detection , 1988, Proceedings CVPR '88: The Computer Society Conference on Computer Vision and Pattern Recognition.

[73]  P. Anandan,et al.  Hierarchical Model-Based Motion Estimation , 1992, ECCV.

[74]  José M. F. Moura,et al.  STACS: new active contour scheme for cardiac MR image segmentation , 2005, IEEE Transactions on Medical Imaging.

[75]  Tony F. Chan,et al.  Active contours without edges , 2001, IEEE Trans. Image Process..

[76]  E. Adelson,et al.  The Plenoptic Function and the Elements of Early Vision , 1991 .

[77]  Rolf Adams,et al.  Seeded Region Growing , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[78]  Baba C. Vemuri,et al.  Shape Modeling with Front Propagation: A Level Set Approach , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[79]  Steven D. Blostein,et al.  Motion-based object segmentation and estimation using the MDL principle , 1995, IEEE Trans. Image Process..

[80]  Denis Laurendeau,et al.  A General Surface Approach to the Integration of a Set of Range Views , 1995, IEEE Trans. Pattern Anal. Mach. Intell..