Motion representation using composite energy features

This work tackles the segmentation of apparent-motion from a bottom-up perspective. When no information is available to build prior high-level models, the only alternative are bottom-up techniques. Hence, the whole segmentation process relies on the suitability of the low-level features selected to describe motion. A wide variety of low-level spatio-temporal features have been proposed so far. However, all of them suffer from diverse drawbacks. Here, we propose the use of composite energy features in bottom-up motion segmentation to solve several of these problems. Composite energy features are clusters of energy filters-pairs of band-pass filters in quadrature-each one sensitive to a different set of scale, orientation, direction of motion and speed. They are grouped in order to reconstruct independent motion patterns in a video sequence. A composite energy feature, this is, the response of one of these clusters of filters, can be built as a combination of the responses of the individual filters. Therefore, it inherits the desirable properties of energy filters but providing a more complete representation of motion patterns. In this paper, we will present our approach for integration of composite features based on the concept of Phase Congruence. We will show some results that illustrate the capabilities of this low-level motion representation and its usefulness in bottom-up motion segmentation and tracking.

[1]  Hironobu Fujiyoshi,et al.  Moving target classification and tracking from real-time video , 1998, Proceedings Fourth IEEE Workshop on Applications of Computer Vision. WACV'98 (Cat. No.98EX201).

[2]  Xose Manuel Pardo,et al.  Generalized ellipsoids and anisotropic filtering for segmentation improvement in 3D medical imaging , 2003, Image Vis. Comput..

[3]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[4]  Janusz Konrad,et al.  Multiple motion segmentation with level sets , 2003, IEEE Trans. Image Process..

[5]  Jake K. Aggarwal,et al.  Temporal spatio-velocity transform and its application to tracking and interaction , 2004, Comput. Vis. Image Underst..

[6]  Tieniu Tan,et al.  A survey on visual surveillance of object motion and behaviors , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[7]  Eero P. Simoncelliy,et al.  Computing Optical Flow Distributions Using Spatio-temporal Filters , 1996 .

[8]  Oscar Nestares,et al.  Automatic enhancement of noisy image sequences through local spatiotemporal spectrum analysis , 2000 .

[9]  Xosé R. Fernández-Vidal,et al.  Dissimilarity Measures for Visual Pattern Partitioning , 2005, IbPRIA.

[10]  D J Heeger,et al.  Model for the extraction of image flow. , 1987, Journal of the Optical Society of America. A, Optics and image science.

[11]  Arnold W. M. Smeulders,et al.  Fast occluded object tracking by a robust appearance filter , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Charles Kervrann,et al.  A Hierarchical Markov Modeling Approach for the Segmentation and Tracking of Deformable Shapes , 1998, Graph. Model. Image Process..

[13]  S. Osher,et al.  Geometric Level Set Methods in Imaging, Vision, and Graphics , 2011, Springer New York.

[14]  Daniel P. Huttenlocher,et al.  Adaptive Bayesian recognition in tracking rigid objects , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[15]  Chuan Yi Tang,et al.  A 2.|E|-Bit Distributed Algorithm for the Directed Euler Trail Problem , 1993, Inf. Process. Lett..

[16]  W. Eric L. Grimson,et al.  Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[17]  Xosé R. Fernández-Vidal,et al.  The RGFF Representational Model: A System for the Automatically Learned Partitioning of 'Visual Patterns' in Digital Images , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Jesús Chamorro-Martínez,et al.  A frequency-domain approach for the extraction of motion patterns , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[19]  Rachid Deriche,et al.  Geodesic Active Contours and Level Sets for the Detection and Tracking of Moving Objects , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  Nikhil R. Pal,et al.  Cluster validation using graph theoretic concepts , 1997, Pattern Recognit..

[21]  David J. Fleet,et al.  Performance of optical flow techniques , 1994, International Journal of Computer Vision.

[22]  A. Murat Tekalp,et al.  Simultaneous motion estimation and segmentation , 1997, IEEE Trans. Image Process..

[23]  Guillermo Sapiro,et al.  Geodesic Active Contours , 1995, International Journal of Computer Vision.

[24]  J. M. Hans du Buf,et al.  Ramp edges, Mach bands, and the functional significance of the simple cell assembly , 1994, Biological Cybernetics.

[25]  Robyn A. Owens,et al.  Feature detection from local energy , 1987, Pattern Recognit. Lett..

[26]  Svetha Venkatesh,et al.  On the classification of image features , 1990, Pattern Recognit. Lett..

[27]  David J. Fleet Measurement of image velocity , 1992 .

[28]  A J Ahumada,et al.  Model of human visual-motion sensing. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[29]  J. Weickert,et al.  Fast Methods for Implicit Active Contour Models , 2003 .

[30]  Demetri Terzopoulos,et al.  Snakes: Active contour models , 2004, International Journal of Computer Vision.

[31]  E H Adelson,et al.  Spatiotemporal energy models for the perception of motion. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[32]  Xosé R. Fernández-Vidal,et al.  Decomposition of three-dimensional medical images into visual patterns , 2005, IEEE Transactions on Biomedical Engineering.

[33]  D. Burr,et al.  The conditions under which Mach bands are visible , 1989, Vision Research.

[34]  Lucas J. van Vliet,et al.  3D-Orientation Space; Filters and Sampling , 2003, SCIA.

[35]  Filiberto Pla,et al.  An iterative region‐growing algorithm for motion segmentation and estimation , 2005, Int. J. Intell. Syst..

[36]  David J. Field,et al.  What Is the Goal of Sensory Coding? , 1994, Neural Computation.

[37]  Filiberto Pla,et al.  An iterative region-growing algorithm for motion segmentation and estimation: Research Articles , 2005 .

[38]  A.V. Oppenheim,et al.  The importance of phase in signals , 1980, Proceedings of the IEEE.

[39]  Dragoljub Pokrajac,et al.  Using spatiotemporal blocks to reduce the uncertainty in detecting and tracking moving objects in video , 2006, Int. J. Intell. Syst. Technol. Appl..

[40]  B. S. Manjunath,et al.  MPEG‐7 Homogeneous Texture Descriptor , 2001 .

[41]  C. Stiller,et al.  Estimating motion in image sequences , 1999, IEEE Signal Process. Mag..