Automatic Representation and Segmentation of Video Sequences via a Novel Framework Based on the nD-EVM and Kohonen Networks

Recently in the Computer Vision field, a subject of interest, at least in almost every video application based on scene content, is video segmentation. Some of these applications are indexing, surveillance, medical imaging, event analysis, and computer-guided surgery, for naming some of them. To achieve their goals, these applications need meaningful information about a video sequence, in order to understand the events in its corresponding scene. Therefore, we need semantic information which can be obtained from objects of interest that are present in the scene. In order to recognize objects we need to compute features which aid the finding of similarities and dissimilarities, among other characteristics. For this reason, one of the most important tasks for video and image processing is segmentation. The segmentation process consists in separating data into groups that share similar features. Based on this, in this work we propose a novel framework for video representation and segmentation. The main workflow of this framework is given by the processing of an input frame sequence in order to obtain, as output, a segmented version. For video representation we use the Extreme Vertices Model in the -Dimensional Space while we use the Discrete Compactness descriptor as feature and Kohonen Self-Organizing Maps for segmentation purposes.

[1]  E. Bribiesca Measuring 2D shape compactness using the contact perimeter , 1997 .

[2]  Thomas B. Moeslund,et al.  Introduction to Video and Image Processing: Building Real Systems and Applications , 2012 .

[3]  Jan-Michael Frahm,et al.  Towards Urban 3D Reconstruction from Video , 2006, Third International Symposium on 3D Data Processing, Visualization, and Transmission (3DPVT'06).

[4]  King-Sun Fu,et al.  A survey on image segmentation , 1981, Pattern Recognit..

[6]  Ernesto Bribiesca,et al.  State of the Art of Compactness and Circularity Measures 1 , 2009 .

[7]  Simon Haykin,et al.  Neural Networks and Learning Machines , 2010 .

[8]  Mark S. Nixon,et al.  Feature Extraction and Image Processing , 2002 .

[9]  R. Haier,et al.  Human intelligence and brain networks , 2010, Dialogues in clinical neuroscience.

[10]  Camillo Gentile,et al.  Segmentation for robust tracking in the presence of severe occlusion , 2001, IEEE Transactions on Image Processing.

[11]  Eric L. Miller,et al.  Multiple Hypothesis Video Segmentation from Superpixel Flows , 2010, ECCV.

[12]  King Ngi Ngan,et al.  Face segmentation using skin-color map in videophone applications , 1999, IEEE Trans. Circuits Syst. Video Technol..

[13]  King Ngi Ngan,et al.  Segmentation and Tracking of Moving objects for Content-Based Video Coding, "IEE , 1999 .

[14]  Ricardo Pérez Aguila Modeling and Manipulating 3D Datasets through the Extreme Vertices Model in the n-Dimensional Space (nD-EVM) , 2007 .

[15]  Michael G. Strintzis,et al.  Real-time compressed-domain spatiotemporal segmentation and ontologies for video indexing and retrieval , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[16]  Mohan M. Trivedi,et al.  Detecting Moving Shadows: Algorithms and Evaluation , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Bernt Schiele,et al.  Video Segmentation with Superpixels , 2012, ACCV.

[18]  E. Bribiesca A measure of compactness for 3D shapes , 2000 .

[19]  Jim R. Parker,et al.  Algorithms for image processing and computer vision , 1996 .

[20]  Ujjwal Maulik,et al.  Soft Computing for Image and Multimedia Data Processing , 2013, Springer Berlin Heidelberg.

[21]  Jenq-Neng Hwang,et al.  Fast and automatic video object segmentation and tracking for content-based applications , 2002, IEEE Trans. Circuits Syst. Video Technol..

[22]  King Ngi Ngan,et al.  Video Segmentation and Its Applications , 2011 .

[23]  Y. Ramadevi Synergy between Object Recognition and Image Segmentation , 2010 .

[24]  K. Ruland,et al.  The pickup and delivery problem: Faces and branch-and-cut algorithm , 1997 .

[25]  Iain E. G. Richardson,et al.  H.264 and MPEG-4 Video Compression: Video Coding for Next-Generation Multimedia , 2003 .

[26]  Sing Bing Kang,et al.  Stereo for Image-Based Rendering using Image Over-Segmentation , 2007, International Journal of Computer Vision.

[27]  Bernt Schiele,et al.  Learning Must-Link Constraints for Video Segmentation Based on Spectral Clustering , 2014, GCPR.

[28]  Jan-Michael Frahm,et al.  Detailed Real-Time Urban 3D Reconstruction from Video , 2007, International Journal of Computer Vision.

[29]  Cordelia Schmid,et al.  Action Recognition with Improved Trajectories , 2013, 2013 IEEE International Conference on Computer Vision.

[30]  Ricardo Pérez Aguila Representing and Visualizing Vectorized Videos through the Extreme Vertices Model in the n-Dimensional Space (nD-EVM) , 2007 .

[31]  Hyeran Byun,et al.  FRIP: a region-based image retrieval tool using automatic image segmentation and stepwise Boolean AND matching , 2005, IEEE Transactions on Multimedia.

[32]  Henning Biermann,et al.  Recovering non-rigid 3D shape from image streams , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[33]  Qi Tian,et al.  Foreground object detection from videos containing complex background , 2003, MULTIMEDIA '03.

[34]  King Ngi Ngan,et al.  Video segmentation for content-based coding , 1999, IEEE Trans. Circuits Syst. Video Technol..

[35]  Yu-Jin Zhang,et al.  Advances in image and video segmentation , 2006 .

[36]  Rita Cucchiara,et al.  Detecting Moving Objects, Ghosts, and Shadows in Video Streams , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[37]  Chen Wang,et al.  Semantic Object Segmentation in Tagged Videos via Detection , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[38]  Marcel Worring,et al.  Multimodal Video Indexing : A Review of the State-ofthe-art , 2001 .

[39]  Lurng-Kuo Liu,et al.  Model-based video segmentation for vision-augmented interactive games , 2000, Electronic Imaging.

[40]  Cordelia Schmid,et al.  Dense Trajectories and Motion Boundary Descriptors for Action Recognition , 2013, International Journal of Computer Vision.