Fusion of Foreground Object, Spatial and Frequency Domain Motion Information for Video Summarization

Surveillance video camera captures a large amount of continuous video stream every day. To analyze or investigate any significant events from the huge video data, it is laborious and boring job to identify these events. To solve this problem, a video summarization technique combining foreground objects as well as motion information in spatial and frequency domain is proposed in this paper. We extract foreground object using background modeling and motion information in spatial domain and frequency domain. Frame transition is applied for obtaining motion information in spatial domain. For acquiring motion information in frequency domain, phase correlation PC technique is applied. Later, foreground objects and motions in spatial and frequency domain are fused and key frames are extracted. Experimental results reveal that the proposed method performs better than the state-of-the-art method.

[1]  Sung Wook Baik,et al.  Saliency-directed prioritization of visual data in wireless surveillance networks , 2015, Inf. Fusion.

[2]  Narendra Ahuja,et al.  Integration of frequency and space for multiple motion estimation and shape-independent object segmentation , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[3]  Chun-Rong Huang,et al.  Maximum a Posteriori Probability Estimation for Online Surveillance Video Synopsis , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[4]  Yong Jae Lee,et al.  Discovering important people and objects for egocentric video summarization , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Manoranjan Paul,et al.  A hybrid object detection technique from dynamic background using Gaussian mixture models , 2008, 2008 IEEE 10th Workshop on Multimedia Signal Processing.

[6]  Masahiro Toyoura,et al.  Film Comic Generation with Eye Tracking , 2013, MMM.

[7]  Weisi Lin,et al.  Scene-Based Movie Summarization Via Role-Community Networks , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[8]  Chia-han Lee,et al.  On-Line Multi-View Video Summarization for Wireless Video Sensor Network , 2015, IEEE Journal of Selected Topics in Signal Processing.

[9]  Chih-Jen Lin,et al.  Large-Scale Video Summarization Using Web-Image Priors , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Nuno Vasconcelos,et al.  On the plausibility of the discriminant center-surround hypothesis for visual saliency. , 2008, Journal of vision.

[11]  Lei Sun,et al.  The dynamic VideoBook: A hierarchical summarization for surveillance video , 2013, 2013 IEEE International Conference on Image Processing.

[12]  Bu-Sung Lee,et al.  Direct Intermode Selection for H.264 Video Coding Using Phase Correlation , 2011, IEEE Transactions on Image Processing.

[13]  Luc Van Gool,et al.  Video summarization by learning submodular mixtures of objectives , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Klaus Schöffmann,et al.  Keyframe extraction in endoscopic video , 2015, Multimedia Tools and Applications.

[15]  Petros Maragos,et al.  Multimodal Saliency and Fusion for Movie Summarization Based on Aural, Visual, and Textual Attention , 2013, IEEE Transactions on Multimedia.

[16]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[17]  Manoranjan Paul,et al.  A novel video coding scheme using a scene adaptive non-parametric background model , 2014, 2014 IEEE 16th International Workshop on Multimedia Signal Processing (MMSP).

[18]  Michael R. Frater,et al.  An Efficient Mode Selection Prior to the Actual Encoding for H.264/AVC Encoder , 2009, IEEE Transactions on Multimedia.

[19]  Manoranjan Paul,et al.  An efficient video coding using phase-matched error from phase correlation information , 2008, 2008 IEEE 10th Workshop on Multimedia Signal Processing.

[20]  Manoranjan Paul,et al.  Disparity-adjusted 3D multi-view video coding with dynamic background modelling , 2013, 2013 IEEE International Conference on Image Processing.

[21]  Narendra Ahuja,et al.  Joint spatial and frequency domain motion analysis , 2006, 7th International Conference on Automatic Face and Gesture Recognition (FGR06).

[22]  Evaggelos Spyrou,et al.  Panoramic Visual Summaries for Efficient Reading of Capsule Endoscopy Videos , 2013, 2013 8th International Workshop on Semantic and Social Media Adaptation and Personalization.

[23]  Ian Witten,et al.  Data Mining , 2000 .

[24]  Jien Kato,et al.  A distance metric learning based summarization system for nursery school surveillance video , 2012, 2012 19th IEEE International Conference on Image Processing.

[25]  James M. Rehg,et al.  Gaze-enabled egocentric video summarization via constrained submodular maximization , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Hanqing Lu,et al.  Object-centered narratives for video surveillance , 2012, 2012 19th IEEE International Conference on Image Processing.

[27]  Kristen Grauman,et al.  Story-Driven Summarization for Egocentric Video , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[28]  Sung Wook Baik,et al.  Video summarization based tele-endoscopy: a service to efficiently manage visual data generated during wireless capsule endoscopy procedure , 2014, Journal of Medical Systems.

[29]  Manoranjan Paul,et al.  Efficient video coding using optimal compression plane and background modelling , 2012 .

[30]  Bu-Sung Lee,et al.  Explore and Model Better I-Frames for Video Coding , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[31]  F. Sun,et al.  Outlier-attenuating summarization for user-generated-video , 2014, 2014 IEEE International Conference on Multimedia and Expo (ICME).

[32]  Ahmed Z. Emam,et al.  Endoscopy video summarization based on unsupervised learning and feature discrimination , 2013, 2013 Visual Communications and Image Processing (VCIP).