Human Motion Trajectory Analysis Based Video Summarization

Multimedia technology is growing day by day and contributing towards enormous amount of video data especially in the area of security surveillance. The browsing through such a large collection of videos is a challenging and time-consuming task. Despite the advancement in technology automatic browsing, retrieval, manipulation and analysis of large videos are still far behind. In this paper a fully automatic human-centric system for video summarization is proposed. In most of the surveillance applications, human motion is of great interest. In proposed system the moving parts in the video are detected using background subtraction, and blobs are extracted from the binary image. Human detection is done through Histogram of Oriented Gradient (HOG) using Support Vector Machine (SVM) classifier. Then, motion of humans is tracked through consecutive frames using Kalman filter, and trajectory of each person is extracted. The analysis of trajectory leads to a meaningful summary which covers only important parts of video. One can also mark region of interest to be included in the summary. Experimental results show the proposed system reduces long video into meaningful summary and saves a lot of time and cost in terms of storage, indexing and browsing effort.

[1]  Tomaso A. Poggio,et al.  A Trainable System for Object Detection , 2000, International Journal of Computer Vision.

[2]  Christopher Dyken,et al.  Simultaneous curve simplification , 2009, J. Geogr. Syst..

[3]  W. Eric L. Grimson,et al.  Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[4]  Cordelia Schmid,et al.  Human Detection Based on a Probabilistic Assembly of Robust Part Detectors , 2004, ECCV.

[5]  Hafiz Adnan Habib,et al.  Video summarization based handout generation from video lectures: a gesture recognition framework , 2005 .

[6]  Ba Tu Truong,et al.  Video abstraction: A systematic review and classification , 2007, TOMCCAP.

[7]  John Adcock,et al.  Video summarization preserving dynamic content , 2007, TVS '07.

[8]  Dariu Gavrila,et al.  Real-time object detection for "smart" vehicles , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[9]  RaviKansagara,et al.  A Study on Video Summarization Techniques , 2014 .

[10]  Jiby J Puthiyidam,et al.  A Survey on Video Summarization Techniques , 2015 .

[11]  David G. Lowe,et al.  Three-Dimensional Object Recognition from Single Two-Dimensional Images , 1987, Artif. Intell..

[12]  Alan M. McIvor,et al.  Background Subtraction Techniques , 2000 .

[13]  Paul A. Viola,et al.  Detecting Pedestrians Using Patterns of Motion and Appearance , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[14]  Yasushi Yagi,et al.  Human detection in outdoor scene using spatio-temporal motion analysis , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[15]  Yasushi Yagi,et al.  Human detection in outdoor scene using spatio-temporal motion analysis , 2004, ICPR 2004.

[16]  V. Ghini,et al.  An audio-video summarization scheme based on audio and video analysis , 2006, CCNC 2006. 2006 3rd IEEE Consumer Communications and Networking Conference, 2006..

[17]  Li Zhao,et al.  Key-frame extraction and shot retrieval using nearest feature line (NFL) , 2000, MULTIMEDIA '00.

[18]  Tomaso A. Poggio,et al.  Example-Based Object Detection in Images by Components , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  Paul L. Rosin Techniques for Assessing Polygonal Approximations of Curves , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[21]  Daniel Thalmann,et al.  Key-posture extraction out of human motion data , 2001, 2001 Conference Proceedings of the 23rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[22]  Takeo Kanade,et al.  Object Detection Using the Statistics of Parts , 2004, International Journal of Computer Vision.

[23]  Ying Li,et al.  An Overview of Video Abstraction Techniques , 2001 .

[24]  Chun-Jen Chen,et al.  A linear-time component-labeling algorithm using contour tracing technique , 2004, Comput. Vis. Image Underst..

[25]  Massimo Piccardi,et al.  Background subtraction techniques: a review , 2004, 2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583).

[26]  Wayne H. Wolf,et al.  Key frame selection by motion analysis , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.