Video shot boundary detection and key-frame extraction using mathematical models. (Détection des changements de plans et extraction d'images représentatives dans une séquence vidéo)

With the recent advancement in multimedia technologies, in conjunction with the rapid increase of the volume of digital video data and the growth of internet ; it has becom mandatory to have the hability browse and search through information stored in large multimedia databases. For this purpose, content based video retrieval (CBVR) has become an active area of research durinf the last decade. The objective of this thesis is to present applications for temporal video segmentation and video retrieval based on different mathematical models. A shot is considered as the elementary unit of a video, and is defined as a continuous sequence of frames taken from a single camera, representing an action during time. The different types of transitions that may occur in a video sequence are categorized into : abrupt and gradual transition. In this work, through statistical analysis, we segment a video into its constituent units. This is achieved by identifying transitions between adjacent shots. The first proposed algorithm aims to detect abrupt shot transitions only by measuring the similarity between consecutive frames. Given the size of the vector containing distances, it can be modeled by a log normal distribution since all the values are positive. Gradual shot transition identification is a more difficult task when compared to cut detection. Generally, a gradual transition may share similar characteristics as a dynamic segment with camera or object motion. In this work, singular value decomposition (SVD) is performed to project features from the spatial domain to the singular space. Resulting features are reduced and more refined, which makes the remaining tasks easier. The proposed system, designed for detecting both abrupt and gradual transitions, has lead to reliable performances achieving high detection rates. In addition, the acceptable computational time allows to process in real time. Once a video is partitioned into its elementary units, high-level applications can be processed, such as the key-frame extraction. Selecting representative frames from each shot to form a storyboard is considered as a static and local video summarization. In our research, we opted for a global method based on local extraction. Using refined centrist features from the singular space, we select representative frames using modified k-means clustering based on important scenes. This leads to catch pertinent frames without redoudancy in the final storyboard.

[1]  James M. Rehg,et al.  CENTRIST: A Visual Descriptor for Scene Categorization , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Atreyi Kankanhalli,et al.  Automatic partitioning of full-motion video , 1993, Multimedia Systems.

[3]  S. Domnic,et al.  Video cut detection using block based histogram differences in RGB color space , 2010, 2010 International Conference on Signal and Image Processing.

[4]  De Xu,et al.  Hierarchical Video Summarization Based on Video Structure and Highlight , 2006, SSPR/SPR.

[5]  Paul Over,et al.  Video shot boundary detection: Seven years of TRECVid activity , 2010, Comput. Vis. Image Underst..

[6]  Minerva M. Yeung,et al.  Efficient matching and clustering of video shots , 1995, Proceedings., International Conference on Image Processing.

[7]  Ananda S. Chowdhury,et al.  Video key frame extraction through dynamic Delaunay clustering with a structural constraint , 2013, J. Vis. Commun. Image Represent..

[8]  Sung Wook Baik,et al.  Adaptive key frame extraction for video summarization using an aggregation mechanism , 2012, J. Vis. Commun. Image Represent..

[9]  Nilesh V. Patel,et al.  Compressed Video Processing for Cut Detection , 1996 .

[10]  Donald A. Adjeroh,et al.  Adaptive Edge-Oriented Shot Boundary Detection , 2009, EURASIP J. Image Video Process..

[11]  Danny Crookes,et al.  Hierarchical video summarization in reference subspace , 2009, IEEE Transactions on Consumer Electronics.

[12]  Y.-N. Li,et al.  Fast video shot boundary detection framework employing pre-processing techniques , 2009, IET Image Process..

[13]  Bede Liu,et al.  Temporal segmentation of video using frame and histogram space , 2000, IEEE Transactions on Multimedia.

[14]  Luc Van Gool,et al.  SURF: Speeded Up Robust Features , 2006, ECCV.

[15]  Sung Wook Baik,et al.  Efficient visual attention based framework for extracting key frames from videos , 2013, Signal Process. Image Commun..

[16]  Guoliang Fan,et al.  Combined key-frame extraction and object-based video segmentation , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[17]  Yueting Zhuang,et al.  Adaptive key frame extraction using unsupervised clustering , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[18]  Paul Over,et al.  Evaluation campaigns and TRECVid , 2006, MIR '06.

[19]  Ioannis Pitas,et al.  Video shot-boundary detection using singular-value decomposition and statistical tests , 2007, J. Electronic Imaging.

[20]  Rainer Lienhart,et al.  Comparison of automatic shot boundary detection algorithms , 1998, Electronic Imaging.

[21]  Adnan M. Alattar Detecting and compressing dissolve regions in video sequences with a DVI multimedia image compression algorithm , 1993, 1993 IEEE International Symposium on Circuits and Systems.

[22]  Tie-Yan Liu,et al.  Shot reconstruction degree: a novel criterion for key frame selection , 2004, Pattern Recognit. Lett..

[23]  Dong-Sik Jang,et al.  Gradual shot boundary detection using localized edge blocks , 2006, Multimedia Tools and Applications.

[24]  Yiheng Cai,et al.  Saliency based Wireless Capsule Endoscopy video abstract , 2016, 2016 9th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI).

[25]  Petros Maragos,et al.  Video event detection and summarization using audio, visual and text saliency , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[26]  Ramin Zabih,et al.  A feature-based algorithm for detecting and classifying scene breaks , 1995, MULTIMEDIA '95.

[27]  Charles A. Bouman,et al.  ViBE: a compressed video database structured for active browsing and search , 2004, IEEE Transactions on Multimedia.

[28]  Aggelos K. Katsaggelos,et al.  Rate-distortion optimal video summary generation , 2005, IEEE Transactions on Image Processing.

[29]  Boon-Lock Yeo,et al.  Rapid scene analysis on compressed video , 1995, IEEE Trans. Circuits Syst. Video Technol..

[30]  Arding Hsu,et al.  Feature management for large video databases , 1993, Electronic Imaging.

[31]  Masaharu Ogawa,et al.  A highlight scene detection and video summarization system using audio feature for a personal video recorder , 2005, IEEE Transactions on Consumer Electronics.

[32]  Riccardo Leonardi,et al.  Scene break detection: a comparison , 1998, Proceedings Eighth International Workshop on Research Issues in Data Engineering. Continuous-Media Databases and Applications.

[33]  Yue Gao,et al.  A video summarization tool using two-level redundancy detection for personal video recorders , 2008, IEEE Transactions on Consumer Electronics.

[34]  Jun Yu,et al.  An efficient method for scene cut detection , 2001, Pattern Recognit. Lett..

[35]  André Zaccarin,et al.  A system for reliable dissolve detection in videos , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[36]  Irena Koprinska,et al.  Temporal video segmentation: A survey , 2001, Signal Process. Image Commun..

[37]  Guoliang Fan,et al.  Joint Key-Frame Extraction and Object Segmentation for Content-Based Video Analysis , 2006, IEEE Transactions on Circuits and Systems for Video Technology.

[38]  Takafumi Miyatake,et al.  IMPACT: an interactive natural-motion-picture dedicated multimedia authoring system , 1991, CHI.

[39]  Bo Zhang,et al.  A novel shot boundary detection framework , 2005, Visual Communications and Image Processing.

[40]  Ramesh C. Jain,et al.  Production model based digital video segmentation , 1995, Multimedia Tools and Applications.

[41]  Bo Zhang,et al.  A Formal Study of Shot Boundary Detection , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[42]  Kunal Roy,et al.  Key Frame Extraction and Foreground Modelling Using K-Means Clustering , 2015, 2015 7th International Conference on Computational Intelligence, Communication Systems and Networks.

[43]  Yoshinobu Tonomura,et al.  Video browsing using brightness data , 1991, Other Conferences.

[44]  Alan Hanjalic,et al.  TU Delft at TRECVID 2005: Shot Boundary Detection , 2005, TRECVID.

[45]  Amit K. Roy-Chowdhury,et al.  Embedded sparse coding for summarizing multi-view videos , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[46]  Gene H. Golub,et al.  Matrix computations (3rd ed.) , 1996 .

[47]  Yihong Gong Summarizing Audiovisual Contents of a Video Program , 2003, EURASIP J. Adv. Signal Process..

[48]  Jeho Nam,et al.  Detection of gradual transitions in video sequences using B-spline interpolation , 2005, IEEE Transactions on Multimedia.

[49]  Zygmunt Pizlo,et al.  Automated video program summarization using speech transcripts , 2006, IEEE Transactions on Multimedia.

[50]  Hong Heather Yu,et al.  A Hierarchical Multiresolution Video Shot Transition Detection Scheme , 1999, Comput. Vis. Image Underst..

[51]  Marco Pellegrini,et al.  STIMO: STIll and MOving video storyboard for the web scenario , 2009, Multimedia Tools and Applications.

[52]  Arnaldo de Albuquerque Araújo,et al.  VSUMM: A mechanism designed to produce static video summaries and a novel evaluation method , 2011, Pattern Recognit. Lett..

[53]  Fernando Pereira,et al.  Automatic video summarization based on MPEG-7 descriptions , 2004, Signal Process. Image Commun..

[54]  Alan Hanjalic,et al.  An integrated scheme for automated video abstraction based on unsupervised cluster-validity analysis , 1999, IEEE Trans. Circuits Syst. Video Technol..

[55]  Jiebo Luo,et al.  Towards Scalable Summarization of Consumer Videos Via Sparse Dictionary Selection , 2012, IEEE Transactions on Multimedia.

[56]  G. Srinivasan,et al.  SHOT BOUNDARY DETECTION: AN IMPROVED ALGORITHM , 2013 .

[57]  Prosenjit Bose,et al.  Feature-based cut detection with automatic threshold selection , 2006, TRECVID.

[58]  Behzad Shahraray,et al.  Automatic generation of pictorial transcripts of video programs , 1995, Electronic Imaging.

[59]  George Ghinea,et al.  User-centred personalised video abstraction approach adopting SIFT features , 2015, Multimedia Tools and Applications.

[60]  A. Murat Tekalp,et al.  Two-stage hierarchical video summary extraction to match low-level user browsing preferences , 2003, IEEE Trans. Multim..

[61]  Youssef Hadi,et al.  Video summarization by k-medoid clustering , 2006, SAC '06.

[62]  Gregory L. Zick,et al.  Scene decomposition of MPEG-compressed video , 1995, Electronic Imaging.

[63]  Driss Aboutajdine,et al.  Video cut detection method based on a 2 D luminance histogram using an appropriate threshold and a post processing , 2015 .

[64]  Shaohui Mei,et al.  Video summarization via minimum sparse reconstruction , 2015, Pattern Recognit..

[65]  Rainer Lienhart,et al.  Reliable dissolve detection , 2001, IS&T/SPIE Electronic Imaging.

[66]  Regunathan Radhakrishnan,et al.  Motion activity-based extraction of key-frames from video shots , 2002, Proceedings. International Conference on Image Processing.

[67]  Yong Shi,et al.  Fast Video Shot Boundary Detection Based on SVD and Pattern Matching , 2013, IEEE Transactions on Image Processing.

[68]  Chong-Wah Ngo,et al.  Video partitioning by temporal slice coherency , 2001, IEEE Trans. Circuits Syst. Video Technol..

[69]  Shohreh Kasaei,et al.  Event Detection and Summarization in Soccer Videos Using Bayesian Network and Copula , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[70]  Xin Liu,et al.  Video summarization using singular value decomposition , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[71]  Kebin Jia,et al.  Video Key Frame Extraction Based on Spatial-Temporal Color Distribution , 2008, 2008 International Conference on Intelligent Information Hiding and Multimedia Signal Processing.

[72]  Jurandy Almeida,et al.  VISON: VIdeo Summarization for ONline applications , 2012, Pattern Recognit. Lett..

[73]  Bhabatosh Chanda,et al.  A Model-Based Shot Boundary Detection Technique Using Frame Transition Parameters , 2012, IEEE Transactions on Multimedia.

[74]  Shih-Fu Chang,et al.  Scene change detection in an MPEG-compressed video sequence , 1995, Electronic Imaging.

[75]  Adnan M. Alattar Detecting fade regions in uncompressed video sequences , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[76]  Ullas Gargi,et al.  Performance characterization of video-shot-change detection methods , 2000, IEEE Trans. Circuits Syst. Video Technol..

[77]  S. Domnic,et al.  Walsh–Hadamard Transform Kernel-Based Feature Vector for Shot Boundary Detection , 2014, IEEE Transactions on Image Processing.

[78]  Seong-Dae Kim,et al.  Iterative key frame selection in the rate-constraint environment , 2003, Signal Process. Image Commun..

[79]  Janko Calic,et al.  Efficient key-frame extraction and video analysis , 2002, Proceedings. International Conference on Information Technology: Coding and Computing.

[80]  Akio Nagasaka,et al.  Automatic Video Indexing and Full-Video Search for Object Appearances , 1991, VDB.

[81]  Junsong Yuan,et al.  Sparse reconstruction cost for abnormal event detection , 2011, CVPR 2011.

[82]  Stefanos D. Kollias,et al.  A fuzzy video content representation for video summarization and content-based retrieval , 2000, Signal Process..

[83]  Alan Hanjalic,et al.  Shot-boundary detection: unraveled and resolved? , 2002, IEEE Trans. Circuits Syst. Video Technol..

[84]  Georgios Tziritas,et al.  Equivalent Key Frames Selection Based on Iso-Content Principles , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[85]  Alvaro Pardo Simple and Robust Hard Cut Detection Using Interframe Differences , 2005, CIARP.

[86]  Hamidreza Rashidy Kanan,et al.  AVCD-FRA: A novel solution to automatic video cut detection using fuzzy-rule-based approach , 2013, Comput. Vis. Image Underst..

[87]  Behzad Shahraray,et al.  Scene change detection and content-based sampling of video sequences , 1995, Electronic Imaging.

[88]  Driss Aboutajdine,et al.  Shot boundary detection via adaptive low rank and svd-updating , 2017, Comput. Vis. Image Underst..

[89]  Shuang Wu,et al.  Hierarchical video summarization with loitering indication , 2015, 2015 Visual Communications and Image Processing (VCIP).

[90]  Driss Aboutajdine,et al.  Video shot boundary detection method using histogram differences and local image descriptor , 2014, 2014 Second World Conference on Complex Systems (WCCS).

[91]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[92]  Chong-Wah Ngo,et al.  Video summarization and scene detection by graph modeling , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[93]  Andreas Girgensohn,et al.  Time-Constrained Keyframe Selection Technique , 2004, Multimedia Tools and Applications.

[94]  John S. Boreczky,et al.  Comparison of video shot boundary detection techniques , 1996, Electronic Imaging.

[95]  Stephen W. Smoliar,et al.  Video parsing and browsing using compressed data , 1995, Multimedia Tools and Applications.

[96]  Ba Tu Truong,et al.  Video abstraction: A systematic review and classification , 2007, TOMCCAP.

[97]  Nilesh V. Patel,et al.  Statistical approach to scene change detection , 1995, Electronic Imaging.

[98]  Sarah V. Porter,et al.  Video Segmentation and Indexing using Motion Estimation , 2004 .

[99]  Raimondo Schettini,et al.  Supervised and unsupervised classification post-processing for visual video summaries , 2006, IEEE Transactions on Consumer Electronics.

[100]  Ramesh C. Jain,et al.  Digital video segmentation , 1994, MULTIMEDIA '94.

[101]  Anil K. Jain Data clustering: 50 years beyond K-means , 2008, Pattern Recognit. Lett..

[102]  Warnakulasuriya Anil Chandana Fernando,et al.  Fade-in and fade-out detection in video sequences using histograms , 2000, 2000 IEEE International Symposium on Circuits and Systems. Emerging Technologies for the 21st Century. Proceedings (IEEE Cat No.00CH36353).

[103]  Sang Uk Lee,et al.  Efficient video indexing scheme for content-based retrieval , 1999, IEEE Trans. Circuits Syst. Video Technol..

[104]  Driss Aboutajdine,et al.  Video Cut Detector via Adaptive Features using the Frobenius Norm , 2016, ISVC.

[105]  Özgür Ulusoy,et al.  Fuzzy color histogram-based video segmentation , 2010, Comput. Vis. Image Underst..

[106]  Yelena Yesha,et al.  Keyframe-based video summarization using Delaunay clustering , 2006, International Journal on Digital Libraries.

[107]  Pablo César,et al.  Interaction design for online video and television , 2014, CHI Extended Abstracts.

[108]  Moon-Chuen Lee,et al.  Effective Detection of Various Wipe Transitions , 2007, IEEE Transactions on Circuits and Systems for Video Technology.