Key frame extraction based on global motion statistics for team-sport videos

Key frame extraction is an important manner of video summarization. It can be used to interpret video content quickly. Existing approaches first partition the entire video into video clips by shot boundary detection, and then, extract key frames by frame clustering. However, in most team-sport videos, a video clip usually includes many events, and it is difficult to extract the key frames related to all of these events accurately, because different events of a game shot can have features of similar appearance. As is well known, most events in team-sport videos are attack and defense conversions, which are related to global translation. Therefore, by using fine-grained partition based on the global motion, a shot could be further partitioned into more video clips, from which more key frames could be extracted and they are related to the events. In this study, global horizontal motion is introduced to further partition video clips into fine-grained video clips. Furthermore, global motion statistics are utilized to extract candidate key frames. Finally, the representative key frames are extracted based on the spatial–temporal consistence and hierarchical clustering, and the redundant frames are removed. A dataset called SportKF is built, which includes 25 videos of 197,878 frames in 112 min and 764 key frames from four types of sports (basketball, football, American football and field hockey). The experimental results demonstrate that the proposed scheme achieves state-of-the-art performance by introducing global motion statistics.

[1]  Zhong Qu,et al.  An Improved Algorithm of Keyframe Extraction for Video Summarization , 2011 .

[2]  Damminda Alahakoon,et al.  Interest-Oriented Video Summarization with Keyframe Extraction , 2019, 2019 19th International Conference on Advances in ICT for Emerging Regions (ICTer).

[3]  Ying Chen,et al.  Indexing and Matching of Video Shots Based on Motion and Color Analysis , 2006, 2006 9th International Conference on Control, Automation, Robotics and Vision.

[4]  Krishan Kumar,et al.  F-DES: Fast and Deep Event Summarization , 2017, IEEE Transactions on Multimedia.

[5]  Yueting Zhuang,et al.  Adaptive key frame extraction using unsupervised clustering , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[6]  Deepti D. Shrimankar,et al.  Equal Partition Based Clustering Approach for Event Summarization in Videos , 2016, 2016 12th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS).

[7]  Jan Kautz,et al.  PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[8]  Abdolreza Mirzaei,et al.  An information theoretic approach to hierarchical clustering combination , 2015, Neurocomputing.

[9]  Thomas Brox,et al.  FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Ioannis Pitas,et al.  Information theory-based shot cut/fade detection and video summarization , 2006, IEEE Transactions on Circuits and Systems for Video Technology.

[11]  Priyanka Sharma,et al.  Survey of Compressed Domain Video Summarization Techniques , 2019, ACM Comput. Surv..

[12]  Bingbing Ni,et al.  Unsupervised Deep Learning for Optical Flow Estimation , 2017, AAAI.

[13]  Shih-Fu Chang,et al.  Real-time personalized sports video filtering and summarization , 2001, MULTIMEDIA '01.

[14]  Cheng Huang,et al.  A Novel Key-Frames Selection Framework for Comprehensive Video Summarization , 2020, IEEE Transactions on Circuits and Systems for Video Technology.

[15]  Krishan Kumar,et al.  EVS-DK: Event video skimming using deep keyframe , 2019, J. Vis. Commun. Image Represent..

[16]  Alberto Del Bimbo,et al.  Submitted to Ieee Transactions on Cybernetics 1 3d Human Action Recognition by Shape Analysis of Motion Trajectories on Riemannian Manifold , 2022 .

[17]  Dong Liu,et al.  Encoding Concept Prototypes for Video Event Detection and Summarization , 2015, ICMR.

[18]  Qi Wang,et al.  Fusing Motion Patterns and Key Visual Information for Semantic Event Recognition in Basketball Videos , 2020, Neurocomputing.

[19]  Huaijiang Sun,et al.  Nonconvex Low-Rank Kernel Sparse Subspace Learning for Keyframe Extraction and Motion Segmentation , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[20]  Stefanos D. Kollias,et al.  A stochastic framework for optimal key frame extraction from MPEG video databases , 1999, 1999 IEEE Third Workshop on Multimedia Signal Processing (Cat. No.99TH8451).

[21]  Tao Zhou,et al.  Hierarchical Clustering Supported by Reciprocal Nearest Neighbors , 2019, Inf. Sci..

[22]  Thangaswamy Judi Vennila,et al.  A Stochastic Framework for Keyframe Extraction , 2020, 2020 International Conference on Emerging Trends in Information Technology and Engineering (ic-ETITE).

[23]  Deepti D. Shrimankar,et al.  V-LESS: A Video from Linear Event Summaries , 2017, CVIP.

[24]  Navjot Singh,et al.  Event BAGGING: A novel event summarization approach in multiview surveillance videos , 2017, 2017 International Conference on Innovations in Electronics, Signal Processing and Communication (IESC).

[25]  Deepti D. Shrimankar,et al.  Eratosthenes sieve based key-frame extraction technique for event summarization in videos , 2018, Multimedia Tools and Applications.

[26]  Petros Maragos,et al.  Video event detection and summarization using audio, visual and text saliency , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[27]  Arnaldo de Albuquerque Araújo,et al.  VSUMM: A mechanism designed to produce static video summaries and a novel evaluation method , 2011, Pattern Recognit. Lett..

[28]  Kunal Roy,et al.  Key Frame Extraction and Foreground Modelling Using K-Means Clustering , 2015, 2015 7th International Conference on Computational Intelligence, Communication Systems and Networks.

[29]  Coskun Bayrak,et al.  Sports video summarization based on motion analysis , 2013, Comput. Electr. Eng..

[30]  Aristidis Likas,et al.  Weighted multi-view key-frame extraction , 2016, Pattern Recognit. Lett..

[31]  Mateu Sbert,et al.  Tsallis entropy-based information measures for shot boundary detection and keyframe selection , 2013, Signal Image Video Process..

[32]  Feng-Li Lian,et al.  Data reduction based on keyframe with motion energy extraction rules , 2014, 2014 IEEE International Conference on Information and Automation (ICIA).

[33]  Om Prakash,et al.  Key Frame Extraction using Uniform Local Binary Pattern , 2018, 2018 Second International Conference on Advances in Computing, Control and Communication Technology (IAC3T).

[34]  Richard Szeliski,et al.  A Database and Evaluation Methodology for Optical Flow , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[35]  Shih-Fu Chang,et al.  Algorithms and system for segmentation and structure analysis in soccer video , 2001, IEEE International Conference on Multimedia and Expo, 2001. ICME 2001..

[36]  Ezzeddine Zagrouba,et al.  Key frames extraction using graph modularity clustering for efficient video summarization , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[37]  Deepti D. Shrimankar,et al.  Deep Event Learning boosT-up Approach: DELTA , 2018, Multimedia Tools and Applications.

[38]  Antonio Bandera,et al.  Spatio-temporal feature-based keyframe detection from video shots using spectral clustering , 2013, Pattern Recognit. Lett..

[39]  Francisco Siles Canales,et al.  Evaluation of Different Histogram Distances for Temporal Segmentation in Digital Videos of Football Matches from TV Broadcast , 2017, 2017 International Conference and Workshop on Bioinspired Intelligence (IWOBI).

[40]  Navjot Singh,et al.  Key-Lectures: Keyframes Extraction in Video Lectures , 2018, Advances in Intelligent Systems and Computing.

[41]  Lifang Wu,et al.  Two Stage Shot Boundary Detection via Feature Fusion and Spatial-Temporal Convolutional Neural Networks , 2019, IEEE Access.

[42]  Tao Li,et al.  Key frame extraction based on improved frame blocks features and second extraction , 2015, 2015 12th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD).

[43]  Mohammed Javed,et al.  An efficient method for video shot boundary detection and keyframe extraction using SIFT-point distribution histogram , 2016, International Journal of Multimedia Information Retrieval.