Automatic analysis of complex athlete techniques in broadcast taekwondo video

Athlete detection and action recognition in sports video is a very challenging task due to the dynamic and cluttered background. Several attempts for automatic analysis focus on athletes in many sports videos have been made. However, taekwondo video analysis remains an unstudied field. In light of this, a novel framework for automatic techniques analysis in broadcast taekwondo video is proposed in this paper. For an input video, in the first stage, athlete tracking and body segmentation are done through a modified Structure Preserving Object Tracker. In the second stage, the de-noised frames which completely contain the body of analyzed athlete from video sequence, are trained by a deep learning network PCANet to predict the athlete action of each single frame. As one technique is composed of many consecutive actions and each action corresponds a video frame, focusing on video sequences to achieve techniques analysis makes sense. In the last stage, linear SVM is used with the predicted action frames to get a techniques classifier. To evaluate the performance of the proposed framework, extensive experiments on real broadcast taekwondo video dataset are provided. The results show that the proposed method achieves state-of-the-art results for complex techniques analysis in taekwondo video.

[1]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[2]  Josef Kittler,et al.  Transductive transfer learning for action recognition in tennis games , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[3]  Ana Gabriela Maguitman,et al.  Action Recognition in Tennis Videos using Optical Flow and Conditional Random Fields , 2013 .

[4]  Lu Zhang,et al.  Structure Preserving Object Tracking , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Tao Liu,et al.  Gait Analysis Using Wearable Sensors , 2012, Sensors.

[6]  Ling Shao,et al.  Embedding Motion and Structure Features for Action Recognition , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[7]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[8]  Noboru Babaguchi,et al.  Generating Semantic Descriptions of Broadcasted Sports Videos Based on Structures of Sports Games and TV Programs , 2004, Multimedia Tools and Applications.

[9]  Shu-Yuan Chen,et al.  Automatic Broadcast Soccer Video Analysis, Player Detection, and Tracking Based on Color Histogram , 2013 .

[10]  Darwin Gouwanda,et al.  Emerging Trends of Body-Mounted Sensors in Sports and Human Gait Analysis , 2008 .

[11]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Yongdong Zhang,et al.  Automatic Video-based Analysis of Athlete Action , 2007, 14th International Conference on Image Analysis and Processing (ICIAP 2007).

[13]  Tao Mei,et al.  Structure and event mining in sports video with efficient mosaic , 2008, Multimedia Tools and Applications.

[14]  Shihong Lao,et al.  Multiple Player Tracking in Sports Video: A Dual-Mode Two-Way Bayesian Inference Approach With Progressive Observation Modeling , 2011, IEEE Transactions on Image Processing.

[15]  Theo Gevers,et al.  Evaluation of Color STIPs for Human Action Recognition , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Noboru Babaguchi,et al.  A new spatio-temporal method for event detection and personalized retrieval of sports video , 2010, Multimedia Tools and Applications.

[17]  Carol Livermore,et al.  COMPACT, SCALABLE, HIGH-RESOLUTION, MEMS-ENABLED TACTILE DISPLAYS , 2014 .

[18]  H Ghasemzadeh,et al.  Coordination Analysis of Human Movements With Body Sensor Networks: A Signal Processing Model to Evaluate Baseball Swings , 2011, IEEE Sensors Journal.

[19]  Thomas Hofmann,et al.  Large Margin Methods for Structured and Interdependent Output Variables , 2005, J. Mach. Learn. Res..

[20]  William J. Christmas,et al.  Gesture spotting for low-resolution sports video annotation , 2008, Pattern Recognit..

[21]  Carol Livermore,et al.  Scalable, MEMS-enabled, vibrational tactile actuators for high resolution tactile displays , 2014 .

[22]  Jiwen Lu,et al.  PCANet: A Simple Deep Learning Baseline for Image Classification? , 2014, IEEE Transactions on Image Processing.

[23]  Andrew Zisserman,et al.  Convolutional Two-Stream Network Fusion for Video Action Recognition , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24]  Junqing Yu,et al.  Affection arousal based highlight extraction for soccer video , 2013, Multimedia Tools and Applications.

[25]  M. Kalaiselvi Geetha,et al.  An Efficient Ball and Player Detection in Broadcast Tennis Video , 2016 .

[26]  Xueming Qian,et al.  HMM based soccer video event detection using enhanced mid-level semantic , 2011, Multimedia Tools and Applications.

[27]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[28]  Coskun Bayrak,et al.  Sports video summarization based on motion analysis , 2013, Comput. Electr. Eng..

[29]  Hadi Seyedarabi,et al.  Pose estimation of soccer players using multiple uncalibrated cameras , 2015, Multimedia Tools and Applications.

[30]  Ling-Hwei Chen,et al.  A novel method for slow motion replay detection in broadcast basketball video , 2015, Multimedia Tools and Applications.

[31]  Wei-Ta Chu,et al.  Explicit semantic events detection and development of realistic applications for broadcasting baseball videos , 2008, Multimedia Tools and Applications.

[32]  Greg Mori,et al.  Action recognition by learning mid-level motion features , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[33]  Koby Crammer,et al.  Online Passive-Aggressive Algorithms , 2003, J. Mach. Learn. Res..

[34]  Changsheng Xu,et al.  A Novel Framework for Semantic Annotation and Personalized Retrieval of Sports Video , 2008, IEEE Transactions on Multimedia.

[35]  Aboul Ella Hassanien,et al.  Machine Learning-Based Soccer Video Summarization System , 2011, FGIT-MulGraB.

[36]  Yongdong Zhang,et al.  Automatic Detection and Analysis of Player Action in Moving Background Sports Video Sequences , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[37]  Yanxi Liu,et al.  Tracking Sports Players with Context-Conditioned Motion Models , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[38]  Shitala Prasad,et al.  Sports Video Summarization using Priority Curve Algorithm , 2010 .

[39]  Nazli Ikizler-Cinbis,et al.  Action Recognition and Localization by Hierarchical Space-Time Segments , 2013, 2013 IEEE International Conference on Computer Vision.

[40]  Lu Zhang,et al.  Preserving Structure in Model-Free Tracking , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[41]  Paul B Gastin,et al.  Quantification of tackling demands in professional Australian football using integrated wearable athlete tracking technology. , 2013, Journal of science and medicine in sport.