Semantic Extraction of Basketball Game Video Combining Domain Knowledge and In-Depth Features

The team sports game video features complex background, fast target movement, and mutual occlusion between targets, which poses great challenges to multiperson collaborative video analysis. This paper proposes a video semantic extraction method that integrates domain knowledge and in-depth features, which can be applied to the analysis of a multiperson collaborative basketball game video, where the semantic event is modeled as an adversarial relationship between two teams of players. We first designed a scheme that combines a dual-stream network and learnable spatiotemporal feature aggregation, which can be used for end-to-end training of video semantic extraction to bridge the gap between low-level features and high-level semantic events. Then, an algorithm based on the knowledge from different video sources is proposed to extract the action semantics. The algorithm gathers local convolutional features in the entire space-time range, which can be used to track the ball/shooter/hoop to realize automatic semantic extraction of basketball game videos. Experiments show that the scheme proposed in this paper can effectively identify the four categories of short, medium, long, free throw, and scoring events and the semantics of athletes’ actions based on the video footage of the basketball game.

[1]  Andreas Kamilaris,et al.  Deep learning in agriculture: A survey , 2018, Comput. Electron. Agric..

[2]  G. House,et al.  Future Directions in Sports-Related Concussion Management. , 2021, Clinics in sports medicine.

[3]  Qi Wang,et al.  Fusing Motion Patterns and Key Visual Information for Semantic Event Recognition in Basketball Videos , 2020, Neurocomputing.

[4]  이현주 Q. , 2005 .

[5]  Li Chen,et al.  Analysis of technical features in basketball video based on deep learning algorithm , 2020, Signal Process. Image Commun..

[6]  Dongbing Gu,et al.  Indoor Relocalization in Challenging Environments With Dual-Stream Convolutional Neural Networks , 2018, IEEE Transactions on Automation Science and Engineering.

[7]  Meng Jian,et al.  Ontology-Based Global and Collective Motion Patterns for Event Classification in Basketball Videos , 2020, IEEE Transactions on Circuits and Systems for Video Technology.

[8]  P. Alam ‘A’ , 2021, Composites Engineering: An A–Z Guide.

[9]  Ram Gopal Raj,et al.  Spotting Football Events Using Two-Stream Convolutional Neural Network and Dilated Recurrent Neural Network , 2021, IEEE Access.

[10]  Rytis Maskeliūnas,et al.  Recognition of basketball referee signals from real-time videos , 2020, J. Ambient Intell. Humaniz. Comput..

[11]  Yang Yang,et al.  Research on sports action training method based on generative confrontation network model and artificial intelligence , 2021 .

[12]  P. Alam ‘G’ , 2021, Composites Engineering: An A–Z Guide.

[13]  Long Liu,et al.  Objects detection toward complicated high remote basketball sports by leveraging deep CNN architecture , 2021, Future Gener. Comput. Syst..

[14]  Alexander S. Ecker,et al.  The temporal structure of the inner retina at a single glance , 2019, Scientific Reports.

[15]  Gun Jun-jun,et al.  Basketball action recognition based on FPGA and particle image , 2021, Microprocess. Microsystems.

[16]  Hayit Greenspan,et al.  X-ray Categorization and Retrieval on the Organ and Pathology Level, Using Patch-Based Visual Words , 2011, IEEE Transactions on Medical Imaging.

[17]  Yi-Ping Phoebe Chen,et al.  Knowledge-Discounted Event Detection in Sports Video , 2010, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[18]  Abhinav Gupta,et al.  ActionVLAD: Learning Spatio-Temporal Aggregation for Action Classification , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[19]  P. Alam,et al.  R , 1823, The Herodotus Encyclopedia.

[20]  G. Nagarajan,et al.  Multimodal Fuzzy Ontology Creation and Knowledge Information Retrieval , 2016 .

[21]  Wu Liu,et al.  Deep learning based basketball video analysis for intelligent arena application , 2017, Multimedia Tools and Applications.

[22]  László Havasi,et al.  Using location and motion statistics for the localization of moving objects in multiple camera surveillance videos , 2009, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops.

[23]  Ali Javed,et al.  Shot Classification of Field Sports Videos Using AlexNet Convolutional Neural Network , 2019, Applied Sciences.

[24]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[25]  Xinyang Xu,et al.  Application of Artificial Intelligence in Basketball Sport , 2021, Journal of Education, Health and Sport.

[26]  Nurul Fathiah Ghazali,et al.  Hockey activity recognition using pre-trained deep learning model , 2020, ICT Express.

[27]  Towards Real-Time Detection and Tracking of Basketball Players using Deep Neural Networks , 2017 .

[28]  James J. Little,et al.  Lightweight convolutional neural networks for player detection and classification , 2018, Comput. Vis. Image Underst..

[29]  Ling-Hwei Chen,et al.  Novel framework for sports video analysis: A basketball case study , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[30]  Limin Wang,et al.  Action recognition with trajectory-pooled deep-convolutional descriptors , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[31]  Tsuyoshi Murata,et al.  {m , 1934, ACML.

[32]  Pichao Wang,et al.  A Review of Dynamic Maps for 3D Human Motion Recognition Using ConvNets and Its Improvement , 2020, Neural Processing Letters.

[33]  Gyu Sang Choi,et al.  Scene Classification for Sports Video Summarization Using Transfer Learning , 2020, Sensors.

[34]  Kun Zhang,et al.  Multiple player tracking in basketball court videos , 2020, Journal of Real-Time Image Processing.

[35]  J. Gonzalez-Jimenez,et al.  Differences in movement limitations in different low back pain severity in functional tests using an RGB-D camera. , 2020, Journal of biomechanics.

[36]  Henry Carrillo,et al.  As Seen on TV: Automatic Basketball Video Production using Gaussian-based Actionness and Game States Recognition , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[37]  P. Alam ‘S’ , 2021, Composites Engineering: An A–Z Guide.

[38]  Alexander S. Ecker,et al.  The temporal structure of the inner retina at a single glance , 2019, Scientific Reports.

[39]  N. A. Rahmad,et al.  Badminton player detection using faster region convolutional neural network , 2019 .

[40]  Jianbo Shi,et al.  Predicting Behaviors of Basketball Players from First Person Videos , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  Yiannis Andreopoulos,et al.  Video Classification With CNNs: Using the Codec as a Spatio-Temporal Activity Sensor , 2019, IEEE Transactions on Circuits and Systems for Video Technology.

[42]  Cordelia Schmid,et al.  PoTion: Pose MoTion Representation for Action Recognition , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[43]  Xu-Bo Fu,et al.  Camera-based Basketball Scoring Detection Using Convolutional Neural Network , 2020, International Journal of Automation and Computing.