Semantic annotation of soccer videos by visual instance clustering and spatial/temporal reasoning in ontologies

In this paper we present a framework for semantic annotation of soccer videos that exploits an ontology model referred to as Dynamic Pictorially Enriched Ontology, where the ontology, defined using OWL, includes both schema and data. Visual instances are used as matching references for the visual descriptors of the entities to be annotated. Three mechanisms are included to support effective annotation: visual instance clustering—to cluster instances of similar patterns, prototype selection—to select one or more visual representatives of each cluster, dynamic cluster updating—to update clusters and prototypes whenever new knowledge is presented to the ontology. Experimental results show the capability of performing semantic annotation of entities that exhibit a variety of complex changes in visual appearance or of events that show complex motion patterns in the same shot. SWRL rules are used to perform rule-based reasoning over both concepts and concept instances, to improve the quality of the annotation.

[1]  Min Chen,et al.  Video Semantic Event/Concept Detection Using a Subspace-Based Multimedia Data Mining Framework , 2008, IEEE Transactions on Multimedia.

[2]  Alberto Del Bimbo,et al.  Automatic detection of player's identity in soccer videos using faces and text cues , 2006, MM '06.

[3]  Alberto Del Bimbo,et al.  Semantic annotation of soccer videos: automatic highlights identification , 2003, Comput. Vis. Image Underst..

[4]  Marcel Worring,et al.  Multimedia event-based video indexing using time intervals , 2005, IEEE Transactions on Multimedia.

[5]  Milind R. Naphade,et al.  Classification of video events using 4-dimensional time-compressed motion features , 2007, CIVR '07.

[6]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[7]  Wen Gao,et al.  Jersey number detection in sports video for athlete identification , 2005, Visual Communications and Image Processing.

[8]  Shih-Fu Chang,et al.  Structure analysis of soccer video with domain knowledge and hidden Markov models , 2004, Pattern Recognit. Lett..

[9]  Lao Songyang Video Semantic Content Analysis Based on Ontology , 2009 .

[10]  Bernd Neumann,et al.  On scene interpretation with description logics , 2006, Image Vis. Comput..

[11]  Alberto Del Bimbo,et al.  Soccer highlights detection and recognition using HMMs , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.

[12]  A. Murat Tekalp,et al.  Automatic Soccer Video Analysis and Summarization , 2003, IS&T/SPIE Electronic Imaging.

[13]  Nicola Guarino,et al.  The Won-derWeb Library of Foundational Ontologies , 2002 .

[14]  Shamik Sural,et al.  Soccer video processing for the detection of advertisement billboards , 2008, Pattern Recognit. Lett..

[15]  M. Luo,et al.  Pyramidwise structuring for soccer highlight extraction , 2003, Fourth International Conference on Information, Communications and Signal Processing, 2003 and the Fourth Pacific Rim Conference on Multimedia. Proceedings of the 2003 Joint.

[16]  Marcel Worring,et al.  Multimodal Video Indexing : A Review of the State-ofthe-art , 2001 .

[17]  Steffen Staab,et al.  Knowledge Representation for Semantic Multimedia Content Analysis and Reasoning , 2004, EWIMT.

[18]  Weiming Zhang,et al.  A Semantic Event Detection Approach for Soccer Video based on Perception Concepts and Finiste State Machines , 2007, Eighth International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS '07).

[19]  John R. Smith,et al.  Large-scale concept ontology for multimedia , 2006, IEEE MultiMedia.

[20]  Changsheng Xu,et al.  A Novel Framework for Semantic Annotation and Personalized Retrieval of Sports Video , 2008, IEEE Transactions on Multimedia.

[21]  Tao Mei,et al.  Structure and event mining in sports video with efficient mosaic , 2008, Multimedia Tools and Applications.

[22]  Steffen Staab,et al.  An Ontology Infrastructure for Multimedia Reasoning , 2005, VLBV.

[23]  Alberto Del Bimbo,et al.  Improving the robustness of particle filter-based visual trackers using online parameter adaptation , 2007, 2007 IEEE Conference on Advanced Video and Signal Based Surveillance.

[24]  Sheng Tang,et al.  A statistical framework for replay detection in soccer video , 2008, 2008 IEEE International Symposium on Circuits and Systems.

[25]  Rita Cucchiara,et al.  Linear Transition Detection as a Unified Shot Detection Approach , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[26]  Yi Wu,et al.  Ontology-based multi-classification learning for video concept detection , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[27]  R. Dahyot,et al.  Browsing sports video: trends in sports-related indexing and retrieval work , 2006, IEEE Signal Processing Magazine.

[28]  Alberto Del Bimbo,et al.  Trademark matching and retrieval in sports video databases , 2007, MIR '07.

[29]  Nicola Guarino,et al.  WonderWeb Deliverable D17. The WonderWeb Library of Foundational Ontologies and the DOLCE ontology , 2002 .

[30]  Ramakant Nevatia,et al.  VERL: An Ontology Framework for Representing and Annotating Video Events , 2005, IEEE Multim..

[31]  Michael G. Strintzis,et al.  Knowledge-assisted semantic video object detection , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[32]  Akio Yamada,et al.  The MPEG-7 color layout descriptor: a compact image feature description for high-speed image/video segment retrieval , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[33]  Paul Buitelaar,et al.  Ontology-based Information Extraction with SOBA , 2006, LREC.

[34]  Noel E. O'Connor,et al.  Event detection in field sports video using audio-visual features and a support vector Machine , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[35]  Riccardo Leonardi,et al.  Semantic Indexing of Multimedia Documents , 2002, IEEE Multim..

[36]  Michael Wessel,et al.  Towards a Media Interpretation Framework for the Semantic Web , 2007 .

[37]  Ramanathan V. Guha,et al.  Building Large Knowledge-Based Systems: Representation and Inference in the Cyc Project , 1990 .

[38]  Ramesh C. Jain,et al.  Annotation of paintings with high-level semantic concepts using transductive inference and ontology-based concept disambiguation , 2007, ACM Multimedia.

[39]  Xinguo Yu,et al.  Current and Emerging Topics in Sports Video Processing , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[40]  Wei-Hao Lin,et al.  A Hybrid Approach to Improving Semantic Extraction of News Video , 2007 .

[41]  P. Gács,et al.  Algorithms , 1992 .

[42]  Chong-Wah Ngo,et al.  Ontology-enriched semantic space for video search , 2007, ACM Multimedia.

[43]  Tao Mei,et al.  Building a comprehensive ontology to refine video concept detection , 2007, MIR '07.

[44]  Alberto Del Bimbo,et al.  Soccer players identification based on visual local features , 2007, CIVR '07.

[45]  Shih-Fu Chang,et al.  Algorithms and system for segmentation and structure analysis in soccer video , 2001, IEEE International Conference on Multimedia and Expo, 2001. ICME 2001..

[46]  Chung-Lin Huang,et al.  Semantic analysis of soccer video using dynamic Bayesian network , 2006, IEEE Transactions on Multimedia.

[47]  Jia Liu,et al.  Automatic Player Detection, Labeling and Tracking in Broadcast Soccer Video , 2007, BMVC.

[48]  Chrisa Tsinaraki,et al.  Ontology-Based Semantic Indexing for MPEG-7 and TV-Anytime Audiovisual Content , 2005, Multimedia Tools and Applications.

[49]  Raymond Y. K. Lau,et al.  Using Information Filtering in Web Data Mining Process , 2007, IEEE/WIC/ACM International Conference on Web Intelligence (WI'07).

[50]  Behrang Q. Zadeh,et al.  A Framework for Temporal Content Modeling of Video Data Using an Ontological Infrastructure , 2006, SKG.

[51]  Nicola Guarino,et al.  The WonderWeb Library of Foundational Ontologies Preliminary Report , 2002 .

[52]  Marcel Worring,et al.  Adding Semantics to Detectors for Video Retrieval , 2007, IEEE Transactions on Multimedia.

[53]  Alberto Del Bimbo,et al.  Video Annotation with Pictorially Enriched Ontologies , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[54]  Ichiro Ide,et al.  An object detection method for describing soccer games from video , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.

[55]  Paul Over,et al.  Evaluation campaigns and TRECVid , 2006, MIR '06.

[56]  J.J. Leonard,et al.  Challenges for Autonomous Mobile Robots , 2007, International Machine Vision and Image Processing Conference (IMVIP 2007).