Rule induction for adaptive sport video characterization using MLN clause templates

The grounding of high-level semantic concepts is a key requirement of video annotation systems. Rule induction can thus constitute an invaluable intermediate step in characterizing protocol-governed domains, such as broadcast sports footage. We here set out a novel “clause grammar template” approach to the problem of rule-induction in video footage of court games that employs a second-order meta-grammar for Markov Logic Network construction. The aim is to build an adaptive system for sports video annotation capable, in principle, both of learning ab initio and also adaptively transferring learning between distinct rule domains. The method is tested with respect to both a simulated game predicate generator and also real data derived from tennis footage via computer-vision based approaches including HOG3D based player-action classification, Hough-transform-based court detection, and graph-theoretic ball-tracking. Experiments demonstrate that the method exhibits both error resilience and learning transfer in the court domain context. Moreover the clause template approach naturally generalizes to any suitably-constrained, protocol-governed video domain characterized by feature noise or detector error. keywords: Video Annotation, Markov processes, Stochastic Logic, Markov logic network (MLN), Action Recognition, Behavior discovery, Statistical Relational Reasoning

[1]  William J. Christmas,et al.  A Novel Data Association Algorithm for Object Tracking in Clutter with Application to Tennis Video Analysis , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[2]  Ronald Poppe,et al.  A survey on vision-based human action recognition , 2010, Image Vis. Comput..

[3]  Anil C. Kokaram,et al.  Classification and representation of semantic content in broadcast tennis videos , 2005, IEEE International Conference on Image Processing 2005.

[4]  Matthew Richardson,et al.  Markov logic networks , 2006, Machine Learning.

[5]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[6]  Alberto Del Bimbo,et al.  Video Annotation and Retrieval Using Ontologies and Rule Learning , 2010, IEEE MultiMedia.

[7]  Qi Tian,et al.  A unified framework for semantic shot classification in sports video , 2005, IEEE Trans. Multim..

[8]  William J. Christmas,et al.  Tennis stroke detection and classification based on boosted activity detectors and particle filtering , 2006 .

[9]  Ben Taskar,et al.  Markov Logic: A Unifying Framework for Statistical Relational Learning , 2007 .

[10]  Wei-Ta Chu,et al.  Event detection in tennis matches based on video data mining , 2008, 2008 IEEE International Conference on Multimedia and Expo.

[11]  Alberto Del Bimbo,et al.  Taking into Consideration Sports Semantic Annotation of Sports Videos Content-based Multimedia Indexing and Retrieval , 2002 .

[12]  Rajeev Rastogi,et al.  Web information extraction using markov logic networks , 2011, KDD.

[13]  Stephen Muggleton,et al.  Learning from Positive Data , 1996, Inductive Logic Programming Workshop.

[14]  Janko Calic,et al.  A rule-based video annotation system , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[15]  Pedro M. Domingos,et al.  Deep Transfer: A Markov Logic Approach , 2011, AI Mag..

[16]  Raymond J. Mooney,et al.  Mapping and Revising Markov Logic Networks for Transfer Learning , 2007, AAAI.