Video event recounting using mixture subclass discriminant analysis

In this paper, a new feature selection method is used, in combination with a semantic model vector video representation, in order to enumerate the key semantic evidences of an event in a video signal. In particular, a set of semantic concept detectors is firstly used for estimating a model vector for each video signal, where each element of the model vector denotes the degree of confidence that the respective concept is depicted in the video. Then, a novel feature selection method is learned for each event of interest. This method is based on exploiting the first two eigenvectors derived using the eigenvalue formulation of the mixture subclass discriminant analysis. Subsequently, given a video-event pair, the proposed method jointly evaluates the significance of each concept for the detection of the given event and the degree of confidence with which this concept is detected in the given video, in order to decide which concepts provide the strongest evidence in support of the provided video-event link. Experimental results using a video collection of TRECVID demonstrate the effectiveness of the proposed video event recounting method.

[1]  Dong Liu,et al.  BBNVISER : BBN VISER TRECVID 2012 Multimedia Event Detection and Multimedia Event Recounting Systems , 2012, TRECVID.

[2]  Fengxi Song,et al.  Feature Selection Based on Linear Discriminant Analysis , 2010, 2010 International Conference on Intelligent System Design and Engineering Application.

[3]  Gang Hua,et al.  Semantic Model Vectors for Complex Video Event Recognition , 2012, IEEE Transactions on Multimedia.

[4]  Lei Wang,et al.  Feature Selection with Kernel Class Separability , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Brian Antonishek TRECVID 2010 – An Introduction to the Goals , Tasks , Data , Evaluation Mechanisms , and Metrics , 2010 .

[6]  N. Brown On The Prevalence of Event Clusters in Autobiographical Memory , 2005 .

[7]  Koen E. A. van de Sande,et al.  Evaluating Color Descriptors for Object and Scene Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Yiannis Kompatsiaris,et al.  Mixture Subclass Discriminant Analysis , 2011, IEEE Signal Processing Letters.

[9]  Yiannis Kompatsiaris,et al.  ITI-CERTH participation to TRECVID 2015 , 2015, TRECVID.

[10]  Cor J. Veenman,et al.  Visual Word Ambiguity , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Michael G. Strintzis,et al.  Estimation and representation of accumulated motion characteristics for semantic event detection , 2008, 2008 15th IEEE International Conference on Image Processing.

[12]  Yiannis Kompatsiaris,et al.  Automatic event-based indexing of multimedia content using a joint content-event model , 2010, EiMM '10.

[13]  No Value,et al.  IEEE International Conference on Image Processing , 2003 .

[14]  Yiannis Kompatsiaris,et al.  High-level event detection in video exploiting discriminant concepts , 2011, 2011 9th International Workshop on Content-Based Multimedia Indexing (CBMI).

[15]  Yiannis Kompatsiaris,et al.  A Joint Content-Event Model for Event-Centric Multimedia Indexing , 2010, 2010 IEEE Fourth International Conference on Semantic Computing.

[16]  Mubarak Shah,et al.  High-level event recognition in unconstrained videos , 2013, International Journal of Multimedia Information Retrieval.

[17]  Koichi Shinoda,et al.  Multimedia event detection using GMM supervectors and SVMS , 2012, 2012 19th IEEE International Conference on Image Processing.

[18]  Daniel P. W. Ellis,et al.  IBM Research and Columbia University TRECVID-2012 Multimedia Event Detection (MED), Multimedia Event Recounting (MER), and Semantic Indexing (SIN) Systems , 2012, TRECVID.

[19]  Yiannis Kompatsiaris,et al.  Mixture Subclass Discriminant Analysis Link to Restricted Gaussian Model and Other Generalizations , 2013, IEEE Transactions on Neural Networks and Learning Systems.