An Effective Multi-concept Classifier for Video Streams

In this paper, an effective multi-concept classifier is proposed for video semantic concept detection. The core of the proposed classifier is a supervised classification approach called C-RSPM (collateral representative subspace projection modeling) which is applied to a set of multimodal video features for knowledge discovery. It adaptively selects non-consecutive principal dimensions to form an accurate modeling of a representative subspace based on the statistical information analysis and thus achieves both promising classification accuracy and operational merits. Its effectiveness is demonstrated by the comparative experiment, as opposed to several well-known supervised classification approaches including SVM, Decision Trees, Neural Network, Multinomial Logistic Regression Model, and One Rule Classifier, on goal/corner event detection and sports/commercials concepts extraction from soccer videos and TRECVID news collections.

[1]  Min Chen,et al.  Semantic event detection via multimodal data mining , 2006, IEEE Signal Processing Magazine.

[2]  Lei Wang,et al.  Offense based temporal segmentation for event detection in soccer video , 2004, MIR '04.

[3]  Yi-Ping Phoebe Chen,et al.  Content-based video indexing for sports applications using integrated multi-modal approach , 2005, MULTIMEDIA '05.

[4]  Svante Wold,et al.  Pattern recognition by means of disjoint principal components models , 1976, Pattern Recognit..

[5]  Chng Eng Siong,et al.  Automatic replay generation for soccer video broadcasting , 2004, MULTIMEDIA '04.

[6]  Rangasami L. Kashyap,et al.  Identifying Overlapped Objects for Video Indexing and Modeling in Multimedia Database Systems , 2001, Int. J. Artif. Intell. Tools.

[7]  Chengcui Zhang,et al.  Innovative Shot Boundary Detection for Video Indexing , 2005 .

[8]  Shih-Fu Chang,et al.  Unsupervised discovery of multilevel statistical video structures using hierarchical hidden Markov models , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[9]  Shu-Ching Chen,et al.  Collateral Representative Subspace Projection Modeling for Supervised Classification , 2006, 2006 18th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'06).

[10]  B. Ripley,et al.  Pattern Recognition , 1968, Nature.

[11]  Noel E. O'Connor,et al.  Event detection in field sports video using audio-visual features and a support vector Machine , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[12]  Qi Tian,et al.  A mid-level representation framework for semantic sports video analysis , 2003, ACM Multimedia.

[13]  Dit-Yan Yeung,et al.  Parzen-window network intrusion detectors , 2002, Object recognition supported by user interaction for service robots.

[14]  Min Chen,et al.  Video Semantic Event/Concept Detection Using a Subspace-Based Multimedia Data Mining Framework , 2008, IEEE Transactions on Multimedia.

[15]  Wen-Nung Lie,et al.  Motion-based event detection and semantic classification for baseball sport videos , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[16]  Min Chen,et al.  A multimodal data mining framework for soccer goal detection based on decision tree logic , 2006, Int. J. Comput. Appl. Technol..

[17]  Riccardo Leonardi,et al.  Semantic indexing of soccer audio-visual sequences: a multimodal approach based on controlled Markov chains , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[18]  A. Murat Tekalp,et al.  Generic play-break event detection for summarization and hierarchical sports video analysis , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[19]  Shih-Fu Chang,et al.  Event detection in baseball video using superimposed caption recognition , 2002, MULTIMEDIA '02.