Automatic content-based retrieval and semantic classification of video content

Video classification can be viewed as the problem of discovering the signature patterns in the elemental features of a video class. To address it, this paper proposes a large and diverse set of video features. The paper further contributes a treatment of the high dimensionality of the resulting feature space and an algorithm, based on two-phase grid search, for automatic parameter selection for support vector machines (SVMs). The framework thus aims to bridge the gap between low-level features and semantic video classes. Experimental results on more than 5000 video segments, including a comparison with state-of-the-art learning tools, show the effectiveness of the approach.
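The abstract does not spell out the two-phase grid search, so the following is only a minimal sketch of one common reading: a coarse search over exponents of the SVM penalty C and the Gaussian-kernel width gamma, followed by a finer search around the best coarse point. The exponent ranges, the step sizes, and the function names (`two_phase_grid_search`, `toy_score`) are assumptions made for illustration, and `toy_score` is a hypothetical stand-in for a cross-validated accuracy estimate.

```python
import math
from itertools import product


def grid(lo, hi, step):
    """Exponents lo, lo + step, ... up to and including hi."""
    n = int(math.floor((hi - lo) / step + 1e-9)) + 1
    return [lo + i * step for i in range(n)]


def best_on_grid(score, c_exps, g_exps):
    """Evaluate score(C, gamma) on every grid point; return (score, log2 C, log2 gamma)."""
    return max((score(2.0 ** c, 2.0 ** g), c, g)
               for c, g in product(c_exps, g_exps))


def two_phase_grid_search(score, c_range=(-5, 15), g_range=(-15, 3),
                          coarse=2.0, fine=0.25):
    """Coarse-to-fine search over (C, gamma); ranges and steps are assumed defaults."""
    # Phase 1: coarse grid over the full exponent ranges.
    _, c0, g0 = best_on_grid(score,
                             grid(c_range[0], c_range[1], coarse),
                             grid(g_range[0], g_range[1], coarse))
    # Phase 2: fine grid within one coarse step of the best coarse point.
    best, c1, g1 = best_on_grid(score,
                                grid(c0 - coarse, c0 + coarse, fine),
                                grid(g0 - coarse, g0 + coarse, fine))
    return 2.0 ** c1, 2.0 ** g1, best


def toy_score(C, gamma):
    """Hypothetical stand-in for cross-validation accuracy, peaked at log2 C = 3.5, log2 gamma = -6.25."""
    return -((math.log2(C) - 3.5) ** 2 + (math.log2(gamma) + 6.25) ** 2)


if __name__ == "__main__":
    C, gamma, best = two_phase_grid_search(toy_score)
    print(math.log2(C), math.log2(gamma), best)
```

In practice `score` would train an SVM with the given C and gamma and return a validation estimate. The two phases keep the number of trainings far below that of a single fine grid over the full range.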
