Hamming Selection Pruned Sets (HSPS) for Efficient Multi-label Video Classification

Videos have become an integral part of our life, from watching movies online to the use of videos in classroom teaching. Existing machine learning techniques are constrained with this scaled up activity because of this huge upsurge in online activity. A lot of research is now focused on reducing the time and accuracy of video classification. Content-Based Video Information Retrieval CBVIR implementation (E.g. Columbia374) is one such approach. We propose a fast Hamming Selection Pruned Sets (HSPS) algorithm that efficiently transforms multi-label video dataset into single-label representation. Thus, multi-label relationship between the labels can be retained for later single label classifier learning stage. Hamming distance (HD) is used to detect similarity between label-sets. HSPS captures new potential label-set relationships that were previously undetected by baseline approach. Experiments show a significant 22.9% dataset building time reduction and consistent accuracy improvement over the baseline method. HSPS also works on general multi-label dataset.

[1]  Marcel Worring,et al.  The challenge problem for automated detection of 101 semantic concepts in multimedia , 2006, MM '06.

[2]  Eyke Hüllermeier,et al.  Combining Instance-Based Learning and Logistic Regression for Multilabel Classification , 2009, ECML/PKDD.

[3]  Geoffrey Holmes,et al.  Efficient multi-label classification for evolving data streams , 2010 .

[4]  Cor J. Veenman,et al.  Robust Scene Categorization by Learning Image Statistics in Context , 2006, 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06).

[5]  Jesse Read,et al.  A Pruned Problem Transformation Method for Multi-label Classification , 2008 .

[6]  Marcel Worring,et al.  Concept-Based Video Retrieval , 2009, Found. Trends Inf. Retr..

[7]  Geoff Holmes,et al.  Leveraging Bagging for Evolving Data Streams , 2010, ECML/PKDD.

[8]  Lior Rokach,et al.  Data Mining And Knowledge Discovery Handbook , 2005 .

[9]  Mikhail Petrovskiy,et al.  Paired Comparisons Method for Solving Multi-Label Learning Problem , 2006, 2006 Sixth International Conference on Hybrid Intelligent Systems (HIS'06).

[10]  Chong-Wah Ngo,et al.  Exploring inter-concept relationship with context space for semantic video indexing , 2009, CIVR '09.

[11]  E. Nowak,et al.  Vehicle Categorization: Parts for Speed and Accuracy , 2005, 2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance.

[12]  Mohan S. Kankanhalli,et al.  Multimedia data mining: state of the art and challenges , 2010, Multimedia Tools and Applications.

[13]  John C. Platt,et al.  Fast training of support vector machines using sequential minimal optimization, advances in kernel methods , 1999 .

[14]  Jason Weston,et al.  A kernel method for multi-labelled classification , 2001, NIPS.

[15]  Tao Mei,et al.  Automatic Video Genre Categorization using Hierarchical SVM , 2006, 2006 International Conference on Image Processing.

[16]  Shih-Fu Chang,et al.  CU-VIREO 374 : Fusing Columbia 374 and VIREO 374 for Large Scale Semantic Concept Detection , 2008 .

[17]  Shih-Fu Chang,et al.  A reranking approach for context-based concept fusion in video indexing and retrieval , 2007, CIVR '07.

[18]  Shih-Fu Chang,et al.  Columbia University’s Baseline Detectors for 374 LSCOM Semantic Visual Concepts , 2007 .

[19]  Joost N. Kok Machine Learning: ECML 2007, 18th European Conference on Machine Learning, Warsaw, Poland, September 17-21, 2007, Proceedings , 2007, ECML.

[20]  Chong-Wah Ngo,et al.  Representations of Keypoint-Based Semantic Concept Detection: A Comprehensive Study , 2010, IEEE Transactions on Multimedia.

[21]  Chong-Wah Ngo,et al.  Domain adaptive semantic diffusion for large scale context-based video annotation , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[22]  Richard W. Hamming,et al.  Error detecting and error correcting codes , 1950 .

[23]  Luis Alfonso Ureña López,et al.  Adaptive Selection of Base Classifiers in One-Against-All Learning for Large Multi-labeled Collections , 2004, EsTAL.

[24]  Grigorios Tsoumakas,et al.  Mining Multi-label Data , 2010, Data Mining and Knowledge Discovery Handbook.

[25]  Johannes Fürnkranz,et al.  Multi-Label Classification with Label Constraints , 2008 .

[26]  Grigorios Tsoumakas,et al.  Random k -Labelsets: An Ensemble Method for Multilabel Classification , 2007, ECML.