Video concept detection using Hadoop MapReduce framework

Indexing video collection with a large set of video concepts allows us to employ a query-by-concept paradigm. Through concept detection, the task is to detect in the shot- segmented video data the presence of a set of pre-defined semantic concepts. Columbia University provided 374 detectors based on color, texture and edge features, and their unsupervised classifier fusion that can be utilized for concepts training purposes. We had successfully implemented a Concept-based Video retrieval System (CBVRS) to support query-by-concept. However, one of the main challenges is to reduce the training time of concepts to support the video concept detection. This paper presents an alternative architecture to overcome the issue. The proposed CBVRS framework, which consists of three main modules i.e. pre-processing, video analysis, and annotation module, shall utilize the Hadoop MapReduce framework for fast computation of the concept detection.

[1]  Shih-Fu Chang,et al.  Columbia University’s Baseline Detectors for 374 LSCOM Semantic Visual Concepts , 2007 .

[2]  James Ze Wang,et al.  Real-time computerized annotation of pictures. , 2008, IEEE transactions on pattern analysis and machine intelligence.

[3]  David Dagan Feng,et al.  Improving News Video Annotation with Semantic Context , 2010, 2010 International Conference on Digital Image Computing: Techniques and Applications.

[4]  Meng Wang,et al.  Beyond Distance Measurement: Constructing Neighborhood Similarity for Video Annotation , 2009, IEEE Transactions on Multimedia.

[5]  Xuelong Li,et al.  Semantic Mining Technologies for Multimedia Databases , 2009 .

[6]  Xian-Sheng Hua,et al.  Beyond Accuracy: Typicality Ranking for Video Annotation , 2007, 2007 IEEE International Conference on Multimedia and Expo.

[7]  Chao-Chun Chen,et al.  Design and Implement a MapReduce Framework for Executing Standalone Software Packages in Hadoop-based Distributed Environments , 2013 .

[8]  Guiguang Ding,et al.  Semantic classifier based on compressed sensing for image and video annotation , 2010 .

[9]  Mark Craven,et al.  Active Learning with Real Annotation Costs , 2008 .

[10]  James Ze Wang,et al.  Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.

[11]  Yang Li,et al.  A Novel Video Annotation Framework Based on Video Object , 2009, 2009 International Joint Conference on Artificial Intelligence.

[12]  Janko Calic,et al.  A rule-based video annotation system , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[13]  Hongjiang Zhang,et al.  Content-Based Video Analysis, Retrieval and Browsing , 2003 .

[14]  Marco Bertini,et al.  Semantic annotation and retrieval of video events using multimedia ontologies , 2007 .

[15]  Christos Faloutsos,et al.  Automatic image captioning , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[16]  John R. Smith,et al.  Active learning for simultaneous annotation of multiple binary semantic concepts [video content analysis] , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).