Dimensionality reduction for fast and accurate video search and retrieval in a large scale database

A large amount of video data is generated every day. Searching through huge video database is an important problem in many applications. For instance, individuals may want to search for video content they are interested in from YouTube video, media companies may want to locate video content that violates their copyright protection and so on. Fast and accurate algorithm in all these cases is needed for efficient video retrieval. The high dimensionality of video sequence poses a major challenge of video indexing and retrieval. As dimensionality increases, query performance degrades. This phenomenon generally referred to as the dimensionality curse, can be circumvented by reducing the dimensionality of the data. Such a reduction is however accompanied by loss of precision of query results. Feature extraction and dimensionality reduction are highly related to each other, as the combined goal of the two processing steps is to generate a compact representation of the content of an image. Here we propose to perform dimensionality reduction in both phases of the video search and retrieval system by extracting appropriate features. We shall try to exploit the use of principal component analysis to transform the original data of high dimensionality into new co-ordinate system with low dimensionality and then use sparse representation before applying similarity match for fast and accurate search and retrieval of videos.

[1]  Jiebo Luo,et al.  Utilizing semantic word similarity measures for video retrieval , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Roziati Zainuddin,et al.  Compacted Dither Pattern Codes versus Principal Component Analysis in video visual depiction , 2010, 2010 International Symposium on Information Technology.

[3]  Witold Pedrycz,et al.  Dimensionality reduction for content-based image classification , 2004 .

[4]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[5]  Andrew Zisserman,et al.  Person Spotting: Video Shot Retrieval for Face Sets , 2005, CIVR.

[6]  Zhouyu Fu,et al.  Semantic-Based Surveillance Video Retrieval , 2007, IEEE Transactions on Image Processing.

[7]  Yi Yang,et al.  Interactive Video Indexing With Statistical Active Learning , 2012, IEEE Transactions on Multimedia.

[8]  Marcel Worring,et al.  Adding Semantics to Detectors for Video Retrieval , 2007, IEEE Transactions on Multimedia.

[9]  Shih-Fu Chang,et al.  Visual information retrieval from large distributed online repositories , 1997, CACM.

[10]  Qi Tian,et al.  Visualization and User-Modeling for Browsing Personal Photo Libraries , 2004, International Journal of Computer Vision.

[11]  Mark Halpern Variable-width tables with binary-search facility , 1958, CACM.

[12]  Avideh Zakhor,et al.  Motion indexing of video , 1997, Proceedings of International Conference on Image Processing.

[13]  Gang Wei,et al.  Video classification based on HMM using text and faces , 2000, 2000 10th European Signal Processing Conference.

[14]  O. Pybus,et al.  The Epidemic Behavior of the Hepatitis C Virus , 2001, Science.

[15]  Alan F. Smeaton,et al.  Video retrieval using dialogue, keyframe similarity and video objects , 2005, IEEE International Conference on Image Processing 2005.

[16]  Nicu Sebe,et al.  Content-based multimedia information retrieval: State of the art and challenges , 2006, TOMCCAP.

[17]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[18]  Hideyuki Tamura,et al.  Textural Features Corresponding to Visual Perception , 1978, IEEE Transactions on Systems, Man, and Cybernetics.

[19]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .

[20]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[21]  Sven Loncaric,et al.  A survey of shape analysis techniques , 1998, Pattern Recognit..

[22]  Boon-Lock Yeo,et al.  Time-constrained clustering for segmentation of video into story units , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[23]  Nuno Vasconcelos,et al.  Towards semantically meaningful feature spaces for the characterization of video content , 1997, Proceedings of International Conference on Image Processing.

[24]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[25]  Marcel Worring,et al.  Efficient Genre-Specific Semantic Video Indexing , 2012, IEEE Transactions on Multimedia.

[26]  Aggelos K. Katsaggelos,et al.  Video retrieval using sparse Bayesian reconstruction , 2011, 2011 IEEE International Conference on Multimedia and Expo.

[27]  Forouzan Golshani,et al.  Motion recovery for video content classification , 1995, TOIS.