FXPAL Experiments for TRECVID 2004

For shot boundary detection, our approach combines pairwise similarity analysis and supervised classification. Using primitive low-level image features, we build secondary features based on inter-frame dissimilarity. The secondary features are motivated by prior work on media segmentation in which a kernel function is correlated along the main diagonal of a similarity matrix to construct a frameindexed novelty measure. In contrast to many previous approaches, the kernel functions combine all pairwise dissimilarity information in a neighborhood of L frames around the current frame. These secondary features are used as input to an efficient k-Nearest-Neighbor (kNN) classifier. The classifier labels each frame as a shot boundary or non-boundary, and the classifier outputs are minimally processed to determine the final segmentation.

[1]  Jonathan Foote,et al.  Scene boundary detection via video self-similarity analysis , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[2]  Nilesh V. Patel,et al.  Video shot detection and characterization for video databases , 1997, Pattern Recognit..

[3]  Yanjun Qi,et al.  Supervised classification for video shot segmentation , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[4]  Matthew Cooper Video segmentation combining similarity analysis and classification , 2004, MULTIMEDIA '04.

[5]  Jing Huang,et al.  Image indexing using color correlograms , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[6]  Susan T. Dumais,et al.  Using Linear Algebra for Intelligent Information Retrieval , 1995, SIAM Rev..

[7]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[8]  Alexander G. Gray,et al.  Efficient exact k-NN and nonparametric classification in high dimensions , 2003, NIPS 2003.