Moving away from narrow-scope solutions in multimedia content analysis

Recent research results in the field of Multimedia Content Analysis (MCA) have been marked by an abundance of theoretical and algorithmic solutions covering narrow application domains only. In this paper we analyze this tendency and its origin in more detail and explain why, in our view, this should not be considered "the way to go" in providing easy access to content in multimedia systems and applications of the future. In particular, we concentrate on the case study of digital video, which we see as a straightforward example of multimedia. Through this case study we will discuss the needs and challenges of improving the generic potential of MCA algorithms.

[1]  Alan Hanjalic,et al.  Content-Based Analysis of Digital Video , 2004, Springer US.

[2]  Jenny Benois-Pineau,et al.  Real-Time and Distributed AV Content Analysis System for Consumer Electronics Networks , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[3]  Lie Lu,et al.  Unsupervised content discovery in composite audio , 2005, MULTIMEDIA '05.

[4]  Milind R. Naphade,et al.  A probabilistic framework for semantic video indexing, filtering, and retrieval , 2001, IEEE Trans. Multim..

[5]  Fumiko Satoh,et al.  Learning personalized video highlights from detailed MPEG-7 metadata , 2002, Proceedings. International Conference on Image Processing.

[6]  Marcel Worring,et al.  Time interval maximum entropy based event indexing in soccer video , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[7]  Dominique Barba,et al.  Grouping video shots into scenes based on 1D mosaic descriptors , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[8]  Peter J. L. van Beek,et al.  Detection of slow-motion replay segments in sports video for highlights generation , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[9]  Boon-Lock Yeo,et al.  Time-constrained clustering for segmentation of video into story units , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[10]  John R. Kender,et al.  Video scene segmentation via continuous video coherence , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[11]  Alan Hanjalic,et al.  Automated high-level movie segmentation for advanced video-retrieval systems , 1999, IEEE Trans. Circuits Syst. Video Technol..

[12]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[13]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Noboru Babaguchi,et al.  Extracting actors, actions and events from sports video -a fundamental approach to story tracking , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[15]  Jun Fan,et al.  Face related features in consumer electronic (CE) device environments , 2004, 2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583).

[16]  Wenjun Zeng,et al.  Integrated image and speech analysis for content-based video indexing , 1996, Proceedings of the Third IEEE International Conference on Multimedia Computing and Systems.

[17]  HongJiang Zhang,et al.  Automatic parsing of TV soccer programs , 1995, Proceedings of the International Conference on Multimedia Computing and Systems.

[18]  Alan Hanjalic,et al.  Affective video content representation and modeling , 2005, IEEE Transactions on Multimedia.

[19]  Jan Nesvadba,et al.  Integrating Media Management Towards Ambient Intelligence , 2005, Adaptive Multimedia Retrieval.