Semantic Indexing of Multimedia Documents

We propose two approaches for semantic indexing of audio-visual documents, based on bottom-up and top-down strategies. We base the first approach on a finite-state machine using low-level motion indices extracted from an MPEG compressed bitstream. The second approach innovatively performs semantic indexing through Hidden Markov Models.

[1]  Riccardo Leonardi,et al.  The ToCAI Description Scheme for Indexing and Retrieval of Multimedia Documents , 2001, Multimedia Tools and Applications.

[2]  Edoardo Ardizzone,et al.  Video indexing using optical flow field , 1996, Proceedings of 3rd IEEE International Conference on Image Processing.

[3]  Riccardo Leonardi,et al.  Semantic video indexing using MPEG motion vectors , 2000, 2000 10th European Signal Processing Conference.

[4]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[5]  Riccardo Leonardi,et al.  Identification of story units in audio-visual sequences by joint audio and video processing , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[6]  Riccardo Leonardi,et al.  Indexing audiovisual databases through joint audio and video processing , 1998, Int. J. Imaging Syst. Technol..

[7]  B. S. Manjunath,et al.  Content-based search of video using color, texture, and motion , 1997, Proceedings of International Conference on Image Processing.

[8]  T. Sikora MPEG Digital Audio-and Video-Coding Standards , 1997, IEEE Signal Processing Magazine.

[9]  Warnakulasuriya Anil Chandana Fernando,et al.  Video segmentation and classification for content-based storage and retrieval using motion vectors , 1998, Electronic Imaging.

[10]  Richard J. Qian,et al.  Detecting semantic events in soccer games: towards a complete solution , 2001, IEEE International Conference on Multimedia and Expo, 2001. ICME 2001..

[11]  Riccardo Leonardi,et al.  Event recognition in sport programs using low-level motion indices , 2001, IEEE International Conference on Multimedia and Expo, 2001. ICME 2001..

[12]  Stefano Tubaro,et al.  Multistage motion estimation for image interpolation , 1995, Signal Process. Image Commun..

[13]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[14]  Thomas Sikora,et al.  MPEG digital video-coding standards , 1997, IEEE Signal Process. Mag..

[15]  Zhu Liu,et al.  Multimedia content analysis-using both audio and visual clues , 2000, IEEE Signal Process. Mag..