Video Handling with Music and Speech Detection

The audio-based approach to video indexing described by the authors detects music and speech independently even when they occur simultaneously. The indexed video segments, when presented on the Video Sound Browser, let users randomly access the video. The Video in Time system provides different video condensation levels based on video structuring that can link the video segments and the director's intentions.

[1]  Yukinobu Taniguchi,et al.  An intuitive and efficient access interface to real-time incoming video based on automatic indexing , 1995, MULTIMEDIA '95.

[2]  Douglas Keislar,et al.  Content-Based Classification, Search, and Retrieval of Audio , 1996, IEEE Multim..

[3]  Barry Arons Hands-on demonstration: interacting with SpeechSkimmer , 1995, UIST '95.

[4]  Yukinobu Taniguchi,et al.  Structured Video Computing , 1994, IEEE MultiMedia.

[5]  Glorianna Davenport,et al.  Video streamer , 1994, CHI Conference Companion.

[6]  Philippe Gelin,et al.  Keyword spotting for video soundtrack indexing , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[7]  Yoshinobu Tonomura,et al.  Projection-detecting filter for video cut detection , 1994, MULTIMEDIA '93.

[8]  Jonathan Foote,et al.  A Similarity Measure for Automatic Audio Classification , 1997 .

[9]  Wolfgang Effelsberg,et al.  Automatic audio content analysis , 1997, MULTIMEDIA '96.

[10]  Walter Bender,et al.  Salient video stills: content and context preserved , 1993, MULTIMEDIA '93.

[11]  Scott D. Lipscomb,et al.  Perceptual judgement of the relationship between musical and visual components in film. , 1994 .

[12]  Riccardo Leonardi,et al.  Audio as a support to scene change detection and characterization of video sequences , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[13]  Don Kimber,et al.  Acoustic Segmentation for Audio Browsers , 1997 .

[14]  Guy J. Brown,et al.  Computational auditory scene analysis , 1994, Comput. Speech Lang..

[15]  S. Abe,et al.  Content oriented visual interface using video icons for visual database systems , 1989, [Proceedings] 1989 IEEE Workshop on Visual Languages.

[16]  Stephen W. Smoliar,et al.  Content based video indexing and retrieval , 1994, IEEE MultiMedia.

[17]  Malcolm Slaney,et al.  Construction and evaluation of a robust multifeature speech/music discriminator , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[18]  Michael G. Christel,et al.  Automating the creation of a digital video library , 1995, MULTIMEDIA '95.

[19]  Hiroshi Hamada,et al.  Enhanced video handling based on audio analysis , 1997, Proceedings of IEEE International Conference on Multimedia Computing and Systems.

[20]  Yoshinobu Tonomura,et al.  Video tomography: an efficient method for camerawork extraction and motion analysis , 1994, MULTIMEDIA '94.

[21]  Percy H. Tannenbaum,et al.  Music background in the judgment of stage and television drama , 1956 .

[22]  Julian F. Thayer,et al.  Effects of music on psychophysiological responses to a stressful film. , 1983 .

[23]  Glorianna Davenport,et al.  Creating and Viewing the Elastic Charles - A Hypermedia Journal , 1989, UK Hypertext.

[24]  Michael Hawley Structure out of sound , 1993 .