论文信息 - Improving Access to Digital Video Archives through Informedia Technology

Improving Access to Digital Video Archives through Informedia Technology

Informedia research at Carnegie Mellon University combines speech recognition, image processing, and natural language processing to automatically index a digital video library. This engineering report focuses on the contribution of speech analysis for transcript generation and alignment, and the use of these features in library interface development. By deepening the automated analysis, such as using named entity extraction to identify people and place names in the audio transcript, better summaries and visualizations can be produced to navigate through video libraries holding thousands of hours of material.

Howard D. Wactlar | Alexander G. Hauptmann | Michael G. Christel

[1] Michael G. Christel,et al. Evolving video skims into useful multimedia abstractions , 1998, CHI.

[2] Michael G. Christel,et al. Informedia Goes to School: Early Findings from the Digital Video Library Project , 1996, D Lib Mag..

[3] Howard D. Wactlar,et al. Indexing and search of multimodal information , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4] Michael G. Christel,et al. Improving Access to a Digital Video Library , 1997, INTERACT.

[5] Alexander G. Hauptmann,et al. Learning to Recognize Speech by Watching Television , 1999, IEEE Intell. Syst..

[6] Michael J. Witbrock,et al. Using words and phonetic strings for efficient information retrieval from imperfectly transcribed spoken documents , 1997, DL '97.

[7] Ralph Weischedel,et al. NAMED ENTITY EXTRACTION FROM SPEECH , 1998 .

[8] Michael J. Witbrock,et al. Artificial intelligence techniques in the interface to a Digital Video Library , 1997, CHI Extended Abstracts.

[9] Karen Spärck Jones,et al. Retrieving spoken documents by combining multiple index sources , 1996, SIGIR '96.