Temporal Information in Collateral Texts for Indexing Movies

This paper suggests that video indexing is an interesting and important natural language application, one for which it is crucial to identify temporal information in collateral text that articulates the semantic content of moving images. Recently, a rich source of information about the content of films and television programmes has become available in the form of audio description scripts. An analysis of how temporal information is expressed in a corpus of audio description scripts leads to a discussion of some consequences for schemes that annotate such information in a video indexing application.
