A Framework for Ontology Enriched Semantic Annotation of CCTV Video

This paper deals with the problem of semantic transcoding of CCTV video footage. A framework is proposed that combines Computer Vision algorithms that extract visual semantics, together with Natural Language Processing that automatically builds the domain ontology from unstructured text annotations. The final aim is a system that will link the visual and text semantics in order to routinely annotate video sequences with the appropriate keywords of the domain experts' terminology.

[1]  Julio Gonzalo,et al.  Corpus-based terminology extraction applied to information access , 2001 .

[2]  Ramakant Nevatia,et al.  Event Detection and Analysis from Video Streams , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Tim J. Ellis,et al.  Partial Observation vs. Blind Tracking through Occlusion , 2002, BMVC.

[4]  Alberto Del Bimbo,et al.  An Integrated Framework for Semantic Annotation and Adaptation , 2005, Multimedia Tools and Applications.

[5]  Ramakant Nevatia,et al.  VERL: An Ontology Framework for Representing and Annotating Video Events , 2005, IEEE Multim..

[6]  S. Wright,et al.  Handbook of terminology management. Vol. 1. , 1997 .

[7]  Lee Gillam,et al.  Automatic Ontology Extraction from Unstructured Texts , 2005, OTM Conferences.

[8]  Khurshid Ahmad,et al.  Corpus-Based Thesaurus Construction for Image Retrieval in Specialist Domains , 2003, ECIR.

[9]  Tieniu Tan,et al.  A survey on visual surveillance of object motion and behaviors , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[10]  Tim J. Ellis,et al.  Learning semantic scene models from observing activity in visual surveillance , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[11]  Lee Gillam,et al.  Scene of Crime Information System: Playing at St. Andrews , 2003, CLEF.

[12]  James Orwell,et al.  Learning Surveillance Tracking Models for the Self-Calibrated Ground Plane , 2002, BMVC.

[13]  J.-P. Renno,et al.  Application and Evaluation of Colour Constancy in Visual Surveillance , 2005, 2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance.

[14]  Paolo Remagnino,et al.  Classifying Surveillance Events from Attributes and Behaviour , 2001, BMVC.

[15]  Michael G. Strintzis,et al.  Knowledge-assisted semantic video object detection , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[16]  Touradj Ebrahimi,et al.  Semantic segmentation and description for video transcoding , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).