On the Use of MPEG-7 for Visual Surveillance

This paper investigates the use of the MPEG-7 standard to represent the output of automated visual surveillance systems. There are existing standards and proposals for the representation of surveillance outputs in an extensible on tology: the prospects for accommodating their features in a single MPEG-7 surveillance schema are appraised. The aim is to define an extensible framework, suitable for the output of single cameras and whole systems, to be used by operators and automatic processes to satisfy the diverse and extending set of requirements for surveillance metadata. These requirements are discussed in some detail. An MPEG-7 document for a simple surveillance task is constructed, with two layers of descriptors for the observed pedestrians. The first layer comprises colour descriptions of transitory objects observed from single sensors. The sec ond layer aims to provide a uniform, unique identifier for each entity observed in the overall scene. The prototype schema demonstrates techniques to profile and extend the MPEG-7 standard, and uses a graph-based approach to represent the uncertain state of knowledge about the relationships between low and high level elements.

[1]  J. Marques,et al.  On-line Tracking Groups of Pedestrians with Bayesian Networks , 2004 .

[2]  Lai-Man Po,et al.  MPEG-7 dominant color descriptor based relevance feedback using merged palette histogram , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  Brian V. Funt,et al.  A comparison of computational color constancy algorithms. I: Methodology and experiments with synthesized data , 2002, IEEE Trans. Image Process..

[4]  Jin Hyeong Park,et al.  Performance evaluation of object detection algorithms , 2002, Object recognition supported by user interaction for service robots.

[5]  B. S. Manjunath,et al.  Introduction to mpeg-7 , 2002 .

[6]  Ramakant Nevatia,et al.  VERL: An Ontology Framework for Representing and Annotating Video Events , 2005, IEEE Multim..

[7]  James A. Hendler,et al.  DAML+OIL: An Ontology Language for the Semantic Web , 2002, IEEE Intell. Syst..

[8]  Robert B. Fisher,et al.  CVML - an XML-based computer vision markup language , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[9]  J.-P. Renno,et al.  Evaluation of MPEG7 color descriptors for visual surveillance retrieval , 2005, 2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance.

[10]  R. Y. Tsai,et al.  An Efficient and Accurate Camera Calibration Technique for 3D Machine Vision , 1986, CVPR 1986.