Interpretation of complex situations in a semantic-based surveillance framework

The integration of cognitive capabilities in computer vision systems requires both to enable high semantic expressiveness and to deal with high computational costs as large amounts of data are involved in the analysis. This contribution describes a cognitive vision system conceived to automatically provide high-level interpretations of complex real-time situations in outdoor and indoor scenarios, and to eventually maintain communication with casual end users in multiple languages. The main contributions are: (i) the design of an integrative multilevel architecture for cognitive surveillance purposes; (ii) the proposal of a coherent taxonomy of knowledge to guide the process of interpretation, which leads to the conception of a situation-based ontology; (iii) the use of situational analysis for content detection and a progressive interpretation of semantically rich scenes, by managing incomplete or uncertain knowledge, and (iv) the use of such an ontological background to enable multilingual capabilities and advanced end-user interfaces. Experimental results are provided to show the feasibility of the proposed approach.

[1]  C. Bishop The MIT Encyclopedia of the Cognitive Sciences , 1999 .

[2]  M. G. Strintzis,et al.  INTEGRATING KNOWLEDGE , SEMANTICS AND CONTENT FOR USER-CENTRED INTELLIGENT MEDIA SERVICES : THE ACEMEDIA PROJECT , 2004 .

[3]  Serge Abiteboul,et al.  Foundations of Databases , 1994 .

[4]  Jordi Gonzàlez,et al.  Improving Tracking by Handling Occlusions , 2005, ICAPR.

[5]  K. Schäfer,et al.  “F-Limette” fuzzy logic programming integrating metric temporal extensions , 1996 .

[6]  Gösta H. Granlund,et al.  Organization of Architectures for Cognitive Vision Systems , 2006, Cognitive Vision Systems.

[7]  J.L. Crowley,et al.  Situated Observation of Human Activity , 2005, Computer Vision for Interactive and Intelligent Environment (CVIIE'05).

[8]  Ehud Reiter,et al.  Book Reviews: Building Natural Language Generation Systems , 2000, CL.

[9]  Jordi Gonzàlez i Sabaté Human sequence evaluation: the key-frame approach , 2005 .

[10]  Hilary Buxton,et al.  Learning and understanding dynamic scene activity: a review , 2003, Image Vis. Comput..

[11]  J. M. Geusebroek Cognitive Vision Systems , 2004 .

[12]  L. Talmy Toward a Cognitive Semantics , 2003 .

[13]  Wolfgang Ponweiser,et al.  Contextual Coordination in a Cognitive Vision System for Symbolic Activity Interpretation , 2006, Fourth IEEE International Conference on Computer Vision Systems (ICVS'06).

[14]  Pau Baiget,et al.  Automatic generation of computer animated sequences based on human behaviour modelling , 2007 .

[15]  Daniel Rowe,et al.  Improving Background Subtraction Based on a Casuistry of Colour-Motion Segmentation Problems , 2007, IbPRIA.

[16]  Bernd Neumann,et al.  Natural Language Dialogue about Moving Objects in an Automatically Analyzed Traffic Scene , 1981, IJCAI.

[17]  Sergei Nirenburg,et al.  Book Review: Ontological Semantics, by Sergei Nirenburg and Victor Raskin , 2004, CL.

[18]  Hans-Hellmut Nagel,et al.  Steps toward a Cognitive Vision System , 2004, AI Mag..

[19]  Pau Baiget,et al.  Natural Language Descriptions of Human Behavior from Video Sequences , 2007, KI.

[20]  Jake K. Aggarwal,et al.  Event semantics in two-person interactions , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[21]  John B. Lowe,et al.  The Berkeley FrameNet Project , 1998, ACL.

[22]  Minhua Ma,et al.  Visual Semantics and Ontology of Eventive Verbs , 2004, IJCNLP.

[23]  Robert A. Wilson,et al.  Book Reviews: The MIT Encyclopedia of the Cognitive Sciences , 2000, CL.

[24]  H.-H. Nagel,et al.  Representation of occurrences for road vehicle traffic , 2008, Artif. Intell..

[25]  Hans-Hellmut Nagel,et al.  Cognitive Vision Systems, Sampling the Spectrum of Approaches [based on a Dagstuhl seminar] , 2006, Cognitive Vision Systems.

[26]  Kunio Fukunaga,et al.  Natural Language Description of Human Activities from Video Images Based on Concept Hierarchy of Actions , 2002, International Journal of Computer Vision.

[27]  Leonard Talmy,et al.  Toward a cognitive semantics, Vol. 1: Concept structuring systems. , 2000 .

[28]  Hans-Hellmut Nagel,et al.  Behavioral Knowledge Representation for the Understanding and Creation of Video Sequences , 2003, KI.

[29]  Hans-Hellmut Nagel,et al.  Integration of Image Sequence Evaluation and Fuzzy Metric Temporal Logic Programming , 1997, KI.

[30]  Diego Calvanese,et al.  The Description Logic Handbook , 2007 .