Approximate reasoning and finite state machines to the detection of actions in video sequences

In this paper a novel approach for recognizing actions in video sequences is presented, where the information obtained from the segmentation and tracking algorithms is used as input data. First of all, the fuzzification of input data is done and this process allows to successfully manage the uncertainty inherent to the information obtained from low-level and medium-level vision tasks, to unify the information obtained from different vision algorithms into a homogeneous representation and to aggregate the characteristics of the analyzed scenario and the objects in motion. Another contribution is the novelty of representing actions by means of an automaton and the generation of input symbols for the finite automaton depending on the comparison process between objects and actions, i.e., the main reasoning process is based on the operation of automata with capability to manage fuzzy representations of all video data. The experiments on several real traffic video sequences demonstrate encouraging results, especially when no training algorithms to obtain predefined actions to be identified are required.

[1]  Lotfi A. Zadeh,et al.  The Concepts of a Linguistic Variable and its Application to Approximate Reasoning , 1975 .

[2]  Hans-Hellmut Nagel,et al.  Association of Motion Verbs with Vehicle Movements Extracted from Dense Optical Flow Fields , 1994, ECCV.

[3]  Jose Jesus Castro-Schez,et al.  Action Recognition in Video Sequences using a Mealy Machine , 2008 .

[4]  Ian D. Reid,et al.  A general method for human activity recognition in video , 2006, Comput. Vis. Image Underst..

[5]  Aaron F. Bobick,et al.  Action recognition using probabilistic parsing , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[6]  George H. Mealy,et al.  A method for synthesizing sequential circuits , 1955 .

[7]  L. Jimenez Fuzzy logic to track objects from MPEG video sequences , 2008 .

[8]  Ramakant Nevatia,et al.  Video-based event recognition: activity representation and probabilistic recognition methods , 2004, Comput. Vis. Image Underst..

[9]  Peter Secretan Learning , 1965, Mental Health.

[10]  L. A. ZADEH,et al.  The concept of a linguistic variable and its application to approximate reasoning - I , 1975, Inf. Sci..

[11]  Zakia Hammal,et al.  Facial expression classification: An approach based on the fusion of facial deformations using the transferable belief model , 2007, Int. J. Approx. Reason..

[12]  Dariu Gavrila,et al.  The Visual Analysis of Human Movement: A Survey , 1999, Comput. Vis. Image Underst..

[13]  Luis Jiménez,et al.  Real-time moving object segmentation in H.264 compressed domain based on approximate reasoning , 2009, Int. J. Approx. Reason..

[14]  Aaron F. Bobick,et al.  Recognition of Visual Activities and Interactions by Stochastic Parsing , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  Francisco José Madrid-Cuevas,et al.  Multi-camera people tracking using evidential filters , 2009, Int. J. Approx. Reason..

[16]  Denis Pellerin,et al.  Belief Scheduler based on model failure detection in the TBM framework. Application to human activity recognition , 2010, Int. J. Approx. Reason..

[17]  A. G. Amitha Perera,et al.  Evaluation of Algorithms for Tracking Multiple Objects in Video , 2006, 35th IEEE Applied Imagery and Pattern Recognition Workshop (AIPR'06).

[18]  Tieniu Tan,et al.  A survey on visual surveillance of object motion and behaviors , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[19]  Paolo Remagnino,et al.  Distributed intelligence for multi-camera visual surveillance , 2004, Pattern Recognit..

[20]  Van-Thinh Vu,et al.  Temporal scenario for automatic video interpretation , 2004 .

[21]  Christèle Lecomte,et al.  Hierarchical and conditional combination of belief functions induced by visual tracking , 2010, Int. J. Approx. Reason..

[22]  Li Li,et al.  Semantic event representation and recognition using syntactic attribute graph grammar , 2009, Pattern Recognit. Lett..

[23]  Swarup Medasani,et al.  An overview of membership function generation techniques for pattern recognition , 1998, Int. J. Approx. Reason..

[24]  James J. Little,et al.  Tracking and recognizing actions of multiple hockey players using the boosted particle filter , 2009, Image Vis. Comput..

[25]  Luis Jiménez,et al.  Automatic objects behaviour recognition from compressed video domain , 2009, Image Vis. Comput..

[26]  Aaron F. Bobick,et al.  Recognition of human body motion using phase space constraints , 1995, Proceedings of IEEE International Conference on Computer Vision.

[27]  Ronald Poppe,et al.  A survey on vision-based human action recognition , 2010, Image Vis. Comput..

[28]  Patrick Pérez,et al.  View-Independent Action Recognition from Temporal Self-Similarities , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29]  Mubarak Shah,et al.  Learning, detection and representation of multi-agent events in videos , 2007, Artif. Intell..

[30]  Aaron F. Bobick,et al.  Recognizing Planned, Multiperson Action , 2001, Comput. Vis. Image Underst..