Activity recognition and uncertain knowledge in video scenes

Activity recognition has been a growing research topic in recent years, with applications ranging from the automatic recognition of social interactions such as shaking hands to parking lot surveillance, traffic monitoring, and the detection of abandoned luggage. This paper describes a probabilistic framework for handling uncertainty in a description-based event recognition approach. The proposed approach allows flexible modeling of composite events with complex temporal constraints, and it uses probability theory to provide a consistent framework for reasoning about uncertain knowledge when recognizing complex events. We validate the event recognition accuracy of the proposed algorithm on real-world videos. The experimental results show that our system recognizes activities with a high recognition rate. We conclude by comparing our algorithm with the state of the art and showing how the definition of event models and the probabilistic reasoning influence the results of real-time event recognition.
