Use of Multiple Distributed Process Instances for Activity Analysis in Videos

Video surveillance of security-relevant areas is being used ever more frequently. Because of limited human resources, they are usually only checked for the presence of problematic activity after a specific event has occurred. An approach to the solution is provided by automated systems that are capable of detecting and analyzing movement sequences of objects including persons. Even though solutions already exist for scene recognition [3, 4, 8, 9], their architecture, their problem-specific domain and the nature of the systems make it difficult to integrate new activities or better algorithms. A system structure based on decentralized process instances and their communication via defined interfaces would instead enable a simpler expansion of the system. This paper describes the determination of activities in videos using decentralized frameworks and their interconnection via standardized interfaces. All components act autonomously and provide their data via a central location. Based on this approach, the modular system can be used for a wide variety of applications in the context of machine learning.

[1]  Larry S. Davis,et al.  AVSS 2011 demo session: A large-scale benchmark dataset for event recognition in surveillance video , 2011, AVSS.

[2]  Mubarak Shah,et al.  Real-World Anomaly Detection in Surveillance Videos , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[3]  Matthias Vodel,et al.  A Scalable System Architecture for Activity Detection with Simple Heuristics , 2019, 2019 IEEE Winter Applications of Computer Vision Workshops (WACVW).

[4]  Jonathan G. Fiscus,et al.  TRECVID 2018: Benchmarking Video Activity Detection, Video Captioning and Matching, Video Storytelling Linking and Video Search , 2018, TRECVID.

[5]  Álvaro García-Martín,et al.  DiVA: A Distributed Video Analysis Framework Applied to Video-Surveillance Systems , 2008, 2008 Ninth International Workshop on Image Analysis for Multimedia Interactive Services.

[6]  Ali Farhadi,et al.  YOLO9000: Better, Faster, Stronger , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Daijin Kim,et al.  A Depth Video-based Human Detection and Activity Recognition using Multi-features and Embedded Hidden Markov Models for Health Care Monitoring Systems , 2017, Int. J. Interact. Multim. Artif. Intell..

[8]  Incheol Kim,et al.  DeepAct: A Deep Neural Network Model for Activity Detection in Untrimmed Videos , 2018, J. Inf. Process. Syst..