Neuroplex: learning to detect complex events in sensor networks through knowledge injection

Despite their remarkable success in a broad set of sensing applications, state-of-the-art deep learning techniques struggle with complex reasoning tasks across a distributed set of sensors. Unlike recognizing transient complex activities (e.g., human activities such as walking or running) from a single sensor, detecting more complex events with larger spatial and temporal dependencies across multiple sensors is extremely difficult, e.g., using a hospital's sensor network to detect whether a nurse is following a sanitary protocol as they move from patient to patient. Training a more complicated model requires a larger amount of data, which is unrealistic because complex events rarely happen in nature. Moreover, neural networks struggle to reason about serial, aperiodic events that are widely separated in space and time. We propose Neuroplex, a neural-symbolic framework that learns to perform complex reasoning on raw sensory data with the help of high-level, injected human knowledge. Neuroplex decomposes the entire complex learning space into explicit perception and reasoning layers: neural networks perform the low-level perception tasks, and neurally reconstructed reasoning models perform high-level, explainable reasoning. After the neurally reconstructed reasoning model has been trained using human knowledge, Neuroplex allows effective end-to-end training of the perception models with an additional semantic loss, using only sparse, high-level annotations. Our experiments and evaluation show that Neuroplex is capable of learning to efficiently and effectively detect complex events that cannot be handled by state-of-the-art neural network models. During training, Neuroplex not only reduces data annotation requirements by 100x, but also speeds up the learning process for complex event detection by 4x.
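The split between a perception layer and a reasoning layer, trained from sparse high-level labels, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the event vocabulary `EVENTS`, the hand-written rule `violates_protocol`, and the helper names are hypothetical. Where Neuroplex uses a neurally reconstructed reasoning model, this sketch substitutes exact enumeration over all simple-event sequences, which is feasible only for very short sequences.

```python
import math
from itertools import product

# Hypothetical simple-event vocabulary for the hospital example (an assumption).
EVENTS = ("patient", "sanitize", "idle")

def violates_protocol(seq):
    """Hand-written rule standing in for injected human knowledge.

    Returns True if two patient contacts occur with no "sanitize" between them.
    Each "patient" event is treated as contact with a distinct patient.
    """
    need_sanitize = False
    for event in seq:
        if event == "patient":
            if need_sanitize:
                return True
            need_sanitize = True
        elif event == "sanitize":
            need_sanitize = False
    return False

def perceive(window_probs):
    """Perception-layer stand-in: most likely simple event per time window."""
    return [EVENTS[max(range(len(p)), key=p.__getitem__)] for p in window_probs]

def complex_event_prob(window_probs):
    """Probability that the rule fires, assuming independent windows.

    Sums the probability of every simple-event sequence that violates the rule.
    """
    total = 0.0
    for idx in product(range(len(EVENTS)), repeat=len(window_probs)):
        if violates_protocol([EVENTS[i] for i in idx]):
            total += math.prod(p[i] for p, i in zip(window_probs, idx))
    return total

def sparse_label_loss(window_probs, label):
    """Negative log-likelihood of the single high-level label for the sequence."""
    p = min(max(complex_event_prob(window_probs), 1e-12), 1 - 1e-12)
    return -math.log(p if label else 1 - p)

# Per-window distributions over EVENTS, as a perception model might output them.
probs = [[0.9, 0.05, 0.05], [0.1, 0.8, 0.1], [0.85, 0.05, 0.1]]
print(perceive(probs))
print(complex_event_prob(probs))
print(sparse_label_loss(probs, label=False))
```

The loss depends only on one label per sequence, not on per-window annotations, which is the sense in which the supervision is sparse. In a trainable version the per-window probabilities would come from a differentiable perception model, so the gradient of this loss could reach it.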
