Modeling and Discovering Occupancy Patterns in Sensor Networks Using Latent Dirichlet Allocation

This paper presents a novel way to perform probabilistic modeling of occupancy patterns from a sensor network. The approach is based on the Latent Dirichlet Allocation (LDA) model. The application of the LDA model is shown using a real dataset of occupancy logs from the sensor network of a modern office building. LDA is a generative and unsupervised probabilistic model for collections of discrete data. Continuous sequences of just binary sensor readings are segmented together in order to build the dataset discrete data (bag-of-words). Then, these bag-of-words are used to train the model with a fixed number of topics, also known as routines. Preliminary obtained results state that the LDA model successfully found latent topics over all rooms and therefore obtain the dominant occupancy patterns or routines on the sensor network.

[1]  J. Brian Burns,et al.  Recovering Social Networks From Massive Track Datasets , 2008, 2008 IEEE Workshop on Applications of Computer Vision.

[2]  Darren Leigh,et al.  The MERL motion detector dataset , 2007, MD '07.

[3]  Daniel Gatica-Perez,et al.  Discovering routines from large-scale human locations using probabilistic topic models , 2011, TIST.

[4]  Yee Whye Teh,et al.  On Smoothing and Inference for Topic Models , 2009, UAI.

[5]  Juan Carlos Niebles,et al.  Unsupervised Learning of Human Action Categories , 2006 .

[6]  Lauri Ojala,et al.  From mornings to evenings: is there variation in shopping behaviour between different hours of the day? , 2007 .

[7]  Albert Ali Salah,et al.  T-Patterns Revisited: Mining for Temporal Patterns in Sensor Data , 2010, Sensors.

[8]  Gerard Salton,et al.  Research and Development in Information Retrieval , 1982, Lecture Notes in Computer Science.

[9]  Mark Steyvers,et al.  Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[11]  M S Magnusson,et al.  Discovering hidden time patterns in behavior: T-patterns and their detection , 2000, Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc.

[12]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[13]  Simon A. Dobson,et al.  Activity recognition using temporal evidence theory , 2010, J. Ambient Intell. Smart Environ..

[14]  Michael I. Jordan,et al.  An Introduction to Variational Methods for Graphical Models , 1999, Machine-mediated learning.

[15]  Thomas Hofmann,et al.  Probabilistic latent semantic indexing , 1999, SIGIR '99.