Unsupervised Long-Term Routine Modelling Using Dynamic Bayesian Networks

Routine can be defined as the frequent and regular activity patterns over a specified timescale (e.g. daily/weekly routine). In this work, we capture routine patterns for a single person from long- term visual data using a Dynamic Bayesian Network (DBN). Assuming a person always performs purposeful activities at corresponding locations; spatial, pose and time-of-day information are used as sources of input for routine modelling. We assess variations of the independence assumptions within the DBN model among selected features. Unlike traditional models that are supervisedly trained, we automatically select the number of hidden states for fully unsupervised discovery of a single person's indoor routine. We emphasize unsupervised learning as it is practically unrealistic to obtain ground-truth labels for long term behaviours. The datasets used in this work are long term recordings of non-scripted activities in their native environments, each lasting for six days. The first captures the routine of three individuals in an office kitchen; the second is recorded in a residential kitchen. We evaluate the routine by comparing to ground-truth when present, using exhaustive search to relate discovered patterns to ground-truth ones. We also propose a graphical visualisation to represent and qualitatively evaluate the discovered routine.

[1]  Stuart J. Russell,et al.  Dynamic bayesian networks: representation, inference and learning , 2002 .

[2]  Lawrence B. Holder,et al.  Discovering Activities to Recognize and Track in a Smart Environment , 2011, IEEE Transactions on Knowledge and Data Engineering.

[3]  Tai-hoon Kim,et al.  Applications, Systems and Methods in Smart Home Technology: A Review , 2010 .

[4]  Sridhar Mahadevan,et al.  Learning hierarchical models of activity , 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566).

[5]  A. Savvides,et al.  Discovering Routine Events and their Periods in Sensor Time Series Data , 2009 .

[6]  Fazli Can,et al.  Concepts and effectiveness of the cover-coefficient-based clustering methodology for text databases , 1990, TODS.

[7]  Suzanne Martin,et al.  Smart home technologies for health and social care support. , 2008, The Cochrane database of systematic reviews.

[8]  Moritz Tenorth,et al.  The TUM Kitchen Data Set of everyday manipulation activities for motion tracking and action recognition , 2009, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops.

[9]  Shaogang Gong,et al.  Video Behaviour Mining Using a Dynamic Topic Model , 2011, International Journal of Computer Vision.

[10]  Michael Karg,et al.  Low cost activity recognition using depth cameras and context dependent spatial regions , 2014, AAMAS.

[11]  Svetha Venkatesh,et al.  Discovery of Activity Structures using the Hierarchical Hidden Markov Model , 2005, BMVC.

[12]  Shaogang Gong,et al.  Beyond Tracking: Modelling Activity and Understanding Behaviour , 2006, International Journal of Computer Vision.

[13]  Dima Damen,et al.  Unsupervised daily routine modelling from a depth sensor using top-down and bottom-up hierarchies , 2015, 2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR).

[14]  Alex Pentland,et al.  Coupled hidden Markov models for complex action recognition , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[15]  Andreas Savvides,et al.  Extracting spatiotemporal human activity patterns in assisted living using a home sensor network , 2008, PETRA '08.

[16]  Shaogang Gong,et al.  Activity based surveillance video content modelling , 2008, Pattern Recognit..

[17]  Monique Thonnat,et al.  Unsupervised discovery of human activities from long-time videos , 2015, IET Comput. Vis..

[18]  Silvio Savarese,et al.  Watch-n-Patch: Unsupervised Learning of Actions and Relations , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.