Scalable Daily Human Behavioral Pattern Mining from Multivariate Temporal Data

This work introduces a set of scalable algorithms to identify patterns of human daily behaviors. These patterns are extracted from multivariate temporal data that have been collected from smartphones. We have exploited sensors that are available on these devices, and have identified frequent behavioral patterns with a temporal granularity, which has been inspired by the way individuals segment time into events. These patterns are helpful to both end-users and third parties who provide services based on this information. We have demonstrated our approach on two real-world datasets and showed that our pattern identification algorithms are scalable. This scalability makes analysis on resource constrained and small devices such as smartwatches feasible. Traditional data analysis systems are usually operated in a remote system outside the device. This is largely due to the lack of scalability originating from software and hardware restrictions of mobile/wearable devices. By analyzing the data on the device, the user has the control over the data, i.e., privacy, and the network costs will also be removed.

[1]  Lionel M. Ni,et al.  An unsupervised framework for sensing individual and cluster behavior patterns from human mobile data , 2012, UbiComp.

[2]  J. Kruskal Nonmetric multidimensional scaling: A numerical method , 1964 .

[3]  Salvatore Orlando,et al.  Fast and memory efficient mining of frequent closed itemsets , 2006, IEEE Transactions on Knowledge and Data Engineering.

[4]  James Bailey,et al.  Automatically recognizing places of interest from unreliable GPS data using spatio-temporal density estimation and line intersections , 2015, Pervasive Mob. Comput..

[5]  Blaine A. Price,et al.  Wearables: has the age of smartwatches finally arrived? , 2015, Commun. ACM.

[6]  Alastair R. Beresford,et al.  Device analyzer: large-scale mobile data collection , 2014, PERV.

[7]  Xing Xie,et al.  Mining Individual Life Pattern Based on Location History , 2009, 2009 Tenth International Conference on Mobile Data Management: Systems, Services and Middleware.

[8]  Krzysztof Janowicz,et al.  On the semantic annotation of places in location-based social networks , 2011, KDD.

[9]  Katarzyna Wac,et al.  UbiqLog: a generic mobile phone-based life-log framework , 2013, Personal and Ubiquitous Computing.

[10]  Chelsea Dobbins,et al.  Lesson Learned from Collecting Quantified Self Information via Mobile and Wearable Devices , 2015, J. Sens. Actuator Networks.

[11]  Daniel Gatica-Perez,et al.  A probabilistic approach to mining mobile phone data sequences , 2013, Personal and Ubiquitous Computing.

[12]  Ming-Syan Chen,et al.  Mining top-k frequent patterns in the presence of the memory constraint , 2008, The VLDB Journal.

[13]  Won Suk Lee,et al.  CP-tree: An adaptive synopsis structure for compressing frequent itemsets over online data streams , 2014, Inf. Sci..

[14]  Katarzyna Wac,et al.  Getting closer: an empirical investigation of the proximity of user to their smart phones , 2011, UbiComp '11.

[15]  Ming-Syan Chen,et al.  Mining Group Movement Patterns for Tracking Moving Objects Efficiently , 2011, IEEE Transactions on Knowledge and Data Engineering.

[16]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[17]  George Kollios,et al.  Mining, indexing, and querying historical spatiotemporal data , 2004, KDD.

[18]  Peter Ruppel,et al.  Combining GPS and GSM Cell-ID positioning for Proactive Location-based Services , 2007, 2007 Fourth Annual International Conference on Mobile and Ubiquitous Systems: Networking & Services (MobiQuitous).

[19]  Denzil Ferreira,et al.  Understanding Human-Smartphone Concerns: A Study of Battery Life , 2011, Pervasive.

[20]  Cem Ersoy,et al.  A Review and Taxonomy of Activity Recognition on Mobile Phones , 2013 .

[21]  Ramesh Govindan,et al.  Energy-efficient positioning for smartphones using Cell-ID sequence matching , 2011, MobiSys '11.

[22]  Alex Pentland,et al.  Reality mining: sensing complex social systems , 2006, Personal and Ubiquitous Computing.

[23]  James Bailey,et al.  Mining Probabilistic Frequent Spatio-Temporal Sequential Patterns with Gap Constraints from Uncertain Databases , 2013, 2013 IEEE 13th International Conference on Data Mining.

[24]  Chelsea Dobbins,et al.  Clustering of Physical Activities for Quantified Self and mHealth Applications , 2015, 2015 IEEE International Conference on Computer and Information Technology; Ubiquitous Computing and Communications; Dependable, Autonomic and Secure Computing; Pervasive Intelligence and Computing.

[25]  Jian Pei,et al.  Mining frequent patterns without candidate generation , 2000, SIGMOD '00.

[26]  Sushil Jajodia,et al.  Time Granularities in Databases, Data Mining, and Temporal Reasoning , 2000, Springer Berlin Heidelberg.

[27]  Jae-Gil Lee,et al.  Mining Discriminative Patterns for Classifying Trajectories on Road Networks , 2011, IEEE Transactions on Knowledge and Data Engineering.

[28]  Dino Pedreschi,et al.  Human mobility, social ties, and link prediction , 2011, KDD.

[29]  Margaret Martonosi,et al.  Identifying Important Places in People's Lives from Cellular Network Data , 2011, Pervasive.

[30]  Eamonn J. Keogh,et al.  On the Need for Time Series Data Mining Benchmarks: A Survey and Empirical Demonstration , 2002, Data Mining and Knowledge Discovery.

[31]  Sourav Bhattacharyaa,et al.  Towards Using Unlabeled Data in a Sparse-coding Framework for Human Activity Recognition , 2014 .

[32]  Thomas Plötz,et al.  Using unlabeled data in a sparse-coding framework for human activity recognition , 2014, Pervasive Mob. Comput..

[33]  Saeed Moghaddam,et al.  MobileMiner: mining your frequent patterns on your phone , 2014, UbiComp.

[34]  Robin Le Poidevin,et al.  The Experience and Perception of Time , 2000 .

[35]  A Min Tjoa,et al.  Securing Shareable Life-logs , 2010, 2010 IEEE Second International Conference on Social Computing.

[36]  James F. Allen Maintaining knowledge about temporal intervals , 1983, CACM.

[37]  Fabian Mörchen,et al.  Efficient mining of understandable patterns from multivariate interval time series , 2007, Data Mining and Knowledge Discovery.

[38]  Pedro José Marrón,et al.  Micro-navigation for urban bus passengers: using the internet of things to improve the public transport experience , 2014 .

[39]  Richard A. Harshman,et al.  Indexing by Latent Semantic Analysis , 1990, J. Am. Soc. Inf. Sci..

[40]  D. Gática-Pérez,et al.  Towards rich mobile phone datasets: Lausanne data collection campaign , 2010 .

[41]  Antonio Gomariz,et al.  SPMF: a Java open-source pattern mining library , 2014, J. Mach. Learn. Res..

[42]  Andrew T. Campbell,et al.  From Smart to Cognitive Phones , 2012, IEEE Pervasive Computing.

[43]  Suman Nath,et al.  ACE: Exploiting Correlation for Energy-Efficient and Continuous Context Sensing , 2012, IEEE Transactions on Mobile Computing.

[44]  S. Shekhar,et al.  Discovering Personal Paths from Sparse GPS Traces , 2005 .

[45]  Enhong Chen,et al.  A habit mining approach for discovering similar mobile users , 2012, WWW.

[46]  Chelsea Dobbins,et al.  The Big Data Obstacle of Lifelogging , 2014, 2014 28th International Conference on Advanced Information Networking and Applications Workshops.