Pattern-based causal relationships discovery from event sequences for modeling behavioral user profile in ubiquitous environments

This paper presents a novel and practical model for behavioral user profile modeling using causal relationships. In this model, causal relationships, which represent the influence among variables, are discovered from event sequences representing users behaviors, and used for modeling behavioral user profiles. Our model first discovers significant patterns using probabilistic suffix trees, and then discovers pattern correlations using a new sequence clustering algorithm and a modified version of the normalized mutual information (NMI) measure. Causal relationships between the significant patterns are then discovered using the transfer entropy approach. These relationships are used to construct the causal graphs of activities to generate user profiles. Through extensive experiments over a variety of datasets, we empirically demonstrate that these causality-based profiles lead to significant improvement of performance in activity prediction and user identification. We also show that our proposed model is generic and effective in constructing individual user profiles and common profiles for groups of users, in indoor and outdoor environments.

[1]  Trevor Darrell,et al.  Latent-Dynamic Discriminative Models for Continuous Gesture Recognition , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Byeong Man Kim,et al.  Constructing User Profiles for Collaborative Recommender System , 2004, APWeb.

[3]  Kent Larson,et al.  Activity Recognition in the Home Using Simple and Ubiquitous Sensors , 2004, Pervasive.

[4]  Analía Amandi,et al.  User profiling in personal information agents: a survey , 2005, The Knowledge Engineering Review.

[5]  D. Shopland,et al.  Toward the 1990 objectives for smoking: measuring the progress with 1985 NHIS data. , 1987, Public health reports.

[6]  Salvatore J. Stolfo,et al.  A Behavior-Based Approach to Securing Email Systems , 2003, MMM-ACNS.

[7]  Katarzyna J. Blinowska,et al.  A new method of the description of the information flow in the brain structures , 1991, Biological Cybernetics.

[8]  Henry A. Kautz,et al.  Inferring activities from interactions with objects , 2004, IEEE Pervasive Computing.

[9]  J L Fuller,et al.  Patterns of behavior. , 1966 .

[10]  Jiawei Han,et al.  Generating semantic annotations for frequent patterns with context analysis , 2006, KDD '06.

[11]  Bessam Abdulrazak,et al.  User’s behavior study for smart houses occupant prediction , 2010, Ann. des Télécommunications.

[12]  Fabio Crestani,et al.  Building user profiles from topic models for personalised search , 2013, CIKM.

[13]  Joyojeet Pal,et al.  Bayesian Networks: an Exploratory Tool for Understanding ICT Adoption , 2006, 2006 International Conference on Information and Communication Technologies and Development.

[14]  Gerlof Bouma,et al.  Normalized (pointwise) mutual information in collocation extraction , 2009 .

[15]  Fabio Abbattista,et al.  Extraction of User Profiles by Discovering Preferences through Machine Learning , 2003, IIS.

[16]  Yi Zhang,et al.  Complex adaptive filtering user profile using graphical models , 2008, Inf. Process. Manag..

[17]  Gordon Pipa,et al.  Transfer entropy—a model-free measure of effective connectivity for the neurosciences , 2010, Journal of Computational Neuroscience.

[18]  Jiong Yang,et al.  Towards automatic clustering of protein sequences , 2002, Proceedings. IEEE Computer Society Bioinformatics Conference.

[19]  Xing Xie,et al.  Learning transportation mode from raw gps data for geographic applications on the web , 2008, WWW.

[20]  Jiawei Han,et al.  Frequent pattern mining: current status and future directions , 2007, Data Mining and Knowledge Discovery.

[21]  Srivaths Ravi,et al.  Efficient fingerprint-based user authentication for embedded systems , 2005, Proceedings. 42nd Design Automation Conference, 2005..

[22]  Diane J. Cook,et al.  Improving home automation by discovering regularly occurring device usage patterns , 2003, Third IEEE International Conference on Data Mining.

[23]  Virgílio A. F. Almeida,et al.  Characterizing user navigation and interactions in online social networks , 2012, Inf. Sci..

[24]  Z. Zenn Bien,et al.  A Nonsupervised Learning Framework of Human Behavior Patterns Based on Sequential Actions , 2010, IEEE Transactions on Knowledge and Data Engineering.

[25]  C. Granger Investigating Causal Relations by Econometric Models and Cross-Spectral Methods , 1969 .

[26]  Analía Amandi,et al.  User profiling with Case-Based Reasoning and Bayesian Networks , 2000, IBERAMIA-SBIA 2000 Open Discussion Track.

[27]  Esteban García-Cuesta,et al.  User modeling: Through statistical analysis and subspace learning , 2012, Expert Syst. Appl..

[28]  Jiawei Han,et al.  Semantic annotation of frequent patterns , 2007, TKDD.

[29]  Yan Liu,et al.  Learning Temporal Causal Graphs for Relational Time-Series Analysis , 2010, ICML.

[30]  Wei Chu,et al.  A contextual-bandit approach to personalized news article recommendation , 2010, WWW '10.

[31]  Francesco Ricci,et al.  Context-Aware Recommender Systems , 2011, AI Mag..

[32]  Lora Aroyo,et al.  Analyzing user behavior across social sharing environments , 2013, ACM Trans. Intell. Syst. Technol..

[33]  Qiang Yang,et al.  Joint learning user's activities and profiles from GPS data , 2009, LBSN '09.

[34]  Eleazar Eskin,et al.  Detecting the Presence and Absence of Causal Relationships between Expression of Yeast Genes with Very Few Samples , 2010, J. Comput. Biol..

[35]  P. Spirtes,et al.  Causation, prediction, and search , 1993 .

[36]  Henry A. Kautz,et al.  Extracting Places and Activities from GPS Traces Using Hierarchical Conditional Random Fields , 2007, Int. J. Robotics Res..

[37]  C. Lee Giles,et al.  Probabilistic user behavior models , 2003, Third IEEE International Conference on Data Mining.

[38]  You-Jin Park,et al.  Individual and group behavior-based customer profile model for personalized product recommendation , 2009, Expert Syst. Appl..

[39]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[40]  Alexander J. Smola,et al.  Scalable distributed inference of dynamic user interests for behavioral targeting , 2011, KDD.

[41]  Michael E. Theologou,et al.  User Profile Modeling in the context of web-based learning management systems , 2008, J. Netw. Comput. Appl..

[42]  Olaf Sporns,et al.  Methods for quantifying the informational structure of sensory and motor data , 2007, Neuroinformatics.

[43]  B. Shipley Testing causal explanations in organismal biology : causation, correlation and structural equation modelling , 1999 .

[44]  Ravi Kumar,et al.  A Characterization of Online Search Behavior , 2009, IEEE Data Eng. Bull..

[45]  Schreiber,et al.  Measuring information transfer , 2000, Physical review letters.

[46]  Geert-Jan Houben,et al.  Cross-system user modeling and personalization on the Social Web , 2013, User Modeling and User-Adapted Interaction.

[47]  GeunSik Jo,et al.  User Preference Modeling from Positive Contents for Personalized Recommendation , 2007, Discovery Science.

[48]  Karl J. Friston,et al.  Dynamic causal modelling , 2003, NeuroImage.

[49]  Federica Cena,et al.  Anisotropic propagation of user interests in ontology-based user models , 2013, Inf. Sci..

[50]  Ashique Mahmood,et al.  Structure Learning of Causal Bayesian Networks: A Survey , 2011 .

[51]  Ahmed Hassan Awadallah,et al.  Beyond DCG: user behavior as a predictor of a successful search , 2010, WSDM '10.

[52]  F. Sung,et al.  Bidirectional relation between schizophrenia and epilepsy: A population‐based retrospective cohort study , 2011, Epilepsia.

[53]  Josep Lluís de la Rosa i Esteva,et al.  Concept-based learning of human behavior for customer relationship management , 2011, Inf. Sci..

[54]  John E. Laird,et al.  Variability in Human Behavior Modeling for Military Simulations , 2003 .

[55]  Yi Zhang,et al.  Bayesian adaptive user profiling with explicit & implicit feedback , 2006, CIKM '06.

[56]  R. A. Leibler,et al.  On Information and Sufficiency , 1951 .

[57]  Gwenn Englebienne,et al.  Accurate activity recognition in a home setting , 2008, UbiComp.

[58]  D N Glaser,et al.  TUTORIAL: CAUSAL MODELING AND PATIENT SATISFACTION , 1996, Quality management in health care.

[59]  J. Becker,et al.  Social network interactions: A buffer or a stress , 1983, American journal of community psychology.

[60]  Wayne Lee,et al.  Decision theory and human behavior , 1971 .

[61]  Shengrui Wang,et al.  A New Algorithm Based On Sequential Pattern Mining For Person Identification In Ubiquitous Environments , 2010 .

[62]  Francis Heylighen,et al.  Causality as distinction conservation: a theory of predictability, reversibility, and time order , 1989 .

[63]  Sanjay Chawla,et al.  Mining for Outliers in Sequential Databases , 2006, SDM.

[64]  Plamen P. Angelov,et al.  Creating Evolving User Behavior Profiles Automatically , 2012, IEEE Transactions on Knowledge and Data Engineering.

[65]  Geoffrey I. Webb,et al.  # 2001 Kluwer Academic Publishers. Printed in the Netherlands. Machine Learning for User Modeling , 1999 .

[66]  Wei-Ying Ma,et al.  Adaptive User Profile Model and Collaborative Filtering for Personalized News , 2006, APWeb.

[67]  Philip S. Yu,et al.  Coupled Behavior Analysis with Applications , 2012, IEEE Transactions on Knowledge and Data Engineering.

[68]  Stuart E. Middleton,et al.  Ontological user profiling in recommender systems , 2004, TOIS.