Modeling Human Dynamics and Lifestyle Using Digital Traces

Human behavior drives a range of complex social, urban, and economic systems, yet understanding its structure and dynamics at the individual level remains an open question. From credit card transactions to communications data, human behavior appears to exhibit bursts of activity driven by task prioritization and periodicity, however, current research does not offer generative models capturing these mechanisms. We propose a multivariate, periodic Hawkes process (MPHP) model that captures -- at the individual level -- the temporal clustering of human activity, the interdependence structure and co-excitation of different activities, and the periodic effects of weekly rhythms. We also propose a scalable parameter estimation technique for this model using maximum-aposteriori expectation-maximization that additionally provides estimation of latent variables revealing branching structure of an individual's behavior patterns. We apply the model to a large dataset of credit card transactions, and demonstrate the MPHP outperforms a non-homogeneous Poisson model and LDA in both statistical fit for the distribution of inter-event times and an activity prediction task.

[1]  Dino Pedreschi,et al.  The retail market as a complex system , 2014, EPJ Data Science.

[2]  Manuel Cebrián,et al.  Social Media Fingerprints of Unemployment , 2014, PloS one.

[3]  Jari Saramäki,et al.  Daily Rhythms in Mobile Telephone Communication , 2015, PloS one.

[4]  A. Veen,et al.  Estimation of Space–Time Branching Process Models in Seismology Using an EM–Type Algorithm , 2006 .

[5]  Kazuo Yano,et al.  Self-exciting point process modeling of conversation event sequences , 2012, ArXiv.

[6]  Le Song,et al.  Learning Triggering Kernels for Multi-dimensional Hawkes Processes , 2013, ICML.

[7]  Marta C. González,et al.  Sequence of purchases in credit card data reveal life styles in urban populations , 2017, ArXiv.

[8]  Siddharth Gupta,et al.  The TimeGeo modeling framework for urban mobility without travel surveys , 2016, Proceedings of the National Academy of Sciences.

[9]  Ward Whitt,et al.  Asymptotics for M/G/1 low-priority waiting-time tail probabilities , 1997, Queueing Syst. Theory Appl..

[10]  Tim Jones,et al.  Understanding the Heavy Tailed Dynamics in Human Behavior , 2015, Physical review. E, Statistical, nonlinear, and soft matter physics.

[11]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[12]  E. Bacry,et al.  Non-parametric kernel estimation for symmetric Hawkes processes. Application to high frequency financial data , 2011, 1112.1838.

[13]  Ram Rajagopal,et al.  Household Energy Consumption Segmentation Using Hourly Data , 2014, IEEE Transactions on Smart Grid.

[14]  Marta C. González,et al.  Persistent cascades: Measuring fundamental communication structure in social networks , 2016, 2016 IEEE International Conference on Big Data (Big Data).

[15]  R. Hidalgo,et al.  Conditions for the emergence of scaling in the inter-event time of uncorrelated and seasonal systems , 2006 .

[16]  Le Song,et al.  Learning Social Infectivity in Sparse Low-rank Networks Using Multi-dimensional Hawkes Processes , 2013, AISTATS.

[17]  Ioane Muni Toke,et al.  An Introduction to Hawkes Processes with Applications to Finance , 2011 .

[18]  Scott W. Linderman,et al.  Bayesian Inference for Latent Hawkes Processes , 2017 .

[19]  Alan Cobham,et al.  Priority Assignment in Waiting Line Problems , 1954, Oper. Res..

[20]  Xiaowen Dong,et al.  Social Bridges in Urban Purchase Behavior , 2017, ACM Trans. Intell. Syst. Technol..

[21]  Albert-László Barabási,et al.  The origin of bursts and heavy tails in human dynamics , 2005, Nature.

[22]  Zhi-Dan Zhao,et al.  Relative clock verifies endogenous bursts of human dynamics , 2012 .

[23]  Albert-László Barabási,et al.  Limits of Predictability in Human Mobility , 2010, Science.

[24]  S. Xu,et al.  Household Segmentation by Load Shape and Daily Consumption , 2017 .

[25]  Christian Schneider,et al.  Spatiotemporal Patterns of Urban Human Mobility , 2012, Journal of Statistical Physics.

[26]  Kimmo Kaski,et al.  Circadian pattern and burstiness in mobile phone communication , 2011, 1101.0377.

[27]  Albert-László Barabási,et al.  Modeling bursts and heavy tails in human dynamics , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[28]  U. Military Persistent Cascades and the Structure of Influence in a Communication Network , 2017 .

[29]  Kimmo Kaski,et al.  Bursty Human Dynamics , 2017, ArXiv.

[30]  Kimmo Kaski,et al.  Correlated bursts and the role of memory range. , 2015, Physical review. E, Statistical, nonlinear, and soft matter physics.

[31]  Didier Sornette,et al.  Two-state Markov-chain Poisson nature of individual cellphone call statistics , 2015, ArXiv.

[32]  Giovanni Quattrone,et al.  Temporal analysis of activity patterns of editors in collaborative mapping project of OpenStreetMap , 2013, OpenSym.

[33]  M. Barthelemy,et al.  From mobile phone data to the spatial structure of cities , 2014, Scientific Reports.

[34]  Josep Blat,et al.  Urban association rules: Uncovering linked trips for shopping behavior , 2016, ArXiv.

[35]  Isabel Valera,et al.  Modeling Adoption and Usage of Competing Products , 2014, 2015 IEEE International Conference on Data Mining.

[36]  Yosihiko Ogata,et al.  On Lewis' simulation method for point processes , 1981, IEEE Trans. Inf. Theory.

[37]  Adilson E. Motter,et al.  A Poissonian explanation for heavy tails in e-mail communication , 2008, Proceedings of the National Academy of Sciences.

[38]  A. Barabasi,et al.  Human dynamics: Darwin and Einstein correspondence patterns , 2005, Nature.

[39]  Andrea L. Bertozzi,et al.  Point-process models of social network interactions: Parameter estimation and missing data recovery , 2015, European Journal of Applied Mathematics.

[40]  Marta C. González,et al.  Sequences of purchases in credit card data reveal lifestyles in urban populations , 2017, Nature Communications.

[41]  Marta C. González,et al.  Coupling human mobility and social ties , 2015, Journal of The Royal Society Interface.

[42]  Taha Yasseri,et al.  Circadian Patterns of Wikipedia Editorial Activity: A Demographic Analysis , 2011, PloS one.

[43]  Marijn ten Thij,et al.  Circadian Patterns in Twitter , 2014 .