Human Dynamics in Large Communication Networks

How often humans communicate with each other? What are the mechanisms that explain how human actions are distributed over time? Here we answer these questions by studying the time interval between calls and SMS messages in an anonymized, large mobile network, with 3.1 million users, over 200 million phone calls and 300 million SMS messages,spanning 70 GigaBytes. Our first contribution is the Truncated Autocatalytic Process (TAP ) model, that explains the time between communication events (ie., times between phone-initiations) for a single individual. The novelty is that the model is ’autocatalytic’, in the sense that the parameters of the model change, depending on the latest inter-event time: long periods of inactivity in the past result in long periods of inactivity in the future, and vice-versa. We show that the TAP model mimics the inter-event times of the users of our dataset extremely well, despite its parsimony and simplicity. Our second contribution is the TAP-classifier , a classification method based on the interevent times and in addition to other features. We showed that the inferred sleep intervals and the reciprocity between outgoing and incoming calls are good features to classify users. Finally, analyze the network effects of each class of users and we found surprising results. Moreover, all of our methods are fast, and scale linearly with the number of customers.

[1]  Albert-László Barabási,et al.  The origin of bursts and heavy tails in human dynamics , 2005, Nature.

[2]  Swapna S. Gokhale,et al.  Log-logistic software reliability growth model , 1998, Proceedings Third IEEE International High-Assurance Systems Engineering Symposium (Cat. No.98EX231).

[3]  Mark E. J. Newman,et al.  Power-Law Distributions in Empirical Data , 2007, SIAM Rev..

[4]  F. Haight Handbook of the Poisson Distribution , 1967 .

[5]  Jean-Pierre Eckmann,et al.  Entropy of dialogues creates coherent structures in e-mail traffic. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[6]  Maya Paczuski,et al.  Correlated dynamics in human printing behavior , 2004, ArXiv.

[7]  P. Fisk THE GRADUATION OF INCOME DISTRIBUTIONS , 1961 .

[8]  Albert-László Barabási,et al.  Modeling bursts and heavy tails in human dynamics , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[9]  M. O. Lorenz,et al.  Methods of Measuring the Concentration of Wealth , 1905, Publications of the American Statistical Association.

[10]  Jennifer Neville,et al.  Modeling relationship strength in online social networks , 2010, WWW '10.

[11]  M. I. Ahmad,et al.  Log-logistic flood frequency analysis , 1988 .

[12]  G. Seber Multivariate observations / G.A.F. Seber , 1983 .

[13]  Christos Faloutsos,et al.  Surprising Patterns for the Call Duration Distribution of Mobile Phone Users , 2010, ECML/PKDD.

[14]  Gordon Johnston,et al.  Statistical Models and Methods for Lifetime Data , 2003, Technometrics.

[15]  A. Barabasi,et al.  Dynamics of information access on the web. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[16]  Adilson E. Motter,et al.  A Poissonian explanation for heavy tails in e-mail communication , 2008, Proceedings of the National Academy of Sciences.

[17]  Michalis Faloutsos,et al.  On power-law relationships of the Internet topology , 1999, SIGCOMM '99.

[18]  Wang Bing-Hong,et al.  Heavy-Tailed Statistics in Short-Message Communication , 2009 .

[19]  Tanya Y. Berger-Wolf,et al.  A framework for community identification in dynamic social networks , 2007, KDD '07.

[20]  S. Bennett,et al.  Log‐Logistic Regression Models for Survival Data , 1983 .

[21]  T. Mahmood Survival of Newly Founded Businesses: A Log-Logistic Model Approach , 2000 .

[22]  B. Bollobás The evolution of random graphs , 1984 .

[23]  P. Erdos,et al.  On the evolution of random graphs , 1984 .