Human dynamics revealed through Web analytics

The increasing ubiquity of Internet access and the frequency with which people interact with it raise the possibility of using the Web to better observe, understand, and monitor several aspects of human social behavior. Web sites with large numbers of frequently returning users are ideal for this task. If these sites belong to companies or universities, their usage patterns can furnish information about the working habits of entire populations. In this work, we analyze the properly anonymized logs detailing the access history to Emory University's Web site. Emory is a medium-sized university located in Atlanta, Georgia. We find interesting structure in the activity patterns of the domain and study in a systematic way the main forces behind the dynamics of the traffic. In particular, we find that linear preferential linking, priority-based queuing, and the decay of interest for the contents of the pages are the essential ingredients to understand the way users navigate the Web.

[1]  Fang Wu,et al.  Novelty and collective attention , 2007, Proceedings of the National Academy of Sciences.

[2]  Albert-László Barabási,et al.  Internet: Diameter of the World-Wide Web , 1999, Nature.

[3]  S. Redner,et al.  Organization of growing random networks. , 2000, Physical review. E, Statistical, nonlinear, and soft matter physics.

[4]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[5]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[6]  M. V. Simkin,et al.  A theory of web traffic , 2007, 0711.1235.

[7]  Beom Jun Kim,et al.  Role of activity in human dynamics , 2007, EPL (Europhysics Letters).

[8]  R. Rosenfeld Nature , 2009, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.

[9]  David Bawden,et al.  Book Review: Evolution and Structure of the Internet: A Statistical Physics Approach. , 2006 .

[10]  Huberman,et al.  Strong regularities in world wide web surfing , 1998, Science.

[11]  Ravi Kumar,et al.  Self-similarity in the web , 2001, TOIT.

[12]  G. Madey,et al.  Uncovering individual and collective human dynamics from mobile phone records , 2007, 0710.2939.

[13]  Alexei Vázquez,et al.  Exact results for the Barabási model of human dynamics. , 2005, Physical review letters.

[14]  Vittorio Loreto,et al.  Collaborative Tagging and Semiotic Dynamics , 2006, ArXiv.

[15]  Albert-László Barabási,et al.  Evolution of Networks: From Biological Nets to the Internet and WWW , 2004 .

[16]  Wendy Hall,et al.  Creating a Science of the Web , 2006, Science.

[17]  Albert-László Barabási,et al.  The origin of bursts and heavy tails in human dynamics , 2005, Nature.

[18]  C. Cattuto,et al.  A Yule-Simon process with memory , 2006 .

[19]  S. N. Dorogovtsev,et al.  Scaling properties of scale-free evolving networks: continuous approach. , 2000, Physical review. E, Statistical, nonlinear, and soft matter physics.

[20]  J. G. Oliveira,et al.  Human Dynamics: The Correspondence Patterns of Darwin and Einstein , 2005 .

[21]  Guido Caldarelli,et al.  Preferential attachment in the growth of social networks: the case of Wikipedia , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[22]  A-L Barabási,et al.  Structure and tie strengths in mobile communication networks , 2006, Proceedings of the National Academy of Sciences.

[23]  D. Watts A twenty-first century science , 2007, Nature.

[24]  Filippo Menczer,et al.  On the lack of typical behavior in the global Web traffic network , 2005, WWW '05.

[25]  S. Redner,et al.  Connectivity of growing random networks. , 2000, Physical review letters.

[26]  S. Bornholdt,et al.  World Wide Web scaling exponent from Simon's 1955 model. , 2000, Physical review. E, Statistical, nonlinear, and soft matter physics.

[27]  J. M. Oshorn Proc. Nat. Acad. Sei , 1978 .

[28]  S. N. Dorogovtsev,et al.  WWW and Internet models from 1955 till our days and the ``popularity is attractive'' principle , 2000, cond-mat/0009090.

[29]  K. Pearson,et al.  Biometrika , 1902, The American Naturalist.

[30]  S. Redner Citation statistics from 110 years of physical review , 2005, physics/0506056.

[31]  Albert-László Barabási,et al.  Modeling bursts and heavy tails in human dynamics , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[32]  Vittorio Loreto,et al.  Semiotic dynamics and collaborative tagging , 2006, Proceedings of the National Academy of Sciences.

[33]  H. Simon,et al.  ON A CLASS OF SKEW DISTRIBUTION FUNCTIONS , 1955 .

[34]  Alexei Vazquez Impact of memory on human dynamics , 2007 .

[35]  James E. Pitkow,et al.  Characterizing Browsing Strategies in the World-Wide Web , 1995, Comput. Networks ISDN Syst..

[36]  Santo Fortunato,et al.  Ranking web sites with real user traffic , 2008, WSDM '08.

[37]  A. Barabasi,et al.  Evolution of the social network of scientific collaborations , 2001, cond-mat/0104162.

[38]  Filippo Menczer,et al.  Growing and navigating the small world Web by local content , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[39]  R Pastor-Satorras,et al.  Dynamical and correlation properties of the internet. , 2001, Physical review letters.