The Metabolism and Growth of Web Forums

We view web forums as virtual living organisms feeding on user's clicks and investigate how they grow at the expense of clickstreams. We find that (the number of page views in a given time period) and (the number of unique visitors in the time period) of the studied forums satisfy the law of the allometric growth, i.e., . We construct clickstream networks and explain the observed temporal dynamics of networks by the interactions between nodes. We describe the transportation of clickstreams using the function , in which is the total amount of clickstreams passing through node and is the amount of the clickstreams dissipated from to the environment. It turns out that , an indicator for the efficiency of network dissipation, not only negatively correlates with , but also sets the bounds for . In particular, when and when . Our findings have practical consequences. For example, can be used as a measure of the “stickiness” of forums, which quantifies the stable ability of forums to remain users “lock-in” on the forum. Meanwhile, the correlation between and provides a method to predict the long-term “stickiness” of forums from the clickstream data in a short time period. Finally, we discuss a random walk model that replicates both of the allometric growth and the dissipation function .

[1]  Claudio J. Tessone,et al.  Sustainable growth in complex networks , 2010, 1007.1330.

[2]  Sven Erik Jørgensen,et al.  Ecosystems emerging: 2. Dissipation , 1999 .

[3]  Bernardo A. Huberman,et al.  The laws of the web - patterns in the ecology of information , 2001 .

[4]  James H. Brown,et al.  Toward a metabolic theory of ecology , 2004 .

[5]  Erick Cantú-Paz,et al.  Personalized click prediction in sponsored search , 2010, WSDM '10.

[6]  Sunil Gupta,et al.  Choice and the Internet: From Clickstream to Research Stream , 2002 .

[7]  Bart Cammaerts,et al.  Online Political Debate, Unbounded Citizenship, and the Problematic Nature of a Transnational Public Sphere , 2005 .

[8]  Fang Wu,et al.  Novelty and collective attention , 2007, Proceedings of the National Academy of Sciences.

[9]  A. Bejan,et al.  The constructal law and the evolution of design in nature. , 2011, Physics of life reviews.

[10]  M. Newman Power laws, Pareto distributions and Zipf's law , 2005 .

[11]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[12]  Kristian J. Hammond,et al.  Mining navigation history for recommendation , 2000, IUI '00.

[13]  Tim O'Reilly,et al.  What is Web 2.0: Design Patterns and Business Models for the Next Generation of Software , 2007 .

[14]  Jiang Zhang,et al.  Scaling behaviors of weighted food webs as energy transportation networks. , 2010, Journal of theoretical biology.

[15]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[16]  Zengru Di,et al.  Analyzing netizens' view and reply behaviors on the forum , 2009, 0908.4388.

[17]  Guido Caldarelli,et al.  Universal scaling relations in food webs , 2003, Nature.

[18]  Martin Halvey,et al.  WWW '07: Proceedings of the 16th international conference on World Wide Web , 2007, WWW 2007.

[19]  Mark S. Ackerman,et al.  Expertise networks in online communities: structure and algorithms , 2007, WWW '07.

[20]  Jiang Zhang,et al.  Allometry and Dissipation of Ecological Flow Networks , 2013, PloS one.

[21]  Anja Feldmann,et al.  Proceedings of the 9th ACM SIGCOMM Conference on Internet Measurement 2009, Chicago, Illinois, USA, November 4-6, 2009 , 2009, IMC 2009.

[22]  Vittorio Loreto,et al.  Semiotic dynamics and collaborative tagging , 2006, Proceedings of the National Academy of Sciences.

[23]  Bamshad Mobasher,et al.  Personalized recommendation in social tagging systems using hierarchical clustering , 2008, RecSys '08.

[24]  Michael A. Rodriguez,et al.  Clickstream Data Yields High-Resolution Maps of Science , 2009, PloS one.

[25]  M. Stephens EDF Statistics for Goodness of Fit and Some Comparisons , 1974 .

[26]  Freimut Bodendorf,et al.  Detecting opinion leaders and trends in online social networks , 2009, CIKM-SWSM.

[27]  Jaideep Srivastava,et al.  Discovery of Interesting Usage Patterns from Web Data , 1999, WEBKDD.

[28]  Kristina Lerman,et al.  Information Contagion: An Empirical Study of the Spread of News on Digg and Twitter Social Networks , 2010, ICWSM.

[29]  James H. Brown,et al.  A General Model for the Origin of Allometric Scaling Laws in Biology , 1997, Science.

[30]  Virgílio A. F. Almeida,et al.  Characterizing user behavior in online social networks , 2009, IMC '09.

[31]  Gerald L. Lohse,et al.  Cognitive Lock-In and the Power Law of Practice , 2003 .

[32]  Vittorio Loreto,et al.  Collaborative Tagging and Semiotic Dynamics , 2006, ArXiv.

[33]  Christos Faloutsos,et al.  Graph evolution: Densification and shrinking diameters , 2006, TKDD.

[34]  Michael Bieber,et al.  A clickstream-based collaborative filtering personalization model: towards a better performance , 2004, WIDM '04.

[35]  Jaideep Srivastava,et al.  Web usage mining: discovery and application of interesting patterns from web data , 2000 .

[36]  Saleem N. Bhatti,et al.  Modelling user behaviour in networked games , 2001, MULTIMEDIA '01.

[37]  Vittorio Loreto,et al.  Collective dynamics of social annotation , 2009, Proceedings of the National Academy of Sciences.

[38]  Masahiko Higashi,et al.  Extended input-output flow analysis of ecosystems , 1986 .

[39]  Catarina Sismeiro,et al.  A Model of Web Site Browsing Behavior Estimated on Clickstream Data , 2003 .

[40]  James H. Brown,et al.  The fourth dimension of life: fractal geometry and allometric scaling of organisms. , 1999, Science.

[41]  Fang Wu,et al.  Crowdsourcing, attention and productivity , 2008, J. Inf. Sci..

[42]  Fang Wu,et al.  Feedback Loops of Attention in Peer Production , 2009, 2009 International Conference on Computational Science and Engineering.

[43]  D. Helbing,et al.  Growth, innovation, scaling, and the pace of life in cities , 2007, Proceedings of the National Academy of Sciences.

[44]  Amos Maritan,et al.  Size and form in efficient transportation networks , 1999, Nature.

[45]  Huberman,et al.  Strong regularities in world wide web surfing , 1998, Science.

[46]  Jiang Zhang,et al.  Accelerating growth and size-dependent distribution of human online activities. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[47]  Hsinchun Chen,et al.  Applying authorship analysis to extremist-group Web forum messages , 2005, IEEE Intelligent Systems.