Evolution of an Open Source Community Network – An Exploratory Study

The study attempts to better understand the evolution of the network structure using two snapshots of the developer-project affiliations in an Open Source Software (OSS) community. We use complex networks and social network theory to guide our analysis. We proceed by first extracting separate bipartite networks of projects in each of the five development stages – planning, pre-alpha, alpha, beta and production/stables stages. Then, by analyzing changes in the network using degree distributions, assortativity, component sizes, visualizations and p-star models, we try to infer the project-joining behavior of the OSS developers. Simulations are used to establish the significance of some findings. Highlights of our results are the higher levels of assortativity and networking in the Beta and Stable subnetworks, and a surprisingly higher level of connectivity of the Planning subnetwork. Significant clustering of projects is observed based on the programming language but not on other project attributes, including even license.

[1]  M E Newman,et al.  Scientific collaboration networks. I. Network construction and fundamental results. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[2]  R. Solé,et al.  Self-organization versus hierarchy in open-source social networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[3]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[4]  Sara B. Soderstrom,et al.  Dynamics of Dyads in Social Networks: Assortative, Relational, and Proximity Mechanisms , 2010 .

[5]  Béla Bollobás,et al.  Random Graphs , 1985 .

[6]  W. Powell,et al.  Network Dynamics and Field Evolution: The Growth of Interorganizational Collaboration in the Life Sciences1 , 2005, American Journal of Sociology.

[7]  Sergi Valverde,et al.  Topology and evolution of technology innovation networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[8]  Deepa Chandrasekaran,et al.  How Knowledge Overlap Drives (and Doesn’t Drive) Developer Preferences for Joining Related Open Source Software Projects , 2011 .

[9]  Lada A. Adamic,et al.  Zipf's law and the Internet , 2002, Glottometrics.

[10]  Jie Wu,et al.  Small Worlds: The Dynamics of Networks between Order and Randomness , 2003 .

[11]  Peter R. Monge,et al.  Theories of Communication Networks , 2003 .

[12]  B. Bollobás The evolution of random graphs , 1984 .

[13]  M. Newman,et al.  Why social networks are different from other types of networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[14]  R. Merton,et al.  On social structure and science , 1996 .

[15]  Peter R. Monge,et al.  Who connects with whom? A social network analysis of an online open source software community , 2011, First Monday.

[16]  Kate Ehrlich,et al.  Searching for experts in the enterprise: combining text and social network analysis , 2007, GROUP.

[17]  M. O. Lorenz,et al.  Methods of Measuring the Concentration of Wealth , 1905, Publications of the American Statistical Association.

[18]  Kevin Crowston,et al.  The Perils and Pitfalls of Mining SourceForge , 2004, MSR.

[19]  D. Watts The “New” Science of Networks , 2004 .

[20]  S. Bornholdt,et al.  Scale-free topology of e-mail networks. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[21]  S. Wasserman,et al.  Logit models and logistic regressions for social networks: I. An introduction to Markov graphs andp , 1996 .

[22]  Kuang-Yuan Huang,et al.  Relating and Clustering Free/Libre Open Source Software Projects and Developers: A Social Network Perspective , 2011, 2011 44th Hawaii International Conference on System Sciences.

[23]  Barry W. Boehm,et al.  Software Engineering Economics , 1993, IEEE Transactions on Software Engineering.

[24]  Igor M. Sokolov,et al.  Changing Correlations in Networks: Assortativity and Dissortativity , 2005 .

[25]  Fan Chung Graham,et al.  A random graph model for massive graphs , 2000, STOC '00.

[26]  M. Newman,et al.  Random graphs with arbitrary degree distributions and their applications. , 2000, Physical review. E, Statistical, nonlinear, and soft matter physics.

[27]  Vladimir Batagelj,et al.  Centrality in Social Networks , 1993 .

[28]  H E Stanley,et al.  Classes of small-world networks. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[29]  Selene Uras,et al.  Social Network Analysis of Communication in Open Source Projects , 2008, XP.

[30]  Nima Sarshar,et al.  Experience versus talent shapes the structure of the Web , 2008, Proceedings of the National Academy of Sciences.

[31]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[32]  Kate Ehrlich,et al.  Out of sight but not out of mind?: Informal networks, communication and media use in global software teams , 2007, CASCON.

[33]  P. Bonacich Power and Centrality: A Family of Measures , 1987, American Journal of Sociology.

[34]  Param Vir Singh,et al.  The small-world effect , 2010, ACM Trans. Softw. Eng. Methodol..