On Clusters in Open Source Ecosystems

This paper seeks to find characteristics of relationships between developers within various clusters of FLOSS ecosystems. We have mined the repository of the open source programming language Ruby, and linked developers and projects on the basis of collaboration. We used Social Network Analysis, and more specifically, the concept of modularity to expose underlying clusters or sub-communities. A survey was constructed to aid in the qualitative part of this research. The data shows that Ruby’s ecosystem consist mostly of single developers who work independently. Developers within clusters of a few developers often have personal relationships formed through friendship, work and the open source community. Personal relationships formed through the open source community grow as clusters consist of more developers. Developers in clusters with large number of developers are often unaware of friend-of-friend relationships. Project administrators, however, fail to list developers that contribute through pull/request issues as authors, making data on Ruby’s repository incomplete.

[1]  Claudio Castellano,et al.  Defining and identifying communities in networks. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Slinger Jansen,et al.  Steering Insight: An Exploration of the Ruby Software Ecosystem , 2011, ICSOB.

[3]  Greg Madey,et al.  THE OPEN SOURCE SOFTWARE DEVELOPMENT PHENOMENON: AN ANALYSIS BASED ON SOCIAL NETWORK THEORY , 2002 .

[4]  Yongqin Gao,et al.  Analysis and Modeling of the Open Source Software Community , 2003 .

[5]  Fang Wu,et al.  Finding communities in linear time: a physics approach , 2003, ArXiv.

[6]  Jin Xu,et al.  Application of Social Network Analysis to the Study of Open Source Software , 2006 .

[7]  Slinger Jansen,et al.  A sense of community: A research agenda for software ecosystems , 2009, 2009 31st International Conference on Software Engineering - Companion Volume.

[8]  Mathieu Bastian,et al.  Gephi: An Open Source Software for Exploring and Manipulating Networks , 2009, ICWSM.

[9]  Caroline Haythornthwaite,et al.  The strength and the impact of new media , 2001, Proceedings of the 34th Annual Hawaii International Conference on System Sciences.

[10]  David G. Messerschmitt,et al.  Software Ecosystem: Understanding an Indispensable Technology and Industry , 2003 .

[11]  Mark S. Granovetter The Strength of Weak Ties , 1973, American Journal of Sociology.

[12]  Jesús M. González-Barahona,et al.  Applying Social Network Analysis to the Information in CVS Repositories , 2004, MSR.

[13]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[14]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.