Analyzing rich-club behavior in open source projects

The network of collaborations in an open source project can reveal relevant emergent properties that influence its prospects of success. In this work, we analyze open source projects to determine whether they exhibit a rich-club behavior, i.e., a phenomenon where contributors with a high number of collaborations (i.e., strongly connected within the collaboration network) are likely to cooperate with other well-connected individuals. The presence or absence of a rich-club has an impact on the sustainability and robustness of the project. For this analysis, we build and study a dataset with the 100 most popular projects in GitHub, exploiting connectivity patterns in the graph structure of collaborations that arise from commits, issues and pull requests. Results show that rich-club behavior is present in all the projects, but only few of them have an evident club structure. We compute coefficients both for single source graphs and the overall interaction graph, showing that rich-club behavior varies across different layers of software development. We provide possible explanations of our results, as well as implications for further analysis.

[1]  Alessandro Vespignani,et al.  Detecting rich-club ordering in complex networks , 2006, physics/0602134.

[2]  Maxwell A. Bertolero,et al.  The diverse club , 2017, Nature Communications.

[3]  Jordi Cabot,et al.  Gitana: A software project inspector , 2018, Sci. Comput. Program..

[4]  Aric Hagberg,et al.  Exploring Network Structure, Dynamics, and Function using NetworkX , 2008, Proceedings of the Python in Science Conference.

[5]  O. Sporns,et al.  Rich-Club Organization of the Human Connectome , 2011, The Journal of Neuroscience.

[6]  Shi Zhou,et al.  The rich-club phenomenon in the Internet topology , 2003, IEEE Communications Letters.

[7]  Jordi Cabot,et al.  Enabling the Definition and Enforcement of Governance Rules in Open Source Systems , 2015, 2015 IEEE/ACM 37th IEEE International Conference on Software Engineering.

[8]  Guido Conaldi,et al.  Flat for the Few, Steep for the Many: Structural Cohesion and Rich-Club Effect as Measures of Hierarchy and Control in FLOSS Communities , 2010, Int. J. Open Source Softw. Process..

[9]  R. Solé,et al.  Self-organization versus hierarchy in open-source social networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[10]  Luciano da Fontoura Costa,et al.  Rich-club phenomenon across complex network hierarchies , 2007 .

[11]  Tore Opsahl,et al.  Prominence and control: the weighted rich-club effect. , 2008, Physical review letters.

[12]  Antonio Lima,et al.  Coding Together at Scale: GitHub as a Collaborative Social Network , 2014, ICWSM.

[13]  Wei Song,et al.  The rich-club phenomenon of China's population flow network during the country's spring festival , 2018 .

[14]  Jing Liu,et al.  Multi-granularity evolution analysis of software using complex network theory , 2011, J. Syst. Sci. Complex..