Going Farther Together: The Impact of Social Capital on Sustained Participation in Open Source

Sustained participation by contributors in open-source software is critical to the survival of open-source projects and can provide career advancement benefits to individual contributors. However, not all contributors reap the benefits of open-source participation fully, with prior work showing that women are particularly underrepresented and at higher risk of disengagement. While many barriers to participation in open source have been documented in the literature, relatively little is known about how the social networks that open-source contributors form affect their chances of long-term engagement. In this paper, we report on a mixed-methods empirical study of the role of social capital (i.e., the resources people can gain from their social connections) in sustained participation by women and men in open-source GitHub projects. After combining survival analysis on a large, longitudinal data set with insights derived from a user survey, we confirm that while social capital is beneficial for prolonged engagement for both genders, women are at a disadvantage in teams lacking diversity in expertise.
