Who Will Stay in the FLOSS Community? Modeling Participant’s Initial Behavior

Motivation: To survive and succeed, FLOSS projects need contributors able to accomplish critical project tasks. However, such tasks require the extensive project experience that long-term contributors (LTCs) have. Aim: We measure, understand, and predict how newcomers' involvement and environment in the issue tracking system (ITS) affect their odds of becoming an LTC. Method: ITS data of Mozilla and Gnome, literature, interviews, and online documents were used to design measures of involvement and environment. A logistic regression model was used to explain and predict a contributor's odds of becoming an LTC. We also reproduced the results on new data provided by Mozilla. Results: We constructed nine measures of involvement and environment based on events recorded in an ITS. Macro-climate is the overall project environment, while micro-climate is person-specific and varies among the participants. Newcomers who had at least one of the issues they reported in their first month fixed doubled their odds of becoming an LTC. A macro-climate of high project popularity and a micro-climate of low attention from peers both reduced the odds. The precision of LTC prediction was 38 times higher than for a random predictor. We were able to reproduce the results with new Mozilla data without losing the significance or predictive power of the previously published model. We encountered unexpected changes in some attributes and suggest ways to make analysis of ITS data more reproducible. Conclusions: The findings suggest the importance of the initial behaviors and experiences of new participants and outline empirically based approaches to help communities recruit contributors for long-term participation and to help participants contribute more effectively. To facilitate the reproduction of the study and of the proposed measures in other contexts, we provide the data we retrieved and the scripts we wrote at https://www.passion-lab.org/projects/developerfluency.html.
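
The Results are stated in terms of odds and of precision relative to a random predictor, which are easy to misread as statements about probabilities. The following is a minimal sketch, not the study's code, of the arithmetic behind both statements. The function names and every numeric input other than the odds ratio of 2 are hypothetical and chosen only for illustration; the abstract does not report a baseline LTC rate.

```python
def apply_odds_ratio(baseline_prob: float, odds_ratio: float) -> float:
    """Return the probability obtained by multiplying the baseline odds by odds_ratio."""
    if not 0.0 < baseline_prob < 1.0:
        raise ValueError("baseline_prob must be strictly between 0 and 1")
    if odds_ratio <= 0.0:
        raise ValueError("odds_ratio must be positive")
    baseline_odds = baseline_prob / (1.0 - baseline_prob)  # p / (1 - p)
    new_odds = baseline_odds * odds_ratio
    return new_odds / (1.0 + new_odds)  # convert odds back to a probability


def precision_lift(model_precision: float, base_rate: float) -> float:
    """Return how many times the model's precision exceeds that of a random predictor.

    A random predictor's expected precision equals the base rate of the positive class.
    """
    if not 0.0 < base_rate <= 1.0:
        raise ValueError("base_rate must be in (0, 1]")
    return model_precision / base_rate


# Hypothetical baseline: assume 2% of newcomers become LTCs (NOT a figure from the study).
hypothetical_baseline = 0.02
doubled = apply_odds_ratio(hypothetical_baseline, odds_ratio=2.0)
print(f"baseline probability: {hypothetical_baseline:.4f}")
print(f"probability after doubling the odds: {doubled:.4f}")
```

Doubling the odds is not the same as doubling the probability: the two nearly coincide when the baseline is small and diverge as it grows. For instance, a baseline probability of 0.5 corresponds to odds of 1, and doubling those odds gives a probability of 2/3, not 1.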
