GitHub: factors influencing project activity levels

Open source software projects typically extend the capabilities of their software by incorporating code contributions from a diverse cross-section of developers. This GitHub structural path modelling study captures the current top 100 JavaScript projects in operation for at least one year or more. It draws on three theories (information integration, planned behavior, and social translucence) to help frame its comparative path approach, and to show ways to speed the collaborative development of GitHub OSS projects. It shows a project's activity level increases with: (1) greater responder-group collaborative efforts, (2) increased numbers of major critical project version releases, and (3) the generation of further commits. However, the generation of additional forks negatively impacts overall project activity levels.

[1]  Ken-ichi Matsumoto,et al.  Characteristics of Sustainable OSS Projects: A Theoretical and Empirical Study , 2015, 2015 IEEE/ACM 8th International Workshop on Cooperative and Human Aspects of Software Engineering.

[2]  J. Hamilton,et al.  Expectations-to-value: connecting customers with business offerings , 2015 .

[3]  Jason J. Jung,et al.  Social big data: Recent achievements and new challenges , 2015, Information Fusion.

[4]  Cristina V. Lopes,et al.  Is Popularity a Measure of Quality? An Analysis of Maven Components , 2014, 2014 IEEE International Conference on Software Maintenance and Evolution.

[5]  Eleni Stroulia,et al.  Co-evolution of project documentation and popularity within github , 2014, MSR 2014.

[6]  B. Tabachnick,et al.  Using multivariate statistics, 5th ed. , 2007 .

[7]  B. Muthén,et al.  How to Use a Monte Carlo Study to Decide on Sample Size and Determine Power , 2002 .

[8]  I. Ajzen The theory of planned behavior , 1991 .

[9]  Eirini Kalliamvakou,et al.  Understanding "watchers" on GitHub , 2014, MSR 2014.

[10]  Chiara Francalanci,et al.  Firms' involvement in Open Source projects: A trade-off between software structural quality and popularity , 2011, J. Syst. Softw..

[11]  David Lo,et al.  Why and how developers fork what from whom in GitHub , 2017, Empirical Software Engineering.

[12]  James C. Anderson,et al.  STRUCTURAL EQUATION MODELING IN PRACTICE: A REVIEW AND RECOMMENDED TWO-STEP APPROACH , 1988 .

[13]  Leif Singer,et al.  Software engineering at the speed of light: how developers stay current using twitter , 2014, ICSE.

[14]  Yuming Zhou,et al.  What Are the Dominant Projects in the GitHub Python Ecosystem? , 2016, 2016 Third International Conference on Trustworthy Systems and their Applications (TSA).

[15]  Marcelo de Almeida Maia,et al.  Understanding the popularity of reporters and assignees in the Github , 2014, SEKE.

[16]  Yana Momchilova Mileva Mining the evolution of software component usage , 2012 .

[17]  Jordi Cabot,et al.  A Systematic Mapping Study of Software Development With GitHub , 2017, IEEE Access.

[18]  James D. Herbsleb,et al.  Impression formation in online peer production: activity traces and personal profiles in github , 2013, CSCW.

[19]  Rex B. Kline,et al.  Principles and Practice of Structural Equation Modeling , 1998 .

[20]  Marco Tulio Valente,et al.  Understanding the Factors That Impact the Popularity of GitHub Repositories , 2016, 2016 IEEE International Conference on Software Maintenance and Evolution (ICSME).

[21]  John A. Hamilton,et al.  Understanding social network site consumer engagements , 2013, ACIS.

[22]  David Lo,et al.  Popularity, Interoperability, and Impact of Programming Languages in 100,000 Open Source Projects , 2013, 2013 IEEE 37th Annual Computer Software and Applications Conference.

[23]  NORA McDONALD,et al.  Modeling Distributed Collaboration on GitHub , 2014, Adv. Complex Syst..

[24]  Minghui Zhou,et al.  The Relationship Between Folder Use and the Number of Forks : A Case Study on Github Repositories , 2014 .

[26]  L. Harlow,et al.  Effects of estimation methods, number of indicators per factor, and improper solutions on structural equation modeling fit indices , 1995 .

[27]  Ang Li,et al.  An Exploratory Research of GitHub Based on Graph Model , 2015, 2015 Ninth International Conference on Frontier of Computer Science and Technology.

[28]  James D. Herbsleb,et al.  Social coding in GitHub: transparency and collaboration in an open software repository , 2012, CSCW.

[29]  John A. Hamilton,et al.  Engaging technologies-savvy consumers with the internet of things , 2015 .

[30]  John M. Carroll,et al.  Exploring the ecosystem of software developers on GitHub and other platforms , 2014, CSCW Companion '14.

[31]  Megan Squire Forge++: The Changing Landscape of FLOSS Development , 2014, 2014 47th Hawaii International Conference on System Sciences.

[32]  Bruce Ferwerda,et al.  GitHub developers use rockstars to overcome overflow of news , 2013, CHI Extended Abstracts.

[33]  H. E. Tinsley,et al.  Uses of factor analysis in counseling psychology research. , 1987 .

[34]  James D. Herbsleb,et al.  Let's talk about it: evaluating contributions through discussion in GitHub , 2014, SIGSOFT FSE.

[35]  Sarah Henderson How do people manage their documents?: an empirical investigation into personal document management practices among knowledge workers , 2009 .

[36]  Eirini Kalliamvakou,et al.  An in-depth study of the promises and perils of mining GitHub , 2016, Empirical Software Engineering.

[37]  Gang Yin,et al.  Reviewer Recommender of Pull-Requests in GitHub , 2014, 2014 IEEE International Conference on Software Maintenance and Evolution.