Towards Extracting the Role and Behavior of Contributors in Open-source Projects

Lately, the popular open source paradigm and the adoption of agile methodologies have changed the way software is developed. Effective collaboration within software teams has become crucial for building successful products. In this context, harnessing the data available in online code hosting facilities can help towards understanding how teams work and optimizing the development process. Although there are several approaches that mine contributions’ data, they usually view contributors as a uniform body of engineers, and focus mainly on the aspect of productivity while neglecting the quality of the work performed. In this work, we design a methodology for identifying engineer roles in development teams and determine the behaviors that prevail for each role. Using a dataset of GitHub projects, we perform clustering against the DevOps axis, thus identifying three roles: developers that are mainly preoccupied with code commits, operations engineers that focus on task assignment and acceptance testing, and the lately popular role of DevOps engineers that are a mix of both. Our analysis further extracts behavioral patterns for each role, this way assisting team leaders in knowing their team and effectively directing responsibilities to achieve optimal workload balancing and task allocation.

[1]  Dimitris Panagiotou,et al.  Combining activity metrics and contribution topics for software recommendations , 2012, 2012 Third International Workshop on Recommendation Systems for Software Engineering (RSSE).

[2]  Jordi Cabot,et al.  Exploring the use of labels to categorize issues in Open-Source Software projects , 2015, 2015 IEEE 22nd International Conference on Software Analysis, Evolution, and Reengineering (SANER).

[3]  Iulian Neamtiu,et al.  The Journal of Systems and Software 85 (2012) 2275–2292 Contents lists available at SciVerse ScienceDirect The Journal of Systems and Software , 2022 .

[4]  Eirini Kalliamvakou,et al.  Mediterranean Conference on Information Systems ( MCIS ) 2009 Measuring Developer Contribution From Software Repository Data , 2017 .

[5]  Themistoklis G. Diamantopoulos,et al.  Towards Modeling the User-perceived Quality of Source Code using Static Analysis Metrics , 2017, ICSOFT.

[6]  Shu Li,et al.  Analysis of Software Developer Activity on a Distributed Version Control System , 2016, 2016 30th International Conference on Advanced Information Networking and Applications Workshops (WAINA).

[7]  D LaToza Thomas Crowdsourcing in software engineering: models, motivations, and challenges , 2019, ICSE 2019.

[8]  Bernd Fischer,et al.  CVExplorer: Identifying candidate developers by mining and exploring their open source contributions , 2016, 2016 31st IEEE/ACM International Conference on Automated Software Engineering (ASE).

[9]  Ken-ichi Matsumoto,et al.  A Study of the Characteristics of Developers' Activities in GitHub , 2013, 2013 20th Asia-Pacific Software Engineering Conference (APSEC).

[10]  Xiaoping Fan,et al.  Exploring the Characteristics of Issue-Related Behaviors in GitHub Using Visualization Techniques , 2018, IEEE Access.

[11]  Gail C. Murphy,et al.  Who should fix this bug? , 2006, ICSE.

[12]  Andreas Zeller,et al.  Change Bursts as Defect Predictors , 2010, 2010 IEEE 21st International Symposium on Software Reliability Engineering.

[13]  Papamichail Michail,et al.  User-Perceived Source Code Quality Estimation Based on Static Analysis Metrics , 2016 .

[14]  Christoph Treude,et al.  Assessing developer contribution with repository mining-based metrics , 2015, 2015 IEEE International Conference on Software Maintenance and Evolution (ICSME).

[15]  Parag C. Pendharkar,et al.  Benchmarking software development productivity of CMMI level 5 projects , 2015, Inf. Technol. Manag..

[16]  Liming Zhu,et al.  DevOps - A Software Architect's Perspective , 2015, SEI series in software engineering.

[17]  Benoit Baudry,et al.  "May the fork be with you": novel metrics to analyze collaboration on GitHub , 2014, WETSoM 2014.