Mining Communication Patterns in Software Development: A GitHub Analysis

Background: Studies related to human factors in software engineering are providing insightful information on the emotional state of contributors and the impact this has on the code. The open source software development paradigm involves different roles, and previous studies about emotions in software development have not taken into account what different roles might play when people express their feelings. Aim: We present an analysis of issues and commits on five GitHub projects distinguishing contributors between users and developers, and between one-commit and multi-commit developers. Method: We analyzed more than 650K comments from 130K issues of 64K contributors. We calculated emotions (love, joy, anger, sadness) and politeness of the comments related to the issues of the considered projects and introduced the definition of contributor fan-in and fan-out. Results: Results show that users and developers communicate differently as well as multi-commit developers and one-commit developers do. Conclusions: We provide empirical evidence that one-commit developers are more active and more polite in posting comments. Multi-commit developers are less active in posting comments, and while commenting, they are less polite than when commented.

[1]  Michele Marchesi,et al.  How diverse is your team? Investigating gender and nationality diversity in GitHub teams , 2017, Journal of Software Engineering Research and Development.

[2]  S. Siegel,et al.  Nonparametric Statistics for the Behavioral Sciences , 2022, The SAGE Encyclopedia of Research Design.

[3]  Marco Aurélio Gerosa,et al.  The hard life of open source software project newcomers , 2014, CHASE.

[4]  Natalia Juristo Juzgado,et al.  Towards understanding the relationship between team climate and software quality—a quasi-experimental study , 2008, Empirical Software Engineering.

[5]  Marco Aurélio Gerosa,et al.  Why do newcomers abandon open source software projects? , 2013, 2013 6th International Workshop on Cooperative and Human Aspects of Software Engineering (CHASE).

[6]  Julie F. Pallant,et al.  SPSS Survival Manual , 2020 .

[7]  Nicole Novielli,et al.  Towards discovering the role of emotions in stack overflow , 2014, SSE@SIGSOFT FSE.

[8]  M. E. Conway HOW DO COMMITTEES INVENT , 1967 .

[9]  R. Rosenthal Parametric measures of effect size. , 1994 .

[10]  Pekka Abrahamsson,et al.  Happy software developers solve problems better: psychological measurements in empirical software engineering , 2014, PeerJ.

[11]  Michele Marchesi,et al.  On the influence of maintenance activity types on the issue resolution time , 2014, PROMISE.

[12]  Mike Voight,et al.  A team building intervention program: application and evaluation with two university soccer teams. , 2001 .

[13]  Michele Marchesi,et al.  Are Bullies More Productive? Empirical Study of Affectiveness vs. Issue Fixing Time , 2015, 2015 IEEE/ACM 12th Working Conference on Mining Software Repositories.

[14]  Michele Marchesi,et al.  The JIRA Repository Dataset: Understanding Social Aspects of Software Development , 2015, PROMISE.

[15]  Michele Marchesi,et al.  Software development: do good manners matter? , 2016, PeerJ Comput. Sci..

[16]  Naoyasu Ubayashi,et al.  Magnet or sticky? an OSS project-by-project typology , 2014, MSR 2014.

[17]  A. Cockburn,et al.  Agile Software Development: The People Factor , 2001, Computer.

[18]  D. Yukelson Principles of effective team building interventions in sport: A direct services approach at Penn State University , 1997 .

[19]  Georgios Gousios,et al.  The GHTorent dataset and tool suite , 2013, 2013 10th Working Conference on Mining Software Repositories (MSR).

[20]  Eugene Kaluzniacky,et al.  Managing Psychological Factors in Information Systems Work: An Orientation to Emotional Intelligence , 2003 .

[21]  C. W. Hughes Emotion: Theory, Research and Experience , 1982 .

[22]  J. Russell A circumplex model of affect. , 1980 .

[23]  Audris Mockus,et al.  Impact of Triage: A Study of Mozilla and Gnome , 2013, 2013 ACM / IEEE International Symposium on Empirical Software Engineering and Measurement.

[24]  Amy Beth Warriner,et al.  Norms of valence, arousal, and dominance for 13,915 English lemmas , 2013, Behavior Research Methods.

[25]  Michele Marchesi,et al.  Arsonists or Firefighters? Affectiveness in Agile Software Development , 2016, XP.

[26]  Pekka Abrahamsson,et al.  Understanding the affect of developers: theoretical background and guidelines for psychoempirical software engineering , 2015, SSE@SIGSOFT FSE.

[27]  Michele Marchesi,et al.  On Measuring Affects of GitHub Issues' Commenters , 2018, 2018 IEEE/ACM 3rd International Workshop on Emotion Awareness in Software Engineering (SEmotion).

[28]  Jeffrey C. Carver,et al.  Understanding the Impressions, Motivations, and Barriers of One Time Code Contributors to FLOSS Projects: A Survey , 2017, 2017 IEEE/ACM 39th International Conference on Software Engineering (ICSE).

[29]  Martin Guha,et al.  APA Dictionary of Clinical Psychology , 2013 .

[30]  Audris Mockus,et al.  What make long term contributors: Willingness and opportunity in OSS community , 2012, 2012 34th International Conference on Software Engineering (ICSE).

[31]  Teresa M. Amabile,et al.  Affect and Creativity at Work , 2005 .

[32]  Pablo Gervás,et al.  SentiSense: An easily scalable concept-based affective lexicon for sentiment analysis , 2012, LREC.

[33]  Minhaz Fahim Zibran,et al.  Towards understanding and exploiting developers' emotional variations in software engineering , 2016, 2016 IEEE 14th International Conference on Software Engineering Research, Management and Applications (SERA).

[34]  Cindy K. Chung,et al.  Linguistic Inquiry and Word Count (LIWC): Pronounced “Luke,” . . . and Other Useful Facts , 2012 .

[35]  Andrew G. Miner,et al.  State mood, task performance, and behavior at work: A within-persons approach , 2010 .

[36]  Serge Demeyer,et al.  An exploratory qualitative and quantitative analysis of emotions in issue report comments of open source systems , 2017, Empirical Software Engineering.

[37]  Lefteris Angelis,et al.  Towards individualized software engineering: empirical studies should collect psychometrics , 2008, CHASE.

[38]  Pekka Abrahamsson,et al.  Software Developers, Moods, Emotions, and Performance , 2014, IEEE Software.

[39]  Michele Marchesi,et al.  Would you mind fixing this issue? - An Empirical Analysis of Politeness and Attractiveness in Software Developed Using Agile Boards , 2015, XP.

[40]  Jure Leskovec,et al.  A computational approach to politeness with application to social factors , 2013, ACL.

[41]  R Gary,et al.  APA Dictionary of Clinical Psychology , 2012 .

[42]  Nicole Novielli,et al.  Sentiment Polarity Detection for Software Development , 2017, Empirical Software Engineering.

[43]  A. Erez,et al.  The influence of positive affect on the components of expectancy motivation. , 2002, The Journal of applied psychology.

[44]  Robert M. Hierons,et al.  Do moods affect programmers’ debug performance? , 2011, Cognition, Technology & Work.

[45]  Eleni Stroulia,et al.  On the Personality Traits of StackOverflow Users , 2013, 2013 IEEE International Conference on Software Maintenance.

[46]  Michele Marchesi,et al.  On the randomness and seasonality of affective metrics for software development , 2017, SAC.

[47]  Jeffrey C. Carver,et al.  Are One-Time Contributors Different? A Comparison to Core and Periphery Developers in FLOSS Repositories , 2017, 2017 ACM/IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM).