Finding an effective classification technique to develop a software team composition model

Ineffective software team composition has become recognized as a prominent aspect of software project failures. Reports from results extracted from different theoretical personality models have produced contradicting fits, validity challenges, and missing guidance during software development personnel selection. It is also believed that the technique/s used while developing a model can impact the overall results. Thus, this study aims to (1) discover an effective classification technique to solve the problem and (2) develop a model for composition of the software development team. The model developed was composed of 3 predictors: team role, personality types, and gender variables; it also contained 1 outcome: team performance variable. The techniques used for model development were logistic regression, decision tree, and rough sets theory (RST). Higher prediction accuracy and reduced pattern complexity were the 2 parameters for selecting the effective technique. Based on the results, the Johnson algorithm (JA) of RST appeared to be an effective technique for a team composition model. The study has proposed a set of 24 decision rules for finding effective team members. These rules involve gender classification to highlight the appropriate personality profile for software developers. In the end, this study concludes that selecting an appropriate classification technique is one of the most important factors in developing effective models.

[1]  Rohaida Romli,et al.  ASSESSING PERSONALITY TYPES PREFERENCES AMONGST SOFTWARE DEVELOPERS: A CASE OF MALAYSIA , 2015 .

[2]  F ROSENBLATT,et al.  The perceptron: a probabilistic model for information storage and organization in the brain. , 1958, Psychological review.

[3]  Richard H. Thayer,et al.  Software Engineering Project Management , 2000 .

[4]  Aleksander Ohrn,et al.  ROSETTA -- A Rough Set Toolkit for Analysis of Data , 1997 .

[5]  R. Pearson Species’ Distribution Modeling for Conservation Educators and Practitioners , 2010 .

[6]  Mia Hubert,et al.  Fast cross-validation of high-breakdown resampling methods for PCA , 2007, Comput. Stat. Data Anal..

[7]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[8]  Suzanne T Bell,et al.  Deep-level composition variables as predictors of team performance: a meta-analysis. , 2007, The Journal of applied psychology.

[9]  Kamal Imran Mohd Sharif,et al.  Software development team composition: Personality types of programmer and complex network , 2017 .

[10]  R. Ryan Nelson,et al.  IT Project Management: Infamous Failures, Classic Mistakes, and Best Practices , 2007, MIS Q. Executive.

[11]  Erran Carmel,et al.  Tactical Approaches for Alleviating Distance in Global Software Development , 2001, IEEE Softw..

[12]  Luiz Fernando Capretz Bringing the Human Factor to Software Engineering , 2014, IEEE Softw..

[13]  David W. Hosmer,et al.  Applied Logistic Regression , 1991 .

[14]  Kenneth L. Whipkey Identifying predictors of programming skill , 1984, SGCS.

[15]  CapretzLuiz Fernando,et al.  Making Sense of Software Development and Personality Types , 2010 .

[16]  Oguzhan Alagoz,et al.  Informatics in radiology: comparison of logistic regression and artificial neural network models in breast cancer risk estimation. , 2010, Radiographics : a review publication of the Radiological Society of North America, Inc.

[17]  Eileen M. Trauth,et al.  Cultural Diversity Challenges: Issues for Managing Globally Distributed Knowledge Workers in Software Development , 2008 .

[18]  Margaret H. Dunham,et al.  Data Mining: Introductory and Advanced Topics , 2002 .

[19]  Deborah Richards,et al.  Knowing‐doing gaps in ICT: gender and culture , 2013 .

[20]  Cezar Scarlat,et al.  Team Composition and Team Performance: Achieving Higher Quality Results in an International Higher Education Environment , 2013 .

[21]  Luiz Fernando Capretz,et al.  Making Sense of Software Development and Personality Types , 2010, IT Professional.

[22]  O. Colins,et al.  Psychopathic Personality in the General Population: Differences and Similarities Across Gender. , 2017, Journal of personality disorders.

[23]  David L. Olson,et al.  Advanced Data Mining Techniques , 2008 .

[24]  Tom DeMarco,et al.  Peopleware: Productive Projects and Teams , 1987 .

[25]  Abdul Rehman Gilal,et al.  A Rough-Fuzzy Inference System for Selecting Team Leader for Software Development Teams , 2017 .

[26]  Alan Howard On site: Software engineering project management , 2001, CACM.

[27]  Steve McConnell How to Defend an Unpopular Schedule , 1996, IEEE Softw..

[28]  A. Furnham The big five versus the big four: the relationship between the Myers-Briggs Type Indicator (MBTI) and NEO-PI five factor model of personality , 1996 .

[29]  Mazni Omar,et al.  Impact of personality and gender diversity on software development teams' performance , 2014, 2014 International Conference on Computer, Communications, and Control Technology (I4CT).

[30]  Mike Holcombe,et al.  A study into the effects of personality type and methodology on cohesion in software engineering teams , 2007, Behav. Inf. Technol..

[31]  Lucila Ohno-Machado,et al.  Logistic regression and artificial neural network classification models: a methodology review , 2002, J. Biomed. Informatics.

[32]  Abdul Rehman Gilal,et al.  Impact of software team composition methodology on the personality preferences of Malaysian students , 2016, 2016 3rd International Conference on Computer and Information Sciences (ICCOINS).

[33]  Mazni Omar,et al.  Identifying effective software engineering (SE) team personality types composition using rough set approach , 2010, 2010 International Symposium on Information Technology.

[34]  Azuraliza Abu Bakar,et al.  Predictive models for dengue outbreak using multiple rulebase classifiers , 2011, Proceedings of the 2011 International Conference on Electrical Engineering and Informatics.

[35]  James D. Arthur,et al.  An Objectives-Driven Process for Selecting Methods to Support Requirements Engineering Activities , 2005, 29th Annual IEEE/NASA Software Engineering Workshop.

[36]  Philip J. Stone,et al.  Experiments in induction , 1966 .

[37]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[38]  Kamal Imran Mohd Sharif,et al.  A rule-based approach for discovering effective software team composition , 2014 .

[39]  Rodney X. Sturdivant,et al.  Applied Logistic Regression: Hosmer/Applied Logistic Regression , 2005 .

[40]  Luiz Fernando Capretz,et al.  Evolution of software engineers' personality profile , 2012, SOEN.

[41]  Abdul Rehman Gilal,et al.  Making programmer suitable for team-leader: Software team composition based on personality types , 2015, 2015 International Symposium on Mathematical Sciences and Computing Research (iSMSC).

[42]  Kurt R. Linberg Software developer perceptions about software project failure: a case study , 1999, J. Syst. Softw..

[43]  Kasper Hornbæk,et al.  Some Whys and Hows of Experiments in Human-Computer Interaction , 2013, Found. Trends Hum. Comput. Interact..

[44]  Sotiris B. Kotsiantis,et al.  Machine learning: a review of classification and combining techniques , 2006, Artificial Intelligence Review.

[45]  Torgeir R. Hvidsten Fault Diagnosis in Rotating Machinery Using Rough Set Theory and ROSETTA , 1999 .

[46]  Shuib Basri,et al.  A rule-based model for software development team composition: Team leader role with personality types and gender classification , 2016, Inf. Softw. Technol..

[47]  Shane McIntosh,et al.  Revisiting the Impact of Classification Techniques on the Performance of Defect Prediction Models , 2015, 2015 IEEE/ACM 37th IEEE International Conference on Software Engineering.

[48]  Mazni Omar,et al.  Finding the Effectiveness of Software Team Members Using Decision Tree , 2015 .

[49]  Shuib Basri,et al.  Balancing the personality of programmer: software development team composition , 2016 .

[50]  Shuib Basri,et al.  A Set of Rules for Constructing Gender-Based Personality Types' Composition for Software Programmer , 2015, DaEng.

[51]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[52]  Tore Dybå,et al.  Team effectiveness in software development: Human and cooperative aspects in team effectiveness models and priorities for future studies , 2012, 2012 5th International Workshop on Co-operative and Human Aspects of Software Engineering (CHASE).

[53]  Narasimhaiah Gorla,et al.  Who should work with whom?: building effective software project teams , 2004, CACM.

[54]  Z. Pawlak Rough set approach to knowledge-based decision support , 1997 .

[55]  T. Dawson,et al.  Selecting thresholds of occurrence in the prediction of species distributions , 2005 .

[56]  Charles Elkan,et al.  Optimal Thresholding of Classifiers to Maximize F1 Measure , 2014, ECML/PKDD.

[57]  Ricardo Colomo Palacios,et al.  Software Project Managers under the Team Software Process: A Study of Competences , 2010, Int. J. Inf. Technol. Proj. Manag..

[58]  Valentyna Ivanivna Stakhnevich,et al.  Gender and personality , 2011 .

[59]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[60]  Luiz Fernando Capretz,et al.  Personality types of Cuban software developers , 2011 .

[61]  Amela Karahasanovic,et al.  A survey of controlled experiments in software engineering , 2005, IEEE Transactions on Software Engineering.

[62]  A. K. Mahmood,et al.  Mapping job requirements of software engineers to Big Five Personality Traits , 2012, 2012 International Conference on Computer & Information Science (ICCIS).

[63]  Charles J. Capps,et al.  Information systems development project performance in the 21st century , 2010, SOEN.

[64]  David S. Johnson,et al.  Approximation algorithms for combinatorial problems , 1973, STOC.

[65]  Sotiris B. Kotsiantis,et al.  Supervised Machine Learning: A Review of Classification Techniques , 2007, Informatica.

[66]  Eileen M. Trauth Theorizing Gender and Information Technology Research , 2006 .

[67]  Luiz Fernando Capretz,et al.  Influence of personality types in software tasks choices , 2015, Comput. Hum. Behav..

[68]  Theresa Beaubouef,et al.  Rough Sets , 2019, Lecture Notes in Computer Science.

[69]  Naimah Mohd Hussin,et al.  Analyzing personality types to predict team performance , 2010, 2010 International Conference on Science and Social Research (CSSR 2010).

[70]  David Greathead,et al.  Does personality matter?: an analysis of code-review ability , 2007, CACM.

[71]  Tony Gorschek,et al.  Empirical evidence in global software engineering: a systematic review , 2010, Empirical Software Engineering.

[72]  D. Cox The Regression Analysis of Binary Sequences , 1958 .

[73]  Luiz Fernando Capretz,et al.  Soft sides of software , 2017, Inf. Softw. Technol..

[74]  Paul T. Costa,et al.  Personality stability and its implications for clinical psychology , 1986 .

[75]  Shari Lawrence Pfleeger,et al.  Experimental design and analysis in software engineering , 1995, Ann. Softw. Eng..

[76]  J. R. Quinlan Discovering rules by induction from large collections of examples Intro-ductory readings in expert s , 1979 .

[77]  Philip S. Yu,et al.  Top 10 algorithms in data mining , 2007, Knowledge and Information Systems.

[78]  Atreyi Kankanhalli,et al.  Cross-cultural differences and information systems developer values , 2004, Decis. Support Syst..

[79]  Luiz Fernando Capretz,et al.  Forty years of research on personality in software engineering: A mapping study , 2015, Comput. Hum. Behav..

[80]  Gregory Piatetsky-Shapiro,et al.  The KDD process for extracting useful knowledge from volumes of data , 1996, CACM.

[81]  Fabio Q. B. da Silva,et al.  Team building criteria in software projects: A mix-method replicated study , 2013, Inf. Softw. Technol..

[82]  Sherlock A. Licorish,et al.  Supporting agile team composition: A prototype tool for identifying personality (In)compatibilities , 2009, 2009 ICSE Workshop on Cooperative and Human Aspects on Software Engineering.

[83]  Kenneth J. Chapman,et al.  Can’t We Pick our Own Groups? The Influence of Group Selection Method on Group Dynamics and Outcomes , 2006 .

[84]  Yang Hui Ad Hoc Networks Based on Rough Set Distance Leaming Method , 2011 .