Empirical findings on team size and productivity in software development

The size of software project teams has been considered to be a driver of project productivity. Although there is a large literature on this, new publicly available software repositories allow us to empirically perform further research. In this paper we analyse the relationships between productivity, team size and other project variables using the International Software Benchmarking Standards Group (ISBSG) repository. To do so, we apply statistical approaches to a preprocessed subset of the ISBSG repository to facilitate the study. The results show some expected correlations between productivity, effort and time as well as corroborating some other beliefs concerning team size and productivity. In addition, this study concludes that in order to apply statistical or data mining techniques to these type of repositories extensive preprocessing of the data needs to be performed due to ambiguities, wrongly recorded values, missing values, unbalanced datasets, etc. Such preprocessing is a difficult and error prone activity that would need further guidance and information that is not always provided in the repository.

[1]  Reengineering Forum Proceedings of the 21st IEEE International Conference on Software Maintenance : ICSM 2005 , 2005 .

[2]  S. Green How Many Subjects Does It Take To Do A Regression Analysis. , 1991, Multivariate behavioral research.

[3]  Peter V. Norden Curve Fitting for a Model of Applied Research and Development Scheduling , 1958, IBM J. Res. Dev..

[4]  Marjan Hericko,et al.  An approach to optimizing software development team size , 2008, Inf. Process. Lett..

[5]  Allen S. Parrish,et al.  An Empirical Study Using Task Assignment Patterns to Improve the Accuracy of Software Effort Estimation , 2001, IEEE Trans. Software Eng..

[6]  Stephen R. Schach,et al.  A software metric for cost estimation and efficiency measurement in data processing system development , 1983, J. Syst. Softw..

[7]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[8]  June M. Verner Function Point Analysis , 2002 .

[9]  Giuliano Antoniol,et al.  Search-based techniques applied to optimization of project planning for a massive maintenance project , 2005, 21st IEEE International Conference on Software Maintenance (ICSM'05).

[10]  Pearl Brereton,et al.  Performing systematic literature reviews in software engineering , 2006, ICSE.

[11]  Enrique Alba,et al.  Software project management with GAs , 2007, Inf. Sci..

[12]  Enrique Alba,et al.  Using multi-objective metaheuristics to solve the software project scheduling problem , 2011, GECCO '11.

[13]  Peter Naudé,et al.  Strategic Software Development: Productivity Comparisons of General Development Programs , 2007 .

[14]  Enrique Alba,et al.  Today/future importance analysis , 2010, GECCO '10.

[15]  IEEE Std 1045-1992, IEEE Standard for Software Productivity Metrics , 1998 .

[16]  Guilherme Horta Travassos,et al.  Cross versus Within-Company Cost Estimation Studies: A Systematic Review , 2007, IEEE Transactions on Software Engineering.

[17]  June M. Verner,et al.  State of the practice: An exploratory analysis of schedule estimation and software project success prediction , 2007, Inf. Softw. Technol..

[18]  Giuliano Antoniol,et al.  Software project planning for robustness and completion time in the presence of uncertainty using multi objective search based software engineering , 2009, GECCO.

[19]  Giuliano Antoniol,et al.  The Effect of Communication Overhead on Software Maintenance Project Staffing: a Search-Based Approach , 2007, 2007 IEEE International Conference on Software Maintenance.

[20]  Stefan Biffl,et al.  Multiobjective evolutionary algorithm for software project portfolio optimization , 2010, GECCO '10.

[21]  Stuart E. Madnick,et al.  Software Project Dynamics: An Integrated Approach , 1991 .

[22]  Ioannis Stamelos,et al.  Software productivity and effort prediction with ordinal regression , 2005, Inf. Softw. Technol..

[23]  Parag C. Pendharkar,et al.  An empirical study of the impact of team size on software development effort , 2007, Inf. Technol. Manag..

[24]  José Javier Dolado,et al.  On the problem of the software cost function , 2001, Inf. Softw. Technol..

[25]  John E. Gaffney,et al.  Software Function, Source Lines of Code, and Development Effort Prediction: A Software Science Validation , 1983, IEEE Transactions on Software Engineering.

[26]  Fred P. Brooks,et al.  The Mythical Man-Month , 1975, Reliable Software.

[27]  Giuliano Antoniol,et al.  The use of search‐based optimization techniques to schedule and staff software projects: an approach and an empirical study , 2011, Softw. Pract. Exp..

[28]  Günther Ruhe,et al.  Search Based Software Engineering , 2013, Lecture Notes in Computer Science.

[29]  Jim Davies,et al.  Economies and diseconomies of scale in software development , 2011, J. Softw. Maintenance Res. Pract..

[30]  Giuliano Antoniol,et al.  Assessing staffing needs for a software maintenance project through queuing simulation , 2004, IEEE Transactions on Software Engineering.

[31]  Enrique Alba,et al.  Management of Software Projects with GAs , 2005 .

[32]  Jürgen Münch,et al.  Factors Influencing Software Development Productivity - State-of-the-Art and Industrial Experiences , 2009, Adv. Comput..

[33]  Tarek K. Abdel-Hamid,et al.  The Dynamics of Software Project Staffing: A System Dynamics Based Simulation Approach , 1989, IEEE Trans. Software Eng..

[34]  Stefan Biffl,et al.  Applied Soft Computing Software Project Portfolio Optimization with Advanced Multiobjective Evolutionary Algorithms , 2022 .

[35]  Lawrence H. Putnam,et al.  A General Empirical Solution to the Macro Software Sizing and Estimating Problem , 1978, IEEE Transactions on Software Engineering.

[36]  Suku Nair,et al.  A Model for Software Development Effort and Cost Estimation , 1997, IEEE Trans. Software Eng..

[37]  Montgomery Phister A model of the software development process , 1981, J. Syst. Softw..