Effort estimation by characterizing developer activity

During the latest years libre (free, open source) software has gained a lot of attention from the industry. Following this interest, the research community is also studying it. For instance, many teams are performing quantitative analysis on the large quantity of data which is publicly available from the development repositories maintained by libre software projects. However, not much of this research is focused on cost or effort estimations, despite its importance (for instance, for companies developing libre software or collaborating with libre software projects), and the availability of some data which could be useful for this purpose. Our position is that classical effort estimation models can be improved from the study of these data, at least when applied to libre software. In this paper, we focus on the characterization of developer activity, which we argue can improve effort estimation. This activity can be traced with a lot of detail, and the resulting data can also be used for validation of any effort estimation model.

[1]  Jesús M. González-Barahona,et al.  Legal Notice , 2005 .

[2]  Barry W. Boehm,et al.  Software Engineering Economics , 1993, IEEE Transactions on Software Engineering.

[3]  Audris Mockus,et al.  Identifying reasons for software changes using historic databases , 2000, Proceedings 2000 International Conference on Software Maintenance.

[4]  H. E. Dunsmore,et al.  Software engineering metrics and models , 1986 .

[5]  Gregorio Robles,et al.  Remote analysis and measurement of libre software systems by means of the CVSAnalY tool , 2004, ICSE 2004.

[6]  Jesus M. Gonzalez-Barahona Community structure of modules in the Apache project , 2004, ICSE 2004.

[7]  Gregorio Robles,et al.  The processes of joining in global distributed software projects , 2006, GSD '06.

[8]  Gregorio Robles,et al.  An Empirical Approach to Software Archaeology , 2005 .

[9]  Kieran Healy,et al.  The Ecology of Open-Source Software Development , 2003 .

[10]  J. Herbsleb,et al.  Two case studies of open source software development: Apache and Mozilla , 2002, TSEM.

[11]  Jesús M. González-Barahona,et al.  GluTheos: Automating the Retrieval and Analysis of Data from Publicly Available Software Repositories , 2004, MSR.

[12]  D HerbslebJames,et al.  Two case studies of open source software development , 2002 .

[13]  Daniel M. Germán,et al.  An empirical study of fine-grained software modifications , 2004, 20th IEEE International Conference on Software Maintenance, 2004. Proceedings..

[14]  Ellis Horowitz,et al.  Software Cost Estimation with COCOMO II , 2000 .

[15]  Michael W. Godfrey,et al.  Evolution in open source software: a case study , 2000, Proceedings 2000 International Conference on Software Maintenance.

[16]  Jesús M. González-Barahona,et al.  Applying Social Network Analysis to the Information in CVS Repositories , 2004, MSR.