Modeling success in FLOSS project groups

A significant challenge in software engineering is accurately modeling projects in order to correctly forecast success or failure. The primary difficulty is that software development efforts are complex in terms of both the technical and social aspects of the engineering environment. This is compounded by the lack of real data that captures both the measures of success in performing a process, and the measures that reflect a group's social dynamics. This research focuses on the development of a model for predicting software project success that leverages the wealth of available open source project data in order to accurately forecast the behavior of those software engineering groups. The model accounts for both the technical elements of software engineering and the social elements that drive the decisions of individual developers. Agent-based simulations are used to represent the complexity of the group interactions, and the behavior of each agent is based on the acquired open source software engineering data. For four of the five project success measures, the results indicate that the developed model represents the underlying data well and provides accurate predictions of open source project success indicators.

[1]  Ron Kohavi,et al.  Supervised and Unsupervised Discretization of Continuous Features , 1995, ICML.

[2]  J. Hackman,et al.  The design of work teams , 1987 .

[3]  HighWire Press Philosophical Transactions of the Royal Society of London , 1781, The London Medical Journal.

[4]  Sean Luke,et al.  MASON: A Multiagent Simulation Environment , 2005, Simul..

[5]  Guy A. Schiavone,et al.  Spatial Data Analysis as a Software Quality Modeling Technique , 2003, SEKE.

[6]  Kevin Crowston,et al.  Effective Work Practices for FLOSS Development: A Model and Propositions , 2005, Proceedings of the 38th Annual Hawaii International Conference on System Sciences.

[7]  Chen Zhang,et al.  Impact of Social Ties on Open Source Project Team Formation , 2006, OSS.

[8]  T. Bayes An essay towards solving a problem in the doctrine of chances , 2003 .

[9]  H. E. Dunsmore,et al.  Software engineering metrics and models , 1986 .

[10]  Margaret J. Robertson,et al.  Design and Analysis of Experiments , 2006, Handbook of statistics.

[11]  Kevin Crowston,et al.  Core and Periphery in Free/Libre and Open Source Software Team Communications , 2006, Proceedings of the 39th Annual Hawaii International Conference on System Sciences (HICSS'06).

[12]  Taghi M. Khoshgoftaar,et al.  Predicting software errors, during development, using nonlinear regression models: a comparative study , 1992 .

[13]  Stephen H. Kan,et al.  Metrics and Models in Software Quality Engineering , 1994, SOEN.

[14]  Kevin Crowston,et al.  Towards a Portfolio of FLOSS Project Success Measures , 2004, ICSE 2004.

[15]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[16]  Kevin Crowston,et al.  Defining Open Source Software Project Success , 2003, ICIS.

[17]  Akif Günes Koru,et al.  An Exploratory Study on the Evolution of OSS Developer Communities , 2008, Proceedings of the 41st Annual Hawaii International Conference on System Sciences (HICSS 2008).

[18]  Tze-Jie Yu,et al.  Identifying Error-Prone Software—An Empirical Study , 1985, IEEE Transactions on Software Engineering.