Success Factors of Open Source Software Projects using Datamining Technique

We present a research to find the success factors of 5000 most-downloaded Open Source projects at sourceforge.net website. The common parameters of these projects such as rank, download, activity, members, translation, operating systems, license, programming language, user interface, topic, and duration are extracted and classified into a Database. The Association Rule Datamining technique is used to find the rules that determines the success factors of the Open Source project using Weka Datamining tool. These useful rules is then interpreted as the success factors that may be enforced by future initiator of Open Source project in order to be successful. The study find some interesting results in which these success factors may or may not be influenced directly by the project's initiator and other project's members.

[1]  Izzat Alsmadi,et al.  Open Source Evolution Analysis , 2006, 2006 22nd IEEE International Conference on Software Maintenance.

[2]  Sebastian Spaeth,et al.  Knowledge Reuse in Open Source Software: An Exploratory Study of 15 Open Source Projects , 2005, Proceedings of the 38th Annual Hawaii International Conference on System Sciences.

[3]  Yutaka Yamauchi,et al.  Collaboration with Lean Media: how open-source software succeeds , 2000, CSCW '00.

[4]  James M. Bieman,et al.  The FreeBSD project: a replication case study of open source development , 2005, IEEE Transactions on Software Engineering.

[5]  Audris Mockus,et al.  A case study of open source software development: the Apache server , 2000, Proceedings of the 2000 International Conference on Software Engineering. ICSE 2000 the New Millennium.

[6]  Sebastian Spaeth,et al.  Sampling in Open Source Software Development: The Case for Using the Debian GNU/Linux Distribution , 2007, 2007 40th Annual Hawaii International Conference on System Sciences (HICSS'07).

[7]  Mary Shaw,et al.  Finding predictors of field defects for open source software systems in commonly available data sources: a case study of OpenBSD , 2005, 11th IEEE International Software Metrics Symposium (METRICS'05).

[8]  Eric S. Raymond,et al.  The cathedral and the bazaar , 1998, First Monday.

[9]  Xu Hui An Analysis of Activity , 2004 .

[10]  Gregory R. Madey,et al.  Analysis of Activity in the Open Source Software Development Community , 2007, 2007 40th Annual Hawaii International Conference on System Sciences (HICSS'07).

[11]  J. Herbsleb,et al.  Two case studies of open source software development: Apache and Mozilla , 2002, TSEM.

[12]  David M. Nichols,et al.  Exploring Usability Discussions in Open Source Development , 2005, Proceedings of the 38th Annual Hawaii International Conference on System Sciences.

[13]  Rolly Intan A PROPOSAL OF FUZZY MULTIDIMENSIONAL ASSOCIATION RULES , 2007 .

[14]  Kevin Crowston,et al.  Effective work practices for software engineering: free/libre open source software development , 2004, WISER '04.

[15]  Andrea Capiluppi,et al.  Studying the evolution of open source systems at different levels of granularity: two case studies , 2004 .

[16]  James D. Herbsleb,et al.  A case study of open source tools and practices in a commercial setting , 2005 .

[17]  Barbara Paech,et al.  Open Source Requirements Engineering , 2006, 14th IEEE International Requirements Engineering Conference (RE'06).

[18]  Katherine J. Stewart,et al.  Observations on patterns of development in open source software projects , 2005, ACM SIGSOFT Softw. Eng. Notes.