Investigating Open Source Project Success: A Data Mining Approach to Model Formulation, Validation and Testing

This paper demonstrates the use of Data Mining (DM) techniques in exploratory research. A robust model for identifying the factors that explain the success of Open Source Software (OSS) projects is created, validated and tested. The predictive modeling techniques of Logistic Regression (LR), Decision Trees (DT) and Neural Networks (NN) are used together in this analysis. Using Text Mining results in the predictive modeling process strengthens the model. SAS ® Enterprise Miner and SAS ® Text Miner are used in this research.