OSSEAN: Mining Crowd Wisdom in Open Source Communities

Open source software is now a successful crowd-based software production model and is becoming an ecosystem that brings together large numbers of software producers (such as developers) and consumers (such as users and customers). Much research has analyzed the software artifacts created by producers, but little of it reveals the power of feedback from consumers, which we believe is very important for the evaluation and evolution of open source software. This paper introduces OSSEAN, a platform for Open Source Software Evaluating, Analyzing and Networking. OSSEAN divides open source communities into two groups: software production communities and software consumption communities. The former contain structured software artifacts such as projects, source code and issues, while the latter are full of textual documents rich in the semantics of user feedback. We show the power of OSSEAN through several demos that analyze more than 200 thousand open source projects and 10 million documents.
