Forge++: The Changing Landscape of FLOSS Development

Software forges are centralized online systems that provide useful tools to help distributed development teams work together, especially in free, libre, and open source software (FLOSS). Forge-provided tools may include web space, version control systems, mailing lists and communication forums, bug tracking systems, file downloads, wikis, and the like. Empirical software engineering researchers can mine the artifacts from these tools to better understand how FLOSS is made. As the landscape of distributed software development has grown and changed, the tools needed to make FLOSS have changed as well. There are three newer tools at the center of FLOSS development today: distributed version control based forges (like Github), programmer question-and-answer communities (like Stack Overflow), and paste bin tools (like Gist or Pastebin.com). These tools are extending and changing the toolset used for FLOSS development, and redefining what a software forge looks like. The main contributions of this paper are to describe each of these tools, to identify the data and artifacts available for mining from these tools, and to outline some of the ways researchers can use these artifacts to continue to understand how FLOSS is made.

[1]  Daniel M. Germán,et al.  The promises and perils of mining git , 2009, 2009 6th IEEE International Working Conference on Mining Software Repositories.

[2]  Kevin Crowston,et al.  FLOSSmole: A Collaborative Repository for FLOSS Research Data and Analyses , 2006, Int. J. Inf. Technol. Web Eng..

[3]  Megan Squire,et al.  How the FLOSS Research Community Uses Email Archives , 2012, Int. J. Open Source Softw. Process..

[4]  Aristide Fattori,et al.  Peering into the Muddy Waters of Pastebin , 2012, ERCIM News.

[5]  Yongqin Gao,et al.  A Research Collaboratory for Open Source Software Research , 2007, First International Workshop on Emerging Trends in FLOSS Research and Development (FLOSS'07: ICSE Workshops 2007).

[6]  David Williams,et al.  Describing the Software Forge Ecosystem , 2012, 2012 45th Hawaii International Conference on System Sciences.

[7]  Georgios Gousios,et al.  The GHTorent dataset and tool suite , 2013, 2013 10th Working Conference on Mining Software Repositories (MSR).

[8]  Scott Grant,et al.  Encouraging user behaviour with achievements: An empirical study , 2013, 2013 10th Working Conference on Mining Software Repositories (MSR).

[9]  Kevin Crowston,et al.  The Perils and Pitfalls of Mining SourceForge , 2004, MSR.

[10]  Jure Leskovec,et al.  Steering user behavior with badges , 2013, WWW.

[11]  Xu Ben,et al.  Mining Developer Contribution in Open Source Software Using Visualization Techniques , 2013, 2013 Third International Conference on Intelligent System Design and Engineering Applications.

[12]  Charles A. Sutton,et al.  Mining source code repositories at massive scale using language modeling , 2013, 2013 10th Working Conference on Mining Software Repositories (MSR).

[13]  Daniela E. Damian,et al.  The promises and perils of mining GitHub , 2009, MSR 2014.

[14]  James D. Herbsleb,et al.  Social coding in GitHub: transparency and collaboration in an open software repository , 2012, CSCW.

[15]  Sotiris Ioannidis,et al.  Outsourcing Malicious Infrastructure to the Cloud , 2011, 2011 First SysSec Workshop.

[16]  J. Merrills,et al.  Terms of Service , 2008 .

[17]  Jesús M. González-Barahona,et al.  Tools for the Study of the Usual Data Sources found in Libre Software Projects , 2009, Int. J. Open Source Softw. Process..

[18]  Jeffrey Heer,et al.  Visualizing collaboration and influence in the open-source software community , 2011, MSR '11.