The business models of software/platform as a service have contributed to developers dependence on the Internet. Developers can rapidly point each other and consumers to the newest software changes with the power of the hyper link. But, developers are not limited to referencing software changes to one another through the web. Other shared hypermedia might include links to: Stack Overflow, Twitter, and issue trackers. This work explores the software traceability of Uniform Resource Locators (URLs) which software developers leave in commit messages and software repositories. URLs are easily extracted from commit messages and source code. Therefore, it would be useful to researchers if URLs provide additional insight on project development. To assess traceability, manual topic labelling is evaluated against automated topic labelling on URL data sets. This work also shows differences between URL data collected from commit messages versus URL data collected from source code. As well, this work explores outlying software projects with many URLs in case these projects do not provide meaningful software relationship information. Results from manual topic labelling show promise under evaluation while automated topic labelling did not yield precise topics. Further investigation of manual and automated topic analysis would be useful.
[1]
Iman Keivanloo,et al.
Software trustworthiness 2.0 - A semantic web enabled global source code analysis approach
,
2014,
J. Syst. Softw..
[2]
Ahmed E. Hassan,et al.
What are developers talking about? An analysis of topics and trends in Stack Overflow
,
2014,
Empirical Software Engineering.
[3]
Abram Hindle,et al.
Relating requirements to implementation via topic analysis: Do topics extracted from requirements make sense to managers and developers?
,
2012,
2012 28th IEEE International Conference on Software Maintenance (ICSM).
[4]
Hridesh Rajan,et al.
Boa: A language and infrastructure for analyzing ultra-large-scale software repositories
,
2013,
2013 35th International Conference on Software Engineering (ICSE).
[5]
Daniela E. Damian,et al.
The promises and perils of mining GitHub
,
2009,
MSR 2014.