论文信息 - Mining the maintenance history of a legacy software system

Mining the maintenance history of a legacy software system

A considerable amount of system maintenance experience can be found in bug tracking and source code configuration management systems. Data mining and machine learning techniques allow one to extract models from past experience that can be used in future predictions. By mining the software change record, one can therefore generate models that can be used in future maintenance activities. In this paper, we present an example of such a model that represents a relation between pairs of files and show how it can be extracted from the software update records of a real world legacy system. We show how different sources of data can be used to extract sets of features useful in describing this model, as well as how results are affected by these different feature sets and their combinations. Our best results were obtained from text-based features, i.e. those extracted from words in the problem reports as opposed to syntactic structures in the source code.

[1] James P. Egan,et al. Signal detection theory and ROC analysis , 1975 .

[2] Robert C. Holte,et al. Very Simple Classification Rules Perform Well on Most Commonly Used Datasets , 1993, Machine Learning.

[3] John Shawe-Taylor,et al. The Set Covering Machine , 2003, J. Mach. Learn. Res..

[4] Anas N. Al-Rabadi,et al. A comparison of modified reconstructability analysis and Ashenhurst‐Curtis decomposition of Boolean functions , 2004 .

[5] Stan Matwin,et al. Supporting software maintenance by mining software update records , 2001, Proceedings IEEE International Conference on Software Maintenance. ICSM 2001.

[6] Dunja Mladenic,et al. Text-learning and related intelligent agents: a survey , 1999, IEEE Intell. Syst..

[7] David D. Lewis,et al. Representation and Learning in Information Retrieval , 1991 .

[8] Harald C. Gall,et al. An evaluation of reverse engineering tool capabilities , 1998, J. Softw. Maintenance Res. Pract..

[9] Timothy Lethbridge,et al. A little knowledge can go a long way towards program understanding , 1997, Proceedings Fifth International Workshop on Program Comprehension. IWPC'97.

[10] Avrim Blum,et al. The Bottleneck , 2021, Monopsony Capitalism.

[11] Kostas Kontogiannis,et al. Workshop report: The two-day workshop on Research Issues in the Intersection between Software Engineering and Artificial Intelligence (held in conjunction with ICSE-16) , 1995, Automated Software Engineering.