Poster: Predicting Components for Issue Reports Using Deep Learning with Information Retrieval

Assigning an issue to the correct component(s) is challenging, especially for large-scale projects which have are up to hundreds of components. We propose a prediction model which learns from historical issues reports and recommends the most relevant components for new issues. Our model uses the deep learning Long Short-Term Memory to automatically learns semantic features representing an issue report, and combines them with the traditional textual similarity features. An extensive evaluation on 142,025 issues from 11 large projects shows our approach outperforms alternative techniques with an average 60% improvement in predictive performance.

[1]  John Baldacchino,et al.  Learning to Be , 2014 .

[2]  Ashish Sureka,et al.  Learning to Classify Bug Reports into Components , 2012, TOOLS.

[3]  Ioannis Stamelos,et al.  Extracting Components from Open Source: The Component Adaptation Environment (COPE) Approach , 2012, 2012 38th Euromicro Conference on Software Engineering and Advanced Applications.

[4]  Thomas Zimmermann,et al.  What Makes a Good Bug Report? , 2008, IEEE Transactions on Software Engineering.