Comparative Code Structure Analysis using Deep Learning for Performance Prediction

Performance analysis has traditionally been an afterthought in the application development process, which focuses first on correctness. Existing static and dynamic analysis tools have steep learning curves, requiring an understanding of low-level details to translate their findings into actionable optimizations. Additionally, application performance is a function of many unknowns stemming from the application, the runtime, and interactions between the OS and the underlying hardware, making it difficult to model with any deep learning technique, especially without a large labeled dataset. In this paper, we address both of these problems: we present a large labeled dataset for the community, and we take a comparative analysis approach that controls for all unknowns except the source code differences between different correct implementations of the same problem. We put deep learning to the test by automatically extracting information from the hierarchical structure of abstract syntax trees to represent source code. This paper aims to assess the feasibility of using purely static information (e.g., the abstract syntax tree, or AST) of an application to predict performance change based on a change in code structure. This research will enable performance-aware application development, since every version of an application will continue to contribute to the corpus, which will in turn enhance the performance of the model. We evaluate several deep learning-based representation learning techniques for source code. Our results show that tree-based Long Short-Term Memory (LSTM) models can leverage source code's hierarchical structure to discover latent representations. Specifically, LSTM-based predictive models built using a single problem and a combination of multiple problems can correctly predict whether a source code change will perform better or worse up to 84% and 73% of the time, respectively.
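To make the comparative, purely static setting concrete, the sketch below parses two correct implementations of the same problem into ASTs and computes a structural diff between them. This is only an illustration of the kind of static signal available to such a model: the node-type histogram is a deliberately crude stand-in for the learned Tree-LSTM embeddings the paper actually evaluates, and the two snippets are hypothetical examples, not drawn from the paper's dataset.

```python
import ast
from collections import Counter

def ast_node_histogram(source: str) -> Counter:
    """Count AST node types as a crude static representation of code
    structure. (A Tree-LSTM would instead learn an embedding from the
    tree's hierarchy; this histogram is only for illustration.)"""
    tree = ast.parse(source)
    return Counter(type(node).__name__ for node in ast.walk(tree))

# Two correct implementations of the same problem (sum of squares):
naive = "total = 0\nfor i in range(n):\n    total += i * i\n"
one_liner = "total = sum(i * i for i in range(n))\n"

h1 = ast_node_histogram(naive)
h2 = ast_node_histogram(one_liner)

# The structural difference between the two ASTs is the only varying
# input in the comparative setting -- everything else (problem, runtime,
# OS, hardware) is held fixed:
diff = {node: h1.get(node, 0) - h2.get(node, 0)
        for node in set(h1) | set(h2)
        if h1.get(node, 0) != h2.get(node, 0)}
print(diff)
```

A flat histogram discards the hierarchy that the paper's tree-based models exploit, which is precisely why the abstract argues for representations that follow the AST's structure rather than bag-of-nodes features.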
