A Survey on Deep Learning for Software Engineering

In 2006, Geoffrey Hinton proposed the concept of training ''Deep Neural Networks (DNNs)'' and an improved model training method to break the bottleneck of neural network development. More recently, the introduction of AlphaGo in 2016 demonstrated the powerful learning ability of deep learning and its enormous potential. Deep learning has been increasingly used to develop state-of-the-art software engineering (SE) research tools due to its ability to boost performance for various SE tasks. There are many factors, e.g., deep learning model selection, internal structure differences, and model optimization techniques, that may have an impact on the performance of DNNs applied in SE. Few works to date focus on summarizing, classifying, and analyzing the application of deep learning techniques in SE. To fill this gap, we performed a survey to analyse the relevant studies published since 2006. We first provide an example to illustrate how deep learning techniques are used in SE. We then summarize and classify different deep learning techniques used in SE. We analyzed key optimization technologies used in these deep learning models, and finally describe a range of key research topics using DNNs in SE. Based on our findings, we present a set of current challenges remaining to be investigated and outline a proposed research road map highlighting key opportunities for future work.

[1]  Cristina V. Lopes,et al.  SourcererCC: Scaling Code Clone Detection to Big-Code , 2015, 2016 IEEE/ACM 38th International Conference on Software Engineering (ICSE).

[2]  Gabriele Bavota,et al.  Deep Learning Similarities from Different Representations of Source Code , 2018, 2018 IEEE/ACM 15th International Conference on Mining Software Repositories (MSR).

[3]  K. M. Annervaz,et al.  Towards Accurate Duplicate Bug Retrieval Using Deep Learning Techniques , 2017, 2017 IEEE International Conference on Software Maintenance and Evolution (ICSME).

[4]  Passakorn Phannachitta,et al.  On an optimal analogy-based software effort estimation , 2020, Inf. Softw. Technol..

[5]  Jonas Paul Winkler,et al.  Predicting How to Test Requirements: An Automated Approach , 2019, 2019 IEEE 27th International Requirements Engineering Conference (RE).

[6]  José Nelson Amaral,et al.  Syntax and sensibility: Using language models to detect and correct syntax errors , 2018, 2018 IEEE 25th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[7]  Philip S. Yu,et al.  Multi-modal Attention Network Learning for Semantic Source Code Retrieval , 2019, 2019 34th IEEE/ACM International Conference on Automated Software Engineering (ASE).

[8]  David Lo,et al.  Deep specification mining , 2018, ISSTA.

[9]  Hui Liu,et al.  Deep Learning Based Code Smell Detection , 2021, IEEE Transactions on Software Engineering.

[10]  Hongyu Zhang,et al.  DeepPerf: Performance Prediction for Configurable Software with Deep Sparse Neural Network , 2019, 2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE).

[11]  Graham Neubig,et al.  DIRE: A Neural Approach to Decompiled Identifier Naming , 2019, 2019 34th IEEE/ACM International Conference on Automated Software Engineering (ASE).

[12]  Rishabh Singh,et al.  Learn&Fuzz: Machine learning for input fuzzing , 2017, 2017 32nd IEEE/ACM International Conference on Automated Software Engineering (ASE).

[13]  Zhenchang Xing,et al.  Predicting semantically linkable knowledge in developer online forums via convolutional neural network , 2016, 2016 31st IEEE/ACM International Conference on Automated Software Engineering (ASE).

[14]  Zhenchang Xing,et al.  Domain-specific machine translation with recurrent neural network for software localization , 2019, Empirical Software Engineering.

[15]  Alexander Serebrenik,et al.  Was Self-Admitted Technical Debt Removal a Real Removal? An In-Depth Perspective , 2018, 2018 IEEE/ACM 15th International Conference on Mining Software Repositories (MSR).

[16]  Long Chen,et al.  DeepLink: A Code Knowledge Graph Based Deep Learning Approach for Issue-Commit Link Recovery , 2019, 2019 IEEE 26th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[17]  Yitong Li,et al.  CoCoNuT: combining context-aware neural translation models using ensemble for program repair , 2020, ISSTA.

[18]  Wei Li,et al.  DeepFL: integrating multiple fault diagnosis dimensions for deep fault localization , 2019, ISSTA.

[19]  Sancheng Peng,et al.  Detection and Prevention of Code Injection Attacks on HTML5-Based Apps , 2015, 2015 Third International Conference on Advanced Cloud and Big Data.

[20]  Xiaoguang Mao,et al.  CNN-FL: An Effective Approach for Localizing Faults using Convolutional Neural Networks , 2019, 2019 IEEE 26th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[21]  Xiao Liu,et al.  FastTagRec: fast tag recommendation for software information sites , 2018, Automated Software Engineering.

[22]  Xiang Chen,et al.  Improving defect prediction with deep forest , 2019, Inf. Softw. Technol..

[23]  Zhi Jin,et al.  Detecting Code Clones with Graph Neural Network and Flow-Augmented Abstract Syntax Tree , 2020, 2020 IEEE 27th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[24]  Liming Zhu,et al.  Wireframe-based UI Design Search through Image Autoencoder , 2020, ACM Trans. Softw. Eng. Methodol..

[25]  Xin Wang,et al.  Textout: Detecting Text-Layout Bugs in Mobile Apps via Visualization-Oriented Learning , 2019, 2019 IEEE 30th International Symposium on Software Reliability Engineering (ISSRE).

[26]  Zhenchang Xing,et al.  Easy-to-Deploy API Extraction by Multi-Level Feature Embedding and Transfer Learning , 2019, IEEE Transactions on Software Engineering.

[27]  Hui Zhao,et al.  SeqFuzzer: An Industrial Protocol Fuzzing Framework from a Deep Learning Perspective , 2019, 2019 12th IEEE Conference on Software Testing, Validation and Verification (ICST).

[28]  Lei Ma,et al.  Wuji: Automatic Online Combat Game Testing Using Evolutionary Deep Reinforcement Learning , 2019, 2019 34th IEEE/ACM International Conference on Automated Software Engineering (ASE).

[29]  JiangZhen Ming,et al.  Predicting Node Failures in an Ultra-Large-Scale Cloud Computing Platform , 2020 .

[30]  Alain Abran,et al.  A systematic literature review: Opinion mining studies from mobile app store user reviews , 2017, J. Syst. Softw..

[31]  Bin Li,et al.  Analyzing bug fix for automatic bug cause classification , 2020, J. Syst. Softw..

[32]  Harald C. Gall,et al.  Suggesting Comment Completions for Python using Neural Language Models , 2020, 2020 IEEE 27th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[33]  Zhenchang Xing,et al.  Learning a dual-language vector space for domain-specific cross-lingual question retrieval , 2016, 2016 31st IEEE/ACM International Conference on Automated Software Engineering (ASE).

[34]  Erik Linstead,et al.  A Deep Learning Approach to Identifying Source Code in Images and Video , 2018, 2018 IEEE/ACM 15th International Conference on Mining Software Repositories (MSR).

[35]  G. S. Mahapatra,et al.  Neural network for software reliability analysis of dynamically weighted NHPP growth models with imperfect debugging , 2018, Softw. Test. Verification Reliab..

[36]  Yves Le Traon,et al.  Learning to Spot and Refactor Inconsistent Method Names , 2019, 2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE).

[37]  Martin White,et al.  Sorting and Transforming Program Repair Ingredients via Deep Learning Code Similarities , 2017, 2019 IEEE 26th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[38]  Hwee Tou Ng,et al.  Effective Modeling of Encoder-Decoder Architecture for Joint Entity and Relation Extraction , 2019, AAAI.

[39]  W. Pitts,et al.  A Logical Calculus of the Ideas Immanent in Nervous Activity (1943) , 2021, Ideas That Created the Future.

[40]  Bin Liu,et al.  Software defect prediction using stacked denoising autoencoders and two-stage ensemble learning , 2017, Inf. Softw. Technol..

[41]  Sen Zhang,et al.  Systematic Comprehension for Developer Reply in Mobile System Forum , 2019, 2019 IEEE 26th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[42]  Zhenchang Xing,et al.  Which Variables Should I Log? , 2021, IEEE Transactions on Software Engineering.

[43]  Pushmeet Kohli,et al.  Neuro-Symbolic Program Corrector for Introductory Programming Assignments , 2018, 2018 IEEE/ACM 40th International Conference on Software Engineering (ICSE).

[44]  Razvan C. Bunescu,et al.  Learning to rank relevant files for bug reports using domain knowledge , 2014, SIGSOFT FSE.

[45]  Cheng Zhou,et al.  Improving software bug-specific named entity recognition with deep neural network , 2020, J. Syst. Softw..

[46]  Richard E. Fairley,et al.  Guide to the Software Engineering Body of Knowledge (SWEBOK(R)): Version 3.0 , 2014 .

[47]  David Lo,et al.  Automating Intention Mining , 2020, IEEE Transactions on Software Engineering.

[48]  Xin Peng,et al.  DeepLink: Recovering issue-commit links based on deep learning , 2019, J. Syst. Softw..

[49]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[50]  Zhenchang Xing,et al.  psc2code: Denoising Code Extraction from Programming Screencasts , 2020, ACM Trans. Softw. Eng. Methodol..

[51]  Andreas Vogelsang,et al.  Extraction of System States from Natural Language Requirements , 2019, 2019 IEEE 27th International Requirements Engineering Conference (RE).

[52]  Zhenchang Xing,et al.  A Neural Model for Method Name Generation from Functional Description , 2019, 2019 IEEE 26th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[53]  Tim Menzies,et al.  Easy over hard: a case study on deep learning , 2017, ESEC/SIGSOFT FSE.

[54]  Alexander Egyed,et al.  Feature Maps: A Comprehensible Software Representation for Design Pattern Detection , 2018, 2019 IEEE 26th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[55]  Meng Yan,et al.  Automating Just-In-Time Comment Updating , 2020, 2020 35th IEEE/ACM International Conference on Automated Software Engineering (ASE).

[56]  Barbara G. Ryder,et al.  CCLearner: A Deep Learning-Based Clone Detection Approach , 2017, 2017 IEEE International Conference on Software Maintenance and Evolution (ICSME).

[57]  Song Wang,et al.  Automatically Learning Semantic Features for Defect Prediction , 2016, 2016 IEEE/ACM 38th International Conference on Software Engineering (ICSE).

[58]  Collin McMillan,et al.  Automatically generating commit messages from diffs using neural machine translation , 2017, 2017 32nd IEEE/ACM International Conference on Automated Software Engineering (ASE).

[59]  David Lo,et al.  DeepJIT: An End-to-End Deep Learning Framework for Just-in-Time Defect Prediction , 2019, 2019 IEEE/ACM 16th International Conference on Mining Software Repositories (MSR).

[60]  Neeraj Kumar Goyal,et al.  Software development efforts prediction using artificial neural network , 2016, IET Softw..

[61]  Lingming Zhang,et al.  An Extensive Study on Cross-Project Predictive Mutation Testing , 2019, 2019 12th IEEE Conference on Software Testing, Validation and Verification (ICST).

[62]  Martin White,et al.  Deep learning code fragments for code clone detection , 2016, 2016 31st IEEE/ACM International Conference on Automated Software Engineering (ASE).

[63]  Taolue Chen,et al.  Augmenting Java method comments generation with context information based on neural networks , 2019, J. Syst. Softw..

[64]  Long Chen,et al.  Neural Detection of Semantic Code Clones Via Tree-Based Convolution , 2019, 2019 IEEE/ACM 27th International Conference on Program Comprehension (ICPC).

[65]  Michael Pradel,et al.  NL2Type: Inferring JavaScript Function Types from Natural Language Information , 2019, 2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE).

[66]  Tim Menzies,et al.  A Deep Learning Model for Estimating Story Points , 2016, IEEE Transactions on Software Engineering.

[67]  Hui Liu,et al.  Deep Learning Based Feature Envy Detection , 2018, 2018 33rd IEEE/ACM International Conference on Automated Software Engineering (ASE).

[68]  Chris Cummins,et al.  Compiler fuzzing through deep learning , 2018, ISSTA.

[69]  Kai Petersen,et al.  Guidelines for conducting systematic mapping studies in software engineering: An update , 2015, Inf. Softw. Technol..

[70]  David Lo,et al.  Deep Transfer Bug Localization , 2019, IEEE Transactions on Software Engineering.

[71]  Andreas Vogelsang,et al.  Extraction of System States from Natural Language Requirements , 2019, 2019 IEEE 27th International Requirements Engineering Conference (RE).

[72]  Neil A. Ernst,et al.  Cross-Dataset Design Discussion Mining , 2020, 2020 IEEE 27th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[73]  Martin White,et al.  Toward Deep Learning Software Repositories , 2015, 2015 IEEE/ACM 12th Working Conference on Mining Software Repositories.

[74]  Chao Chen,et al.  DRLgencert: Deep Learning-Based Automated Testing of Certificate Verification in SSL/TLS Implementations , 2018, 2018 IEEE International Conference on Software Maintenance and Evolution (ICSME).

[75]  Foutse Khomh,et al.  Deep Learning Anti-Patterns from Code Metrics History , 2019, 2019 IEEE International Conference on Software Maintenance and Evolution (ICSME).

[76]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[77]  Adel Nadjaran Toosi,et al.  ThermoSim: Deep Learning based Framework for Modeling and Simulation of Thermal-aware Resource Management for Cloud Computing Environments , 2020, J. Syst. Softw..

[78]  Xiaodong Gu,et al.  Deep API learning , 2016, SIGSOFT FSE.

[79]  Christian Bird,et al.  Deep learning type inference , 2018, ESEC/SIGSOFT FSE.

[80]  Lionel C. Briand,et al.  Testing advanced driver assistance systems using multi-objective search and neural networks , 2016, 2016 31st IEEE/ACM International Conference on Automated Software Engineering (ASE).

[81]  Shaohua Wang,et al.  Extracting API Tips from Developer Question and Answer Websites , 2019, 2019 IEEE/ACM 16th International Conference on Mining Software Repositories (MSR).

[82]  Cuiyun Gao,et al.  On the Replicability and Reproducibility of Deep Learning in Software Engineering , 2020, ACM Trans. Softw. Eng. Methodol..

[83]  Hui Yang,et al.  Deep Review Sharing , 2019, 2019 IEEE 26th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[84]  Tao Chen,et al.  DeepSQLi: deep semantic learning for testing SQL injection , 2020, ISSTA.

[85]  Dave Towey,et al.  A Survey on Adaptive Random Testing , 2019, IEEE Transactions on Software Engineering.

[86]  Aditya K. Ghose,et al.  Lessons Learned from Using a Deep Tree-Based Model for Software Defect Prediction in Practice , 2019, 2019 IEEE/ACM 16th International Conference on Mining Software Repositories (MSR).

[87]  Hoa Khanh Dam,et al.  Automatic Feature Learning for Predicting Vulnerable Software Components , 2021, IEEE Transactions on Software Engineering.

[88]  Gang Zhao,et al.  DeepSim: deep learning code functional similarity , 2018, ESEC/SIGSOFT FSE.

[89]  Yang Liu,et al.  From UI Design Image to GUI Skeleton: A Neural Machine Translator to Bootstrap Mobile GUI Implementation , 2018, 2018 IEEE/ACM 40th International Conference on Software Engineering (ICSE).

[90]  Mark T True,et al.  Software Requirements , 2005 .

[91]  David Lo,et al.  PatchNet: Hierarchical Deep Learning-Based Stable Patch Identification for the Linux Kernel , 2019, IEEE Transactions on Software Engineering.

[92]  Philip S. Yu,et al.  Improving Automatic Source Code Summarization via Deep Reinforcement Learning , 2018, 2018 33rd IEEE/ACM International Conference on Automated Software Engineering (ASE).

[93]  Jaechang Nam,et al.  Deep Semantic Feature Learning for Software Defect Prediction , 2020, IEEE Transactions on Software Engineering.

[94]  Graham Neubig,et al.  Learning to Mine Aligned Code and Natural Language Pairs from Stack Overflow , 2018, 2018 IEEE/ACM 15th International Conference on Mining Software Repositories (MSR).

[95]  Qingkai Shi,et al.  Functional code clone detection with syntax and semantics fusion learning , 2020, ISSTA.

[96]  Adam A. Porter,et al.  An Empirical Assessment of Machine Learning Approaches for Triaging Reports of a Java Static Analysis Tool , 2019, 2019 12th IEEE Conference on Software Testing, Validation and Verification (ICST).

[97]  Anh Tuan Nguyen,et al.  Bug Localization with Combination of Deep Learning and Information Retrieval , 2017, 2017 IEEE/ACM 25th International Conference on Program Comprehension (ICPC).

[98]  Trong Duc Nguyen,et al.  A deep neural network language model with contexts for source code , 2018, 2018 IEEE 25th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[99]  Yuanfang Cai,et al.  TECCD: A Tree Embedding Approach for Code Clone Detection , 2019, 2019 IEEE International Conference on Software Maintenance and Evolution (ICSME).

[100]  Alper Sen,et al.  Virtualization of stateful services via machine learning , 2019, Software Quality Journal.

[101]  Zhenchang Xing,et al.  Neural Network-based Detection of Self-Admitted Technical Debt: From Performance to Explainability , 2019, ACM Trans. Softw. Eng. Methodol..

[102]  David Lo,et al.  Deep Code Comment Generation , 2018, 2018 IEEE/ACM 26th International Conference on Program Comprehension (ICPC).

[103]  Yueqi Chen,et al.  RENN: Efficient Reverse Execution with Neural-Network-Assisted Alias Analysis , 2019, 2019 34th IEEE/ACM International Conference on Automated Software Engineering (ASE).

[104]  Nikolaos Tsantalis,et al.  Using Natural Language Processing to Automatically Detect Self-Admitted Technical Debt , 2017, IEEE Transactions on Software Engineering.

[105]  Xiaodong Gu,et al.  Deep Code Search , 2018, 2018 IEEE/ACM 40th International Conference on Software Engineering (ICSE).

[106]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[107]  Lingxiao Jiang,et al.  Bilateral Dependency Neural Networks for Cross-Language Algorithm Classification , 2019, 2019 IEEE 26th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[108]  Aysh Alhroob,et al.  The use of artificial neural networks for extracting actions and actors from requirements document , 2018, Inf. Softw. Technol..

[109]  Yuan Wang,et al.  A Code-Description Representation Learning Model Based on Attention , 2020, 2020 IEEE 27th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[110]  Miroslaw Staron,et al.  Deep learning model for end-to-end approximation of COSMIC functional size based on use-case names , 2020, Inf. Softw. Technol..

[111]  Xiangyu Zhang,et al.  Automatic Text Input Generation for Mobile Testing , 2017, 2017 IEEE/ACM 39th International Conference on Software Engineering (ICSE).

[112]  Hailong Sun,et al.  A Novel Neural Source Code Representation Based on Abstract Syntax Tree , 2019, 2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE).

[113]  Santanu Kumar Rath,et al.  Hybrid functional link artificial neural network approach for predicting maintainability of object-oriented software , 2016, J. Syst. Softw..

[114]  Artur Andrzejak,et al.  Learning-Based Recursive Aggregation of Abstract Syntax Trees for Code Clone Detection , 2019, 2019 IEEE 26th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[115]  Alexander Serebrenik,et al.  Automatically Learning Patterns for Self-Admitted Technical Debt Removal , 2020, 2020 IEEE 27th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[116]  Tim Menzies,et al.  500+ Times Faster than Deep Learning: (A Case Study Exploring Faster Methods for Text Mining StackOverflow) , 2018, 2018 IEEE/ACM 15th International Conference on Mining Software Repositories (MSR).

[117]  Kevin A. Schneider,et al.  CLCDSA: Cross Language Code Clone Detection using Syntactical Features and API Documentation , 2019, 2019 34th IEEE/ACM International Conference on Automated Software Engineering (ASE).

[118]  Alain Abran,et al.  Neural networks for predicting the duration of new software projects , 2015, J. Syst. Softw..

[119]  Saurabh Bagchi,et al.  PySE: Automatic Worst-Case Test Generation by Reinforcement Learning , 2019, 2019 12th IEEE Conference on Software Testing, Validation and Verification (ICST).

[120]  Denys Poshyvanyk,et al.  Machine Learning-Based Prototyping of Graphical User Interfaces for Mobile Apps , 2018, IEEE Transactions on Software Engineering.

[121]  Geoffrey E. Hinton,et al.  Deep Belief Networks for phone recognition , 2009 .

[122]  Burak Turhan,et al.  A Systematic Literature Review and Meta-Analysis on Cross Project Defect Prediction , 2019, IEEE Transactions on Software Engineering.

[123]  Kishor S. Trivedi,et al.  Supervised Representation Learning Approach for Cross-Project Aging-Related Bug Prediction , 2019, 2019 IEEE 30th International Symposium on Software Reliability Engineering (ISSRE).

[124]  Zhenchang Xing,et al.  Learning to Predict Severity of Software Vulnerability Using Only Vulnerability Description , 2017, 2017 IEEE International Conference on Software Maintenance and Evolution (ICSME).

[125]  Shin Yoo,et al.  Classifying False Positive Static Checker Alarms in Continuous Integration Using Convolutional Neural Networks , 2019, 2019 12th IEEE Conference on Software Testing, Validation and Verification (ICST).

[126]  Rongxin Wu,et al.  How Well Do Change Sequences Predict Defects? Sequence Learning from Software Changes , 2020, IEEE Transactions on Software Engineering.

[127]  Michael R. Lyu,et al.  An Empirical Study of Common Challenges in Developing Deep Learning Applications , 2019, 2019 IEEE 30th International Symposium on Software Reliability Engineering (ISSRE).

[128]  Collin McMillan,et al.  A Neural Model for Generating Natural Language Summaries of Program Subroutines , 2019, 2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE).

[129]  Senzhang Wang,et al.  CodeGRU: Context-aware Deep Learning with Gated Recurrent Unit for Source Code Modeling , 2019, Inf. Softw. Technol..

[130]  Ahmed E. Hassan,et al.  Predicting Node Failures in an Ultra-Large-Scale Cloud Computing Platform , 2020, ACM Trans. Softw. Eng. Methodol..

[131]  Pearl Brereton,et al.  Performing systematic literature reviews in software engineering , 2006, ICSE.

[132]  Abram Hindle,et al.  Deep Green: Modelling Time-Series of Software Energy Consumption , 2017, 2017 IEEE International Conference on Software Maintenance and Evolution (ICSME).

[133]  Zachary Eberhart,et al.  Adapting Neural Text Classification for Improved Software Categorization , 2018, 2018 IEEE International Conference on Software Maintenance and Evolution (ICSME).

[134]  Zhenchang Xing,et al.  ActionNet: Vision-Based Workflow Action Recognition From Programming Screencasts , 2019, 2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE).

[135]  Eric Schulte,et al.  Using recurrent neural networks for decompilation , 2018, 2018 IEEE 25th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[136]  Xuandong Li,et al.  Reinforcement learning based curiosity-driven testing of Android applications , 2020, ISSTA.

[137]  Xiapu Luo,et al.  LDFR: Learning deep feature representation for software defect prediction , 2019, J. Syst. Softw..

[138]  Shigeru Chiba,et al.  Cross-Language Clone Detection by Learning Over Abstract Syntax Trees , 2019, 2019 IEEE/ACM 16th International Conference on Mining Software Repositories (MSR).

[139]  Christoph Treude,et al.  Automatic Generation of Pull Request Descriptions , 2019, 2019 34th IEEE/ACM International Conference on Automated Software Engineering (ASE).

[140]  Foutse Khomh,et al.  Keep it simple: Is deep learning good for linguistic smell detection? , 2018, 2018 IEEE 25th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[141]  Yan Xiao,et al.  Improving bug localization with word embedding and enhanced convolutional neural networks , 2019, Inf. Softw. Technol..

[142]  Gabriele Bavota,et al.  An Empirical Study on Learning Bug-Fixing Patches in the Wild via Neural Machine Translation , 2018, ACM Trans. Softw. Eng. Methodol..

[143]  David Lo,et al.  Prediction of relatedness in stack overflow: deep learning vs. SVM: a reproducibility study , 2018, ESEM.

[144]  Xiao Liu,et al.  Is deep learning better than traditional approaches in tag recommendation for software information sites? , 2019, Inf. Softw. Technol..

[145]  Yan Xiao,et al.  Improving code readability classification using convolutional neural networks , 2018, Inf. Softw. Technol..

[146]  Douglas M. Hawkins,et al.  The Problem of Overfitting , 2004, J. Chem. Inf. Model..

[147]  Junfeng Tian,et al.  Software trustworthiness evaluation model based on a behaviour trajectory matrix , 2020, Inf. Softw. Technol..

[148]  Somesh Jha,et al.  Neural-augmented static analysis of Android communication , 2018, ESEC/SIGSOFT FSE.

[149]  Li Deng,et al.  A tutorial survey of architectures, algorithms, and applications for deep learning , 2014, APSIPA Transactions on Signal and Information Processing.

[150]  Yuming Zhou,et al.  Connecting software metrics across versions to predict defects , 2017, 2018 IEEE 25th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[151]  Zhendong Su,et al.  DECKARD: Scalable and Accurate Tree-Based Detection of Code Clones , 2007, 29th International Conference on Software Engineering (ICSE'07).

[152]  Zhen Li,et al.  BVDetector: A program slice-based binary code vulnerability intelligent detection system , 2020, Inf. Softw. Technol..

[153]  Koushik Sen,et al.  When deep learning met code search , 2019, ESEC/SIGSOFT FSE.

[154]  Sonia Haiduc,et al.  Code Localization in Programming Screencasts , 2020, Empirical Software Engineering.

[155]  Lingling Fan,et al.  CORE: Automating Review Recommendation for Code Changes , 2019, 2020 IEEE 27th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[156]  Qasim Umer,et al.  Deep Learning Based Identification of Suspicious Return Statements , 2020, 2020 IEEE 27th International Conference on Software Analysis, Evolution and Reengineering (SANER).