On the Replicability and Reproducibility of Deep Learning in Software Engineering

Deep learning (DL) techniques have gained significant popularity among software engineering (SE) researchers in recent years. This is because they can often solve many SE challenges without enormous manual feature engineering effort and complex domain knowledge. Although many DL studies have reported substantial advantages over other state-of-the-art models on effectiveness, they often ignore two factors: (1) replicability - whether the reported experimental result can be approximately reproduced in high probability with the same DL model and the same data; and (2) reproducibility - whether one reported experimental findings can be reproduced by new experiments with the same experimental protocol and DL model, but different sampled real-world data. Unlike traditional machine learning (ML) models, DL studies commonly overlook these two factors and declare them as minor threats or leave them for future work. This is mainly due to high model complexity with many manually set parameters and the time-consuming optimization process. In this study, we conducted a literature review on 93 DL studies recently published in twenty SE journals or conferences. Our statistics show the urgency of investigating these two factors in SE. Moreover, we re-ran four representative DL models in SE. Experimental results show the importance of replicability and reproducibility, where the reported performance of a DL model could not be replicated for an unstable optimization process. Reproducibility could be substantially compromised if the model training is not convergent, or if performance is sensitive to the size of vocabulary and testing data. It is therefore urgent for the SE community to provide a long-lasting link to a replication package, enhance DL-based solution stability and convergence, and avoid performance sensitivity on different sampled data.

[1]  Xin Xia,et al.  Cross-Project Change-Proneness Prediction , 2018, 2018 IEEE 42nd Annual Computer Software and Applications Conference (COMPSAC).

[2]  Amela Karahasanovic,et al.  A survey of controlled experiments in software engineering , 2005, IEEE Transactions on Software Engineering.

[3]  Xiao Liu,et al.  Is deep learning better than traditional approaches in tag recommendation for software information sites? , 2019, Inf. Softw. Technol..

[4]  Kevin A. Schneider,et al.  CLCDSA: Cross Language Code Clone Detection using Syntactical Features and API Documentation , 2019, 2019 34th IEEE/ACM International Conference on Automated Software Engineering (ASE).

[5]  John E. Boylan,et al.  Reproducibility in forecasting research , 2015 .

[6]  Bin Liu,et al.  Software defect prediction using stacked denoising autoencoders and two-stage ensemble learning , 2017, Inf. Softw. Technol..

[7]  Natalia Juristo Juzgado,et al.  Replications of software engineering experiments , 2013, Empirical Software Engineering.

[8]  Tao Wang,et al.  Convolutional Neural Networks over Tree Structures for Programming Language Processing , 2014, AAAI.

[9]  Xiao Liu,et al.  FastTagRec: fast tag recommendation for software information sites , 2018, Automated Software Engineering.

[10]  Martin White,et al.  Sorting and Transforming Program Repair Ingredients via Deep Learning Code Similarities , 2017, 2019 IEEE 26th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[11]  David Lo,et al.  PatchNet: Hierarchical Deep Learning-Based Stable Patch Identification for the Linux Kernel , 2019, IEEE Transactions on Software Engineering.

[12]  Fabio Q. B. da Silva,et al.  Replication of empirical studies in software engineering research: a systematic mapping study , 2012, Empirical Software Engineering.

[13]  Zachary Eberhart,et al.  Adapting Neural Text Classification for Improved Software Categorization , 2018, 2018 IEEE International Conference on Software Maintenance and Evolution (ICSME).

[14]  David Lo,et al.  Deep Code Comment Generation , 2018, 2018 IEEE/ACM 26th International Conference on Program Comprehension (ICPC).

[15]  Bram van Ginneken,et al.  A survey on deep learning in medical image analysis , 2017, Medical Image Anal..

[16]  G. S. Mahapatra,et al.  Neural network for software reliability analysis of dynamically weighted NHPP growth models with imperfect debugging , 2018, Softw. Test. Verification Reliab..

[17]  Alexander Egyed,et al.  Feature Maps: A Comprehensible Software Representation for Design Pattern Detection , 2018, 2019 IEEE 26th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[18]  Zhenchang Xing,et al.  Easy-to-Deploy API Extraction by Multi-Level Feature Embedding and Transfer Learning , 2019, IEEE Transactions on Software Engineering.

[19]  Ah Chung Tsoi,et al.  Face recognition: a convolutional neural-network approach , 1997, IEEE Trans. Neural Networks.

[20]  Xin Wang,et al.  Textout: Detecting Text-Layout Bugs in Mobile Apps via Visualization-Oriented Learning , 2019, 2019 IEEE 30th International Symposium on Software Reliability Engineering (ISSRE).

[21]  Graham Neubig,et al.  DIRE: A Neural Approach to Decompiled Identifier Naming , 2019, 2019 34th IEEE/ACM International Conference on Automated Software Engineering (ASE).

[22]  Jan Bosch,et al.  Software Engineering Challenges of Deep Learning , 2018, 2018 44th Euromicro Conference on Software Engineering and Advanced Applications (SEAA).

[23]  Phil Blunsom,et al.  A Convolutional Neural Network for Modelling Sentences , 2014, ACL.

[24]  Zhenchang Xing,et al.  Learning to Predict Severity of Software Vulnerability Using Only Vulnerability Description , 2017, 2017 IEEE International Conference on Software Maintenance and Evolution (ICSME).

[25]  Rongxin Wu,et al.  How Well Do Change Sequences Predict Defects? Sequence Learning from Software Changes , 2020, IEEE Transactions on Software Engineering.

[26]  Neeraj Kumar Goyal,et al.  Software development efforts prediction using artificial neural network , 2016, IET Softw..

[27]  Abram Hindle,et al.  Deep Green: Modelling Time-Series of Software Energy Consumption , 2017, 2017 IEEE International Conference on Software Maintenance and Evolution (ICSME).

[28]  Samy Bengio,et al.  Tensor2Tensor for Neural Machine Translation , 2018, AMTA.

[29]  Santanu Kumar Rath,et al.  Hybrid functional link artificial neural network approach for predicting maintainability of object-oriented software , 2016, J. Syst. Softw..

[30]  Saurabh Bagchi,et al.  PySE: Automatic Worst-Case Test Generation by Reinforcement Learning , 2019, 2019 12th IEEE Conference on Software Testing, Validation and Verification (ICST).

[31]  Amadeu Anderlin Neto A Strategy to Support Replications of Controlled Experiments in Software Engineering , 2019, SOEN.

[32]  Foutse Khomh,et al.  Deep Learning Anti-Patterns from Code Metrics History , 2019, 2019 IEEE International Conference on Software Maintenance and Evolution (ICSME).

[33]  Rishabh Singh,et al.  Learn&Fuzz: Machine learning for input fuzzing , 2017, 2017 32nd IEEE/ACM International Conference on Automated Software Engineering (ASE).

[34]  Long Chen,et al.  DeepLink: A Code Knowledge Graph Based Deep Learning Approach for Issue-Commit Link Recovery , 2019, 2019 IEEE 26th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[35]  Zhenchang Xing,et al.  Learning a dual-language vector space for domain-specific cross-lingual question retrieval , 2016, 2016 31st IEEE/ACM International Conference on Automated Software Engineering (ASE).

[36]  Gregory V. Wilson,et al.  On the difficulty of replicating human subjects studies in software engineering , 2008, 2008 ACM/IEEE 30th International Conference on Software Engineering.

[37]  Tim Menzies,et al.  A Deep Learning Model for Estimating Story Points , 2016, IEEE Transactions on Software Engineering.

[38]  Martin White,et al.  Deep learning code fragments for code clone detection , 2016, 2016 31st IEEE/ACM International Conference on Automated Software Engineering (ASE).

[39]  Piek T. J. M. Vossen,et al.  Replicability and reproducibility of research results for human language technology: introducing an LRE special section , 2017, Lang. Resour. Evaluation.

[40]  Audris Mockus,et al.  Variability and Reproducibility in Software Engineering: A Study of Four Companies that Developed the Same System , 2009, IEEE Transactions on Software Engineering.

[41]  Zhenchang Xing,et al.  Domain-specific machine translation with recurrent neural network for software localization , 2019, Empirical Software Engineering.

[42]  Xiaoguang Mao,et al.  CNN-FL: An Effective Approach for Localizing Faults using Convolutional Neural Networks , 2019, 2019 IEEE 26th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[43]  Jean Ponce,et al.  Computer Vision: A Modern Approach , 2002 .

[44]  Michael Pradel,et al.  NL2Type: Inferring JavaScript Function Types from Natural Language Information , 2019, 2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE).

[45]  Xiaodong Gu,et al.  Deep API learning , 2016, SIGSOFT FSE.

[46]  Christian Bird,et al.  Deep learning type inference , 2018, ESEC/SIGSOFT FSE.

[47]  Ling Xu,et al.  Automated change-prone class prediction on unlabeled dataset using unsupervised method , 2017, Inf. Softw. Technol..

[48]  Shin Yoo,et al.  Classifying False Positive Static Checker Alarms in Continuous Integration Using Convolutional Neural Networks , 2019, 2019 12th IEEE Conference on Software Testing, Validation and Verification (ICST).

[49]  Hui Yang,et al.  Deep Review Sharing , 2019, 2019 IEEE 26th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[50]  Jürgen Schmidhuber,et al.  Deep learning in neural networks: An overview , 2014, Neural Networks.

[51]  Hailong Sun,et al.  A Novel Neural Source Code Representation Based on Abstract Syntax Tree , 2019, 2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE).

[52]  Philip S. Yu,et al.  Multi-modal Attention Network Learning for Semantic Source Code Retrieval , 2019, 2019 34th IEEE/ACM International Conference on Automated Software Engineering (ASE).

[53]  Yan Xiao,et al.  Improving code readability classification using convolutional neural networks , 2018, Inf. Softw. Technol..

[54]  Eric Schulte,et al.  Using recurrent neural networks for decompilation , 2018, 2018 IEEE 25th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[55]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[56]  Nazareno Aguirre,et al.  Training Binary Classifiers as Data Structure Invariants , 2019, 2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE).

[57]  Christoph Treude,et al.  Automatic Generation of Pull Request Descriptions , 2019, 2019 34th IEEE/ACM International Conference on Automated Software Engineering (ASE).

[58]  Wei Li,et al.  DeepFL: integrating multiple fault diagnosis dimensions for deep fault localization , 2019, ISSTA.

[59]  Yuanfang Cai,et al.  TECCD: A Tree Embedding Approach for Code Clone Detection , 2019, 2019 IEEE International Conference on Software Maintenance and Evolution (ICSME).

[60]  Harald C. Gall,et al.  Using Docker Containers to Improve Reproducibility in Software and Web Engineering Research , 2016, ICWE.

[61]  Xin Peng,et al.  DeepLink: Recovering issue-commit links based on deep learning , 2019, J. Syst. Softw..

[62]  Joelle Pineau,et al.  How NOT To Evaluate Your Dialogue System: An Empirical Study of Unsupervised Evaluation Metrics for Dialogue Response Generation , 2016, EMNLP.

[63]  Yan Xiao,et al.  Improving bug localization with word embedding and enhanced convolutional neural networks , 2019, Inf. Softw. Technol..

[64]  Haibo Hu,et al.  A recommender system for developer onboarding , 2018, ICSE.

[65]  Taolue Chen,et al.  Augmenting Java method comments generation with context information based on neural networks , 2019, J. Syst. Softw..

[66]  Ameet Talwalkar,et al.  Random Search and Reproducibility for Neural Architecture Search , 2019, UAI.

[67]  Natalia Juristo Juzgado,et al.  Replications types in experimental disciplines , 2010, ESEM '10.

[68]  Chao Liu,et al.  A two-phase transfer learning model for cross-project defect prediction , 2019, Inf. Softw. Technol..

[69]  Hui Liu,et al.  Deep Learning Based Feature Envy Detection , 2018, 2018 33rd IEEE/ACM International Conference on Automated Software Engineering (ASE).

[70]  Collin McMillan,et al.  A Neural Model for Generating Natural Language Summaries of Program Subroutines , 2019, 2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE).

[71]  Natalia Juristo Juzgado,et al.  Understanding replication of experiments in software engineering: A classification , 2014, Inf. Softw. Technol..

[72]  Xiaodong Gu,et al.  Deep Code Search , 2018, 2018 IEEE/ACM 40th International Conference on Software Engineering (ICSE).

[73]  Chong-Ho Choi,et al.  Sensitivity analysis of multilayer perceptron with differentiable activation functions , 1992, IEEE Trans. Neural Networks.

[74]  Sancheng Peng,et al.  New deep learning method to detect code injection attacks on hybrid applications , 2018, J. Syst. Softw..

[75]  Alain Abran,et al.  Neural networks for predicting the duration of new software projects , 2015, J. Syst. Softw..

[76]  Thomas Fischer,et al.  Deep learning with long short-term memory networks for financial market predictions , 2017, Eur. J. Oper. Res..

[77]  Chin-Yew Lin,et al.  Looking for a Few Good Metrics: ROUGE and its Evaluation , 2004 .

[78]  Trong Duc Nguyen,et al.  A deep neural network language model with contexts for source code , 2018, 2018 IEEE 25th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[79]  Philip S. Yu,et al.  Improving Automatic Source Code Summarization via Deep Reinforcement Learning , 2018, 2018 33rd IEEE/ACM International Conference on Automated Software Engineering (ASE).

[80]  Lukás Burget,et al.  Recurrent neural network based language model , 2010, INTERSPEECH.

[81]  Song Wang,et al.  Automatically Learning Semantic Features for Defect Prediction , 2016, 2016 IEEE/ACM 38th International Conference on Software Engineering (ICSE).

[82]  Lingxiao Jiang,et al.  Bilateral Dependency Neural Networks for Cross-Language Algorithm Classification , 2019, 2019 IEEE 26th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[83]  Hongyu Zhang,et al.  DeepPerf: Performance Prediction for Configurable Software with Deep Sparse Neural Network , 2019, 2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE).

[84]  F. Wilcoxon Individual Comparisons by Ranking Methods , 1945 .

[85]  Anh Tuan Nguyen,et al.  Bug Localization with Combination of Deep Learning and Information Retrieval , 2017, 2017 IEEE/ACM 25th International Conference on Program Comprehension (ICPC).

[86]  Yuming Zhou,et al.  Connecting software metrics across versions to predict defects , 2017, 2018 IEEE 25th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[87]  Nizar Habash,et al.  Four Techniques for Online Handling of Out-of-Vocabulary Words in Arabic-English Statistical Machine Translation , 2008, ACL.

[88]  Christopher D. Manning,et al.  Get To The Point: Summarization with Pointer-Generator Networks , 2017, ACL.

[89]  Suman Jana,et al.  DeepTest: Automated Testing of Deep-Neural-Network-Driven Autonomous Cars , 2017, 2018 IEEE/ACM 40th International Conference on Software Engineering (ICSE).

[90]  Gang Zhao,et al.  DeepSim: deep learning code functional similarity , 2018, ESEC/SIGSOFT FSE.

[91]  Premkumar T. Devanbu,et al.  Are deep neural networks the best choice for modeling source code? , 2017, ESEC/SIGSOFT FSE.

[92]  Gabriele Bavota,et al.  On Learning Meaningful Code Changes Via Neural Machine Translation , 2019, 2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE).

[93]  Xin Xia,et al.  Improving Code Search with Co-Attentive Representation Learning , 2020, 2020 IEEE/ACM 28th International Conference on Program Comprehension (ICPC).

[94]  Hoa Khanh Dam,et al.  Automatic Feature Learning for Predicting Vulnerable Software Components , 2021, IEEE Transactions on Software Engineering.

[95]  Andrew W. Senior,et al.  Long short-term memory recurrent neural network architectures for large scale acoustic modeling , 2014, INTERSPEECH.

[96]  Chao Liu,et al.  Deep Metric Learning for Software Change-Proneness Prediction , 2018, IScIDE.

[97]  Michael R. Lyu,et al.  Automating App Review Response Generation , 2019, 2019 34th IEEE/ACM International Conference on Automated Software Engineering (ASE).

[98]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[99]  Koushik Sen,et al.  When deep learning met code search , 2019, ESEC/SIGSOFT FSE.

[100]  Georgios Gousios,et al.  A note on rigour and replicability , 2012, SOEN.

[101]  Moataz A. Ahmed,et al.  Machine Learning-Based Software Quality Prediction Models: State of the Art , 2013, 2013 International Conference on Information Science and Applications (ICISA).

[102]  Mengning Yang,et al.  Self-learning Change-prone Class Prediction , 2016, SEKE.

[103]  Artur Andrzejak,et al.  Learning-Based Recursive Aggregation of Abstract Syntax Trees for Code Clone Detection , 2019, 2019 IEEE 26th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[104]  Somesh Jha,et al.  Neural-augmented static analysis of Android communication , 2018, ESEC/SIGSOFT FSE.

[105]  Gabriele Bavota,et al.  Learning How to Mutate Source Code from Bug-Fixes , 2018, 2019 IEEE International Conference on Software Maintenance and Evolution (ICSME).

[106]  Zhenchang Xing,et al.  Predicting semantically linkable knowledge in developer online forums via convolutional neural network , 2016, 2016 31st IEEE/ACM International Conference on Automated Software Engineering (ASE).

[107]  Collin McMillan,et al.  Automatically generating commit messages from diffs using neural machine translation , 2017, 2017 32nd IEEE/ACM International Conference on Automated Software Engineering (ASE).

[108]  Tracy Hall,et al.  Reproducibility and replicability of software defect prediction studies , 2018, Inf. Softw. Technol..

[109]  Dong Liu,et al.  Unsupervised Deep Bug Report Summarization , 2018, 2018 IEEE/ACM 26th International Conference on Program Comprehension (ICPC).

[110]  Grigori Melnik,et al.  On the success of empirical studies in the international conference on software engineering , 2006, ICSE.

[111]  Pushmeet Kohli,et al.  Neuro-Symbolic Program Corrector for Introductory Programming Assignments , 2018, 2018 IEEE/ACM 40th International Conference on Software Engineering (ICSE).

[112]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[113]  Lionel C. Briand,et al.  Testing advanced driver assistance systems using multi-objective search and neural networks , 2016, 2016 31st IEEE/ACM International Conference on Automated Software Engineering (ASE).

[114]  Yueqi Chen,et al.  RENN: Efficient Reverse Execution with Neural-Network-Assisted Alias Analysis , 2019, 2019 34th IEEE/ACM International Conference on Automated Software Engineering (ASE).

[115]  Harald C. Gall,et al.  Software Mining Studies: Goals, Approaches, Artifacts, and Replicability , 2013, LASER Summer School.

[116]  Hermann Ney,et al.  A Deep Learning Approach to Machine Transliteration , 2009, WMT@EACL.

[117]  Darrel C. Ince,et al.  The case for open computer programs , 2012, Nature.

[118]  Arvinder Kaur,et al.  Application of machine learning methods for software effort prediction , 2010, SOEN.

[119]  Richard Szeliski,et al.  Computer Vision - Algorithms and Applications , 2011, Texts in Computer Science.

[120]  Sen Zhang,et al.  Systematic Comprehension for Developer Reply in Mobile System Forum , 2019, 2019 IEEE 26th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[121]  Yves Le Traon,et al.  Learning to Spot and Refactor Inconsistent Method Names , 2019, 2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE).

[122]  Natalia Juristo Juzgado,et al.  The role of non-exact replications in software engineering experiments , 2011, Empirical Software Engineering.

[123]  Zhenchang Xing,et al.  A Neural Model for Method Name Generation from Functional Description , 2019, 2019 IEEE 26th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[124]  Ling Xu,et al.  Learning to Aggregate: An Automated Aggregation Method for Software Quality Model , 2017, 2017 IEEE/ACM 39th International Conference on Software Engineering Companion (ICSE-C).

[125]  Long Chen,et al.  Neural Detection of Semantic Code Clones Via Tree-Based Convolution , 2019, 2019 IEEE/ACM 27th International Conference on Program Comprehension (ICPC).

[126]  M. Roper,et al.  Replication of Software Engineering Experiments , 2000 .

[127]  Xiangyu Zhang,et al.  Automatic Text Input Generation for Mobile Testing , 2017, 2017 IEEE/ACM 39th International Conference on Software Engineering (ICSE).

[128]  David Lo,et al.  Deep Transfer Bug Localization , 2019, IEEE Transactions on Software Engineering.

[129]  Yang Liu,et al.  From UI Design Image to GUI Skeleton: A Neural Machine Translator to Bootstrap Mobile GUI Implementation , 2018, 2018 IEEE/ACM 40th International Conference on Software Engineering (ICSE).

[130]  K. M. Annervaz,et al.  Towards Accurate Duplicate Bug Retrieval Using Deep Learning Techniques , 2017, 2017 IEEE International Conference on Software Maintenance and Evolution (ICSME).

[131]  Jaechang Nam,et al.  Deep Semantic Feature Learning for Software Defect Prediction , 2020, IEEE Transactions on Software Engineering.

[132]  Yutaka Matsuo,et al.  Learning Feature Representations from Change Dependency Graphs for Defect Prediction , 2017, 2017 IEEE 28th International Symposium on Software Reliability Engineering (ISSRE).

[133]  Terrence L. Fine,et al.  Feedforward Neural Network Methodology , 1999, Information Science and Statistics.

[134]  Lukás Burget,et al.  Extensions of recurrent neural network language model , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[135]  Lei Ma,et al.  Wuji: Automatic Online Combat Game Testing Using Evolutionary Deep Reinforcement Learning , 2019, 2019 34th IEEE/ACM International Conference on Automated Software Engineering (ASE).

[136]  Chao Chen,et al.  DRLgencert: Deep Learning-Based Automated Testing of Certificate Verification in SSL/TLS Implementations , 2018, 2018 IEEE International Conference on Software Maintenance and Evolution (ICSME).

[137]  Lorenzo Rosasco,et al.  Theory of Deep Learning III: explaining the non-overfitting puzzle , 2017, ArXiv.

[138]  Jesús M. González-Barahona,et al.  On the reproducibility of empirical software engineering studies based on data retrieved from development repositories , 2011, Empirical Software Engineering.

[139]  Hui Liu,et al.  Deep Learning Based Code Smell Detection , 2021, IEEE Transactions on Software Engineering.

[140]  A. T. C. Goh,et al.  Back-propagation neural networks for modeling complex systems , 1995, Artif. Intell. Eng..

[141]  Zhenchang Xing,et al.  Which Variables Should I Log? , 2021, IEEE Transactions on Software Engineering.

[142]  D. Cox,et al.  SOME QUICK SIGN TESTS FOR TREND IN LOCATION AND DISPERSION , 1955 .

[143]  Zhenchang Xing,et al.  ActionNet: Vision-Based Workflow Action Recognition From Programming Screencasts , 2019, 2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE).

[144]  Barbara G. Ryder,et al.  CCLearner: A Deep Learning-Based Clone Detection Approach , 2017, 2017 IEEE International Conference on Software Maintenance and Evolution (ICSME).

[145]  Jane Cleland-Huang,et al.  Semantically Enhanced Software Traceability Using Deep Learning Techniques , 2017, 2017 IEEE/ACM 39th International Conference on Software Engineering (ICSE).

[146]  Denys Poshyvanyk,et al.  Machine Learning-Based Prototyping of Graphical User Interfaces for Mobile Apps , 2018, IEEE Transactions on Software Engineering.

[147]  Gabriele Bavota,et al.  An Empirical Study on Learning Bug-Fixing Patches in the Wild via Neural Machine Translation , 2018, ACM Trans. Softw. Eng. Methodol..

[148]  Jason Weston,et al.  Natural Language Processing (Almost) from Scratch , 2011, J. Mach. Learn. Res..

[149]  Chao Liu,et al.  Recommending GitHub Projects for Developer Onboarding , 2018, IEEE Access.

[150]  Xiapu Luo,et al.  LDFR: Learning deep feature representation for software defect prediction , 2019, J. Syst. Softw..

[151]  Aysh Alhroob,et al.  The use of artificial neural networks for extracting actions and actors from requirements document , 2018, Inf. Softw. Technol..

[152]  Zhenchang Xing,et al.  Neural Network-based Detection of Self-Admitted Technical Debt: From Performance to Explainability , 2019, ACM Trans. Softw. Eng. Methodol..

[153]  Junjie Chen,et al.  Continuous Incident Triage for Large-Scale Online Service Systems , 2019, 2019 34th IEEE/ACM International Conference on Automated Software Engineering (ASE).

[154]  Joni-Kristian Kämäräinen,et al.  Differential Evolution Training Algorithm for Feed-Forward Neural Networks , 2003, Neural Processing Letters.

[155]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[156]  Alexander M. Rush,et al.  Character-Aware Neural Language Models , 2015, AAAI.

[157]  David Lo,et al.  Automating Intention Mining , 2020, IEEE Transactions on Software Engineering.

[158]  Pearl Brereton,et al.  Meta-analysis for families of experiments in software engineering: a systematic review and reproducibility and validity assessment , 2019, Empirical Software Engineering.

[159]  Boaz Barak,et al.  Deep double descent: where bigger models and more data hurt , 2019, ICLR.