Public Git Archive: A Big Code Dataset for All
暂无分享,去创建一个
[1] Andreas Krause,et al. Predicting Program Properties from "Big Code" , 2015, POPL.
[2] Rohan Padhye,et al. A study of external community contribution to open-source projects on GitHub , 2014, MSR 2014.
[3] Eirini Kalliamvakou,et al. An in-depth study of the promises and perils of mining GitHub , 2016, Empirical Software Engineering.
[4] Alexander Serebrenik,et al. Empirical Analysis of the Relationship between CC and SLOC in a Large Corpus of Java Methods , 2014, 2014 IEEE International Conference on Software Maintenance and Evolution.
[5] Claire Le Goues,et al. GenProg: A Generic Method for Automatic Software Repair , 2012, IEEE Transactions on Software Engineering.
[6] Jing Li,et al. The Qualitas Corpus: A Curated Collection of Java Code for Empirical Studies , 2010, 2010 Asia Pacific Software Engineering Conference.
[7] Charles A. Sutton,et al. Suggesting accurate method and class names , 2015, ESEC/SIGSOFT FSE.
[8] Zheng Gao,et al. To Type or Not to Type: Quantifying Detectable Bugs in JavaScript , 2017, 2017 IEEE/ACM 39th International Conference on Software Engineering (ICSE).
[9] Meiyappan Nagappan,et al. Diversity in software engineering research , 2016, Perspectives on Data Science for Software Engineering.
[10] Charles A. Sutton,et al. Mining source code repositories at massive scale using language modeling , 2013, 2013 10th Working Conference on Mining Software Repositories (MSR).
[11] Zhendong Su,et al. On the naturalness of software , 2012, ICSE 2012.
[12] Georgios Gousios,et al. Lean GHTorrent: GitHub data on demand , 2014, MSR 2014.
[13] Michael W. Godfrey,et al. Cloning by accident: an empirical study of source code cloning across software systems , 2005, 2005 International Symposium on Empirical Software Engineering, 2005..
[14] Mark Harman,et al. Automated software transplantation , 2015, ISSTA.
[15] Premkumar T. Devanbu,et al. On the naturalness of software , 2016, Commun. ACM.
[16] Benoit Baudry,et al. On Analyzing the Topology of Commit Histories in Decentralized Version Control Systems , 2014, 2014 IEEE International Conference on Software Maintenance and Evolution.
[17] Georgios Gousios,et al. The GHTorent dataset and tool suite , 2013, 2013 10th Working Conference on Mining Software Repositories (MSR).
[18] Baishakhi Ray,et al. Some from Here, Some from There: Cross-Project Code Reuse in GitHub , 2017, 2017 IEEE/ACM 14th International Conference on Mining Software Repositories (MSR).
[19] Jan Vitek,et al. DéjàVu: a map of code duplicates on GitHub , 2017, Proc. ACM Program. Lang..
[20] Stéphane Ducasse,et al. Semantic clustering: Identifying topics in source code , 2007, Inf. Softw. Technol..
[21] Jordi Cabot,et al. Findings from GitHub: Methods, Datasets and Limitations , 2016, 2016 IEEE/ACM 13th Working Conference on Mining Software Repositories (MSR).
[22] Hridesh Rajan,et al. A study of repetitiveness of code changes in software evolution , 2013, 2013 28th IEEE/ACM International Conference on Automated Software Engineering (ASE).
[23] Roberto Di Cosmo,et al. Software Heritage: Why and How to Preserve Software Source Code , 2017, iPRES.