Usage and attribution of Stack Overflow code snippets in GitHub projects

[1]  Mircea Lungu,et al.  Geo-locating the knowledge transfer in StackOverflow , 2013, SSE 2013.

[2]  Chaiyong Ragkhitwetsagul,et al.  Measuring Code Similarity in Large-Scaled Code Corpora , 2016, 2016 IEEE International Conference on Software Maintenance and Evolution (ICSME).

[3]  Eleni Stroulia,et al.  Involvement, contribution and influence in GitHub and stack overflow , 2014, CASCON.

[4]  Daniel M. Germán,et al.  Understanding the Usage, Impact, and Adoption of Non-OSI Approved Licenses , 2018, 2018 IEEE/ACM 15th International Conference on Mining Software Repositories (MSR).

[5]  Charles A. Sutton,et al.  Why, when, and what: Analyzing Stack Overflow questions by topic, type, and code , 2013, 2013 10th Working Conference on Mining Software Repositories (MSR).

[6]  Alexander Serebrenik,et al.  Gender, Representation and Online Participation: A Quantitative Study of StackOverflow , 2012, SocialInformatics.

[7]  Meiyappan Nagappan,et al.  Curating GitHub for engineered software projects , 2017, Empirical Software Engineering.

[8]  A. Nederhof Methods of coping with social desirability bias: A review. , 1985 .

[9]  Michele Lanza,et al.  Seahawk: Stack Overflow in the IDE , 2013, 2013 35th International Conference on Software Engineering (ICSE).

[10]  T. W. Korner,et al.  What makes a good code , 1988 .

[11]  Cristina V. Lopes,et al.  From Query to Usable Code: An Analysis of Stack Overflow Code Snippets , 2016, 2016 IEEE/ACM 13th Working Conference on Mining Software Repositories (MSR).

[12]  Georgios Gousios,et al.  The GHTorent dataset and tool suite , 2013, 2013 10th Working Conference on Mining Software Repositories (MSR).

[13]  Zhenchang Xing,et al.  What do developers search for on the web? , 2017, Empirical Software Engineering.

[14]  Christoph Treude,et al.  NLP2Code: Code Snippet Content Assist via Natural Language Tasks , 2017, 2017 IEEE International Conference on Software Maintenance and Evolution (ICSME).

[15]  Michael Backes,et al.  Stack Overflow Considered Harmful? The Impact of Copy&Paste on Android Application Security , 2017, 2017 IEEE Symposium on Security and Privacy (SP).

[16]  Jeffrey C. Carver,et al.  Building reputation in StackOverflow: An empirical investigation , 2013, 2013 10th Working Conference on Mining Software Repositories (MSR).

[17]  Alessandro Bozzon,et al.  Linking Accounts across Social Networks: the Case of StackOverflow, Github and Twitter , 2015, KDWeb.

[18]  Alessandro Bozzon,et al.  Asking the right question in collaborative q&a systems , 2014, HT.

[19]  Michael Philippsen,et al.  Finding Plagiarisms among a Set of Programs with JPlag , 2002, J. Univers. Comput. Sci..

[20]  Mariette DiChristina,et al.  Promises and Perils. , 2015 .

[21]  Cristina V. Lopes,et al.  Stack Overflow in Github: Any Snippets There? , 2017, 2017 IEEE/ACM 14th International Conference on Mining Software Repositories (MSR).

[22]  Gail C. Murphy,et al.  Do Software Developers Understand Open Source Licenses? , 2017, 2017 IEEE/ACM 25th International Conference on Program Comprehension (ICPC).

[23]  Daniel M. Germán,et al.  Code siblings: Technical and legal implications of copying code between applications , 2009, 2009 6th IEEE International Working Conference on Mining Software Repositories.

[24]  Reid Holmes,et al.  Making sense of online code snippets , 2013, 2013 10th Working Conference on Mining Software Repositories (MSR).

[25]  Seyed M. M. Tahaghoghi,et al.  Efficient plagiarism detection for large code repositories , 2007, Softw. Pract. Exp..

[26]  Michael Backes,et al.  You Get Where You're Looking for: The Impact of Information Sources on Code Security , 2016, 2016 IEEE Symposium on Security and Privacy (SP).

[27]  Pedro Rangel Henriques,et al.  Plagiarism Detection: A Tool Survey and Comparison , 2014, SLATE.

[28]  Amiram Yehudai,et al.  Example Overflow: Using social media for code recommendation , 2012, 2012 Third International Workshop on Recommendation Systems for Software Engineering (RSSE).

[29]  Christoph Treude,et al.  SOTorrent: Reconstructing and Analyzing the Evolution of Stack Overflow Posts , 2018, 2018 IEEE/ACM 15th International Conference on Mining Software Repositories (MSR).

[30]  Fintan Culwin,et al.  A Comparison of Source Code Plagiarism Detection Engines , 2004, Comput. Sci. Educ..

[31]  Nuthan Munaiah,et al.  Curating GitHub for engineered software projects , 2017, Empirical Software Engineering.

[32]  Scott R. Klemmer,et al.  Example-centric programming: integrating web search into the development environment , 2010, CHI.

[33]  Christopher Vendome,et al.  A Large Scale Study of License Usage on GitHub , 2015, 2015 IEEE/ACM 37th IEEE International Conference on Software Engineering.

[34]  Gabriele Bavota,et al.  Automatically assessing code understandability: How far are we? , 2017, 2017 32nd IEEE/ACM International Conference on Automated Software Engineering (ASE).

[35]  Gail C. Murphy,et al.  Investigating whether and how software developers understand open source software licensing , 2018, Empirical Software Engineering.

[36]  Miryung Kim,et al.  Are Code Examples on an Online Q&A Forum Reliable?: A Study of API Misuse on Stack Overflow , 2018, 2018 IEEE/ACM 40th International Conference on Software Engineering (ICSE).

[37]  Jan Vitek,et al.  DéjàVu: a map of code duplicates on GitHub , 2017, Proc. ACM Program. Lang..

[38]  Alexander Serebrenik,et al.  StackOverflow and GitHub: Associations between Software Development and Crowdsourced Knowledge , 2013, 2013 International Conference on Social Computing.

[39]  Joachim Henkel,et al.  License risks from ad hoc reuse of code from the internet , 2011, Commun. ACM.

[40]  Christoph Treude,et al.  Augmenting API Documentation with Insights from Stack Overflow , 2016, 2016 IEEE/ACM 38th International Conference on Software Engineering (ICSE).

[41]  Michele Lanza,et al.  Understanding and Classifying the Quality of Technical Forum Questions , 2014, 2014 14th International Conference on Quality Software.

[42]  Andrew M. St. Laurent Understanding Open Source and Free Software Licensing , 2004 .

[43]  David Lo,et al.  An empirical study on developer interactions in StackOverflow , 2013, SAC '13.

[44]  Frank Maurer,et al.  What makes a good code example?: A study of programming Q&A in StackOverflow , 2012, 2012 28th IEEE International Conference on Software Maintenance (ICSM).

[45]  Rabe Abdalkareem,et al.  On code reuse from StackOverflow: An exploratory study on Android apps , 2017, Inf. Softw. Technol..

[46]  Jeffrey S. Bucholtz,et al.  UNITED STATES DISTRICT COURT FOR THE NORTHERN DISTRICT OF CALIFORNIA , 2008 .

[47]  Daniel M. Germán,et al.  License integration patterns: Addressing license mismatches in component-based development , 2009, 2009 IEEE 31st International Conference on Software Engineering.

[48]  Baishakhi Ray,et al.  Some from Here, Some from There: Cross-Project Code Reuse in GitHub , 2017, 2017 IEEE/ACM 14th International Conference on Mining Software Repositories (MSR).

[49]  Daniela E. Damian,et al.  The promises and perils of mining GitHub , 2009, MSR 2014.

[50]  Alberto Bacchelli,et al.  Quality Questions Need Quality Code: Classifying Code Fragments on Stack Overflow , 2015, 2015 IEEE/ACM 12th Working Conference on Mining Software Repositories.

[51]  Christoph Treude,et al.  Understanding Stack Overflow Code Fragments , 2017, 2017 IEEE International Conference on Software Maintenance and Evolution (ICSME).

[52]  Emerson R. Murphy-Hill,et al.  Is programming knowledge related to age? An exploration of stack overflow , 2013, 2013 10th Working Conference on Mining Software Repositories (MSR).

[53]  Georgios Gousios,et al.  GHTorrent: Github's data from a firehose , 2012, 2012 9th IEEE Working Conference on Mining Software Repositories (MSR).

[54]  M. Begg An introduction to categorical data analysis (2nd edn). Alan Agresti, John Wiley & Sons, Inc., Hoboken, New Jersey, 2007. No. of Pages: 400. Price: $100.95. ISBN: 978‐0‐471‐22618‐5 , 2009 .

[55]  Cristina V. Lopes,et al.  SourcererCC: Scaling Code Clone Detection to Big-Code , 2015, 2016 IEEE/ACM 38th International Conference on Software Engineering (ICSE).

[56]  A. Agresti An introduction to categorical data analysis , 1997 .

[57]  Justin Zobel,et al.  Efficient plagiarism detection for large code repositories , 2007 .

[58]  Foutse Khomh,et al.  Stack Overflow: A code laundering platform? , 2017, 2017 IEEE 24th International Conference on Software Analysis, Evolution and Reengineering (SANER).

[59]  Christoph Treude,et al.  How do programmers ask and answer questions on the web?: NIER track , 2011, 2011 33rd International Conference on Software Engineering (ICSE).

[60]  James E. Bartlett,et al.  Organizational research: Determining appropriate sample size in survey research , 2001 .

[61]  Chanchal Kumar Roy,et al.  Comparison and evaluation of code clone detection techniques and tools: A qualitative approach , 2009, Sci. Comput. Program..