A comparison of plagiarism detection tools

In this paper we compare five tools for detecting plagiarism in source code: JPlag, Marble, moss, Plaggie, and sim. The tools are compared with respect to their features and their performance. For the performance comparison we carried out two experiments. To compare the sensitivity of the tools to different plagiarism techniques, we applied them to a set of intentionally plagiarised programs. To get a picture of the precision of the tools, we ran them on several incarnations of a student assignment and compared the top-10 lists of the results.
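One possible way to compare top-10 lists across tools is to count how many submission pairs two tools both rank among their ten most similar. The sketch below is only an illustration under stated assumptions: the function names, the input format (one ranked list of submission pairs per tool), and the sample rankings are hypothetical and are not taken from the paper or from any of the tools.

```python
from itertools import combinations


def top_n(ranked_pairs, n=10):
    """Return the n highest-ranked submission pairs as an order-insensitive set."""
    return {frozenset(pair) for pair in ranked_pairs[:n]}


def top_n_overlap(results, n=10):
    """Map each pair of tools to the number of submission pairs their top-n lists share."""
    tops = {tool: top_n(pairs, n) for tool, pairs in results.items()}
    return {
        (a, b): len(tops[a] & tops[b])
        for a, b in combinations(sorted(tops), 2)
    }


# Made-up rankings for illustration only (most similar pair first);
# the tool and submission names are placeholders, not real results.
results = {
    "tool_a": [("s1", "s2"), ("s3", "s4"), ("s5", "s6")],
    "tool_b": [("s2", "s1"), ("s5", "s6"), ("s7", "s8")],
}
overlap = top_n_overlap(results, n=3)
```

Pairs are stored as `frozenset` values so that a pair reported as (s1, s2) by one tool matches (s2, s1) reported by another. A low overlap does not by itself say which tool is more precise; the shared and unshared pairs would still need manual inspection.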
