Investigating Student Plagiarism Patterns and Correlations to Grades

We analyzed 6 semesters of data from a large enrollment data structures course to identify instances of plagiarism in 4 assignments. We find that the majority of the identified plagiarism instances involve cross-semester cheating and are performed by students for whom the plagiarism is an isolated event (in the studied assignments). Second, we find that providing students an opportunity to work with a partner doesn't decrease the incidence of plagiarism. Third, while plagiarism on a given assignment is correlated with better than average scores on that assignment, plagiarism is negatively correlated with final grades in both the course that the plagiarism occurred and in a subsequent related course. Finally, we briefly describe the Algae open-source suite of plagiarism detectors and characterize the kinds of obfuscation that students apply to their plagiarized submissions and observe that no single algorithm appears to be sufficient to detect all of the cases.

[1]  M. Zeidner,et al.  Test Anxiety: The State of the Art , 1998 .

[2]  Manuel Cebrián,et al.  AC: An Integrated Source Code Plagiarism Detection Environment , 2007, ArXiv.

[3]  Christian S. Collberg,et al.  A Taxonomy of Obfuscating Transformations , 1997 .

[4]  K.W. Bowyer,et al.  Experience using "MOSS" to detect cheating on programming assignments , 1999, FIE'99 Frontiers in Education. 29th Annual Frontiers in Education Conference. Designing the Future of Science and Engineering Education. Conference Proceedings (IEEE Cat. No.99CH37011.

[5]  Justin Zobel,et al.  Efficient plagiarism detection for large code repositories , 2007 .

[6]  Daniel Shawcross Wilkerson,et al.  Winnowing: local algorithms for document fingerprinting , 2003, SIGMOD '03.

[7]  Michael Philippsen,et al.  Finding Plagiarisms among a Set of Programs with JPlag , 2002, J. Univers. Comput. Sci..

[8]  Samuel L. Grier,et al.  A tool that detects plagiarism in Pascal programs , 1981, SIGCSE '81.

[9]  Michelle Craig,et al.  Plagiarism detection using feature-based neural networks , 2007, SIGCSE.

[10]  Judithe Sheard,et al.  Cheating and plagiarism: perceptions and practices of first year IT students , 2002, ITiCSE '02.

[11]  Sencun Zhu,et al.  Value-based program characterization and its application to software plagiarism detection , 2011, 2011 33rd International Conference on Software Engineering (ICSE).

[12]  C. Jinshong Hwang,et al.  Using an effective grading method for preventing plagiarism of programming assignments , 1982, SIGCSE '82.

[13]  A. Pai,et al.  Plagiarism Detection System , 2013 .

[14]  Geoff Whale Software metrics and plagiarism detection , 1990, J. Syst. Softw..

[15]  References , 1971 .

[16]  Nicholas Tran,et al.  Sim: a utility for detecting similarity in computer programs , 1999, SIGCSE '99.