Distance indices for the detection of similarity in C programs

There has been proliferation in the use of plagiarized articles or source code amongst student and research community. This paper focus on an efficient method that can differentiate between plagiarized and non-plagiarized programs. Similarity/Distance measurement techniques are used to classify the test file. Thirty six distance metrics are used to determine intra class and inter class proximity. Unseen file not used for frequency extraction are predicted with higher accuracy. This depict that our proposed model using intra/inter family threshold can be implemented to identify plagiarized programs with better detection rate.

[1]  Jens Krinke,et al.  Identifying similar code with program dependence graphs , 2001, Proceedings Eighth Working Conference on Reverse Engineering.

[2]  Zhoujun Li,et al.  BUAA_AntiPlagiarism: A System To Detect Plagiarism for C Source Code , 2009, 2009 International Conference on Computational Intelligence and Software Engineering.

[3]  Seyed M. M. Tahaghoghi,et al.  Plagiarism detection across programming languages , 2006, ACSC.

[4]  A. S. Bin-Habtoor,et al.  A Survey on Plagiarism Detection Systems , 2012 .

[5]  Hermann A. Maurer,et al.  Plagiarism - A Survey , 2006, J. Univers. Comput. Sci..

[6]  Georgina Cosma,et al.  An Approach to Source-Code Plagiarism Detection and Investigation Using Latent Semantic Analysis , 2012, IEEE Transactions on Computers.

[7]  Romain Robbes,et al.  Language-Independent Clone Detection Applied to Plagiarism Detection , 2010, 2010 10th IEEE Working Conference on Source Code Analysis and Manipulation.

[8]  Doreswamy A Study on Similarity Measure Functions on Engineering Materials Selection , 2011 .

[9]  Basavaraju Muddu,et al.  CPDP: A robust technique for plagiarism detection in source code , 2013, 2013 7th International Workshop on Software Clones (IWSC).

[10]  Wolfgang Granzer Source Code Plagiarism in Computer Engineering Courses , 2013 .

[11]  Inggriani Liem,et al.  Automatic Source Code Plagiarism Detection , 2009, 2009 10th ACIS International Conference on Software Engineering, Artificial Intelligences, Networking and Parallel/Distributed Computing.

[12]  Sung-Hyuk Cha Comprehensive Survey on Distance/Similarity Measures between Probability Density Functions , 2007 .

[13]  Xiao Li,et al.  The Source Code Plagiarism Detection Using AST , 2010, 2010 International Symposium on Intelligence Information Processing and Trusted Computing.

[14]  Mike Joy,et al.  Towards a Definition of Source-Code Plagiarism , 2008, IEEE Transactions on Education.