Plagiarism Detection by Identifying the Equations

Abstract In academia Plagiarism means copying of other work without author's permission. Presently available system mainly focuses on software plagiarism. They mainly based on token analysis, linguistic patterns, taxonomy and textual features. In this paper we mainly concentrate on research papers to check whether the documents are plagiarized or not. So far not much work has been done to detect plagiarism in research document. Our work focuses on the similarity of different simple equations present in a document. It can easily extract those equations from the documents, compare them even if the variables are changed in plagiarized document with the original one and can detect if the document is plagiarized or not. This method will not work if the research paper does not contain any equation.

[1]  Hwan-Gue Cho,et al.  Detecting and tracing plagiarized documents by reconstruction plagiarism-evolution tree , 2008, 2008 8th IEEE International Conference on Computer and Information Technology.

[2]  Naomie Salim,et al.  Understanding Plagiarism Linguistic Patterns, Textual Features, and Detection Methods , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[3]  Lila Guterman Plagiarism and Other Sins Seem Rife in Science Journals, a Digital Sleuth Finds. , 2008 .

[4]  Baojiang Cui,et al.  Type Redefinition Plagiarism Detection of Token-Based Comparison , 2010, 2010 International Conference on Multimedia Information Networking and Security.

[5]  Shinji Kusumoto,et al.  CCFinder: A Multilinguistic Token-Based Code Clone Detection System for Large Scale Source Code , 2002, IEEE Trans. Software Eng..