Fingerprinting based detection system for identifying plagiarism in Malayalam text documents

Plagiarism is a serious problem in the present day scenario. The easy availability of information of all sorts over the web is the major reason. This paper presents a detection system based on fingerprinting for identifying copy in Malayalam text-based documents. The challenge in identifying plagiarism in Malayalam documents is due to the intricate linguistic composition of Malayalam. Malayalam is agglutinative as well as morphologically rich language. In this paper, a procedure for plagiarism detection of Malayalam documents to identify similarity between documents is presented. This method establishes the extent of similarity between any two documents. The winnowing algorithm is used to compute the fingerprints at sentence level. The method improves the search time with more accuracy in the detection process.

[1]  Mohamed El Bachir Menai,et al.  APlag: A plagiarism checker for Arabic texts , 2011, 2011 6th International Conference on Computer Science & Education (ICCSE).

[2]  Hector Garcia-Molina,et al.  SCAM: A Copy Detection Mechanism for Digital Documents , 1995, DL.

[3]  Richard M. Karp,et al.  Efficient Randomized Pattern-Matching Algorithms , 1987, IBM J. Res. Dev..

[4]  Mohamed El Bachir Menai,et al.  Similarity detection in Java programming assignments , 2010, 2010 5th International Conference on Computer Science & Education.

[5]  Benno Stein,et al.  Near Similarity Search and Plagiarism Analysis , 2005, GfKl.

[6]  Hector Garcia-Molina,et al.  Copy detection mechanisms for digital documents , 1995, SIGMOD '95.

[7]  SteinBenno,et al.  Plagiarism analysis, authorship identification, and near-duplicate detection PAN'07 , 2007 .

[8]  Justin Zobel,et al.  Methods for Identifying Versioned and Plagiarized Documents , 2003, J. Assoc. Inf. Sci. Technol..

[9]  Máté Pataki Plagiarism Detection and Document Chunking Methods , 2003, WWW.

[10]  Benno Stein,et al.  Plagiarism Detection Without Reference Collections , 2006, GfKl.

[11]  Rynson W. H. Lau,et al.  CHECK: a document plagiarism detection system , 1997, SAC '97.

[12]  Daniel Shawcross Wilkerson,et al.  Winnowing: local algorithms for document fingerprinting , 2003, SIGMOD '03.

[13]  Janis Grundspenkis,et al.  Computer-based plagiarism detection methods and tools: an overview , 2007, CompSysTech '07.