Survey on Plagiarism Detection Systems and Their Comparison

Plagiarism occurs when a person uses someone’s work, ideas, words, expressions without giving the required attribution. Plagiarism is a common problem in fields like academia, Research papers, Publications, Patents, etc. In this paper, we deliberate the techniques for detecting the extrinsic plagiarism. These techniques are based on linguistic features, Semantic role labelling, vector space model, Fuzzy semantic string matching, and N-gram approach. And are tested on PAN plagiarism corpus 2009 and PAN plagiarism corpus 2011.

[1]  Naomie Salim,et al.  Plagiarism detection scheme based on Semantic Role Labeling , 2012, 2012 International Conference on Information Retrieval & Knowledge Management.

[2]  Sebastián A. Ríos,et al.  FastDocode: Finding Approximated Segments of N-Grams for Document Copy Detection - Lab Report for PAN at CLEF 2010 , 2010, CLEF.

[3]  Rasim M. Alguliyev,et al.  A linguistic treatment for automatic external plagiarism detection , 2017, Knowl. Based Syst..

[4]  Naomie Salim,et al.  An improved plagiarism detection scheme based on semantic role labeling , 2012, Appl. Soft Comput..

[5]  Cristian Grozea,et al.  Who's the Thief? Automatic Detection of the Direction of Plagiarism , 2010, CICLing.

[6]  Roman Kern,et al.  External and Intrinsic Plagiarism Detection Using Vector Space Models , 2009 .

[7]  Rasim M. Alguliyev,et al.  PDLK: Plagiarism detection using linguistic knowledge , 2015, Expert Syst. Appl..

[8]  Vishal Gupta,et al.  A Novel Technique for Detecting Plagiarism in Documents Exploiting Information Sources , 2017, Cognitive Computation.

[9]  Traian Rebedea,et al.  Automatic Plagiarism Detection System for Specialized Corpora , 2013, 2013 19th International Conference on Control Systems and Computer Science.

[10]  Asif Ekbal,et al.  Plagiarism detection in text using Vector Space Model , 2012, 2012 12th International Conference on Hybrid Intelligent Systems (HIS).

[11]  Ling Liu,et al.  Output privacy in data mining , 2011, TODS.

[12]  Ashraf S. Hussein Arabic document similarity analysis using n-grams and singular value decomposition , 2015, 2015 IEEE 9th International Conference on Research Challenges in Information Science (RCIS).

[13]  Juan D. Velásquez,et al.  Text mining applied to plagiarism detection: The use of words for detecting deviations in the writing style , 2013, Expert Syst. Appl..

[14]  Naomie Salim,et al.  Fuzzy Semantic-Based String Similarity for Extrinsic Plagiarism Detection - Lab Report for PAN at CLEF 2010 , 2010, CLEF.

[15]  Jeffrey Xu Yu,et al.  Efficient similarity joins for near-duplicate detection , 2011, TODS.

[16]  Sangeetha Jamal,et al.  An Improved SRL Based Plagiarism Detection Technique Using Sentence Ranking , 2015 .