Plagiarism Detection Techniques for Arabic Script Languages: A Literature Review

Plagiarism is generally defined as literary theft and academic dishonesty. This considered as the serious issue in an academic documents and texts. There are numerous of plagiarism detection techniques have been developed for various natural languages, mainly English. In this paper we investigate and review the plagiarism detection techniques and algorithms which have been developed for Arabic Script Languages (ASL), and providing a literature review of the utilized methods in terms of techniques and outcomes.  The result of this paper will help the researchers who are going to commence their development and extend their researches in ASL like Arabic, Persian, Urdu, and Kurdish.

[1]  Taher Rahgooy,et al.  Persian Plagiarism Detection Using Sentence Correlations , 2016, FIRE.

[2]  James A. Malcolm,et al.  Plagiarism is Easy, but also Easy To Detect , 2006 .

[3]  Mohamed El Bachir Menai,et al.  Naïve Bayes classifiers for authorship attribution of Arabic texts , 2014, J. King Saud Univ. Comput. Inf. Sci..

[4]  Halim Sayoud,et al.  Authorship Attribution of Short Historical Arabic Texts Based on Lexical Features , 2013, 2013 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery.

[5]  Kayvan Bijari,et al.  A Deep Learning Approach to Persian Plagiarism Detection , 2016, FIRE.

[6]  Naomie Salim,et al.  Existing plagiarism detection techniques: A systematic mapping of the scholarly literature , 2015, Online Inf. Rev..

[7]  Naomie Salim,et al.  Work in Progress: Developing Arabic Plagiarism Detection Tool for E-Learning Systems , 2009, 2009 International Association of Computer Science and Information Technology - Spring Conference.

[8]  Abdul Wahab,et al.  Copy detection in urdu language documents using n-grams model , 2011, International Conference on Computer Networks and Information Technology.

[9]  Paolo Rosso,et al.  Intrinsic Plagiarism Detection in Arabic Text: Preliminary Experiments , 2012 .

[10]  Ashraf Elnagar,et al.  A Plagiarism Detection System for Arabic Text-Based Documents , 2012, PAISI.

[11]  Ahmed Fawzi Otoom,et al.  Towards author identification of Arabic text articles , 2014, 2014 5th International Conference on Information and Communication Systems (ICICS).

[12]  Faramarz Safi Esfahani,et al.  A Plagiarism Detection Approach Based on SVM for Persian Texts , 2016, FIRE.

[13]  Paul Dourish What is Plagiarism , 2011 .

[14]  Hamid Ahangarbahan,et al.  A Fuzzy Approach for Ambiguity Reduction in Text Similarity Estimation (Case Study: Persian Web Contents) , 2015 .

[15]  Naomie Salim,et al.  Understanding Plagiarism Linguistic Patterns, Textual Features, and Detection Methods , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[16]  Mohran H. J. Al-Bayed Intelligent Plagiarism Detection for Electronic Documents , 2017 .

[17]  Mahinnaz Mirdehghan,et al.  Persian, Urdu, and Pashto: A comparative orthographic analysis , 2010 .

[18]  Azadeh Shakery,et al.  A Pairwise Document Analysis Approach for Monolingual Plagiarism Detection , 2016, FIRE.

[19]  Rakian Shima,et al.  A PERSIAN FUZZY PLAGIARISM DETECTION APPROACH , 2015 .

[20]  Morteza Rezaei Sharifabadi,et al.  Mahak Samim: A Corpus of Persian Academic Texts for Evaluating Plagiarism Detection Systems , 2016, FIRE.

[21]  Naomie Salim,et al.  Survey of Text Plagiarism Detection , 2012 .

[22]  Salar Mohtaj,et al.  Developing Monolingual Persian Corpus for Extrinsic Plagiarism Detection Using Artificial Obfuscation: Notebook for PAN at CLEF 2015 , 2015, CLEF.

[23]  A. Elnagar,et al.  A fingerprinting-based plagiarism detection system for Arabic text-based documents , 2012, 2012 8th International Conference on Computing Technology and Information Management (NCM and ICNIT).

[24]  Naomie Salim,et al.  Features Based Text Similarity Detection , 2010, ArXiv.

[25]  Kayvan Bijari,et al.  Graph-based Approach to Text Alignment for Plagiarism Detection in Persian Documents , 2016, FIRE.

[26]  Sh. Rafieian,et al.  Plagiarism checker for Persian (PCP) texts using hash-based tree representative fingerprinting , 2016 .

[27]  Hermann A. Maurer,et al.  Plagiarism - A Survey , 2006, J. Univers. Comput. Sci..

[28]  Agha Ali Raza,et al.  N-Gram Based Authorship Attribution in Urdu Poetry , 2009 .

[29]  Mohamed El Bachir Menai,et al.  Detection of Plagiarism in Arabic Documents , 2012 .

[30]  Mohamed El Bachir Menai,et al.  APlag: A plagiarism checker for Arabic texts , 2011, 2011 6th International Conference on Computer Science & Education (ICCSE).

[31]  Naomie Salim,et al.  On the use of fuzzy information retrieval for gauging similarity of Arabic documents , 2009, 2009 Second International Conference on the Applications of Digital Information and Web Technologies.

[32]  Lee Gillam,et al.  From English to Persian: Conversion of Text Alignment for Plagiarism Detection , 2016, FIRE.

[33]  Maryam Mahmoodi,et al.  Design a Persian Automated Plagiarism Detector (AMZPPD) , 2014, ArXiv.

[34]  F. Safi-Esfahani,et al.  English-Persian Plagiarism Detection based on a Semantic Approach , 2017 .

[35]  Vaclav Snasel,et al.  Survey of Plagiarism Detection Methods , 2011, 2011 Fifth Asia Modelling Symposium.

[36]  A. S. Bin-Habtoor,et al.  A Survey on Plagiarism Detection Systems , 2012 .