A Knowledge Based Approach to Detection of Idea Plagiarism in Online Research Publications

Plagiarism is on the rise because the World Wide Web makes information easy to access, and the number of web pages grows daily. Researchers want to be well connected globally to popularize their ideas, so making research documents available for download is inevitable. However, this openness is exploited by those who misuse the material. Even unknowingly, a researcher may end up copying verbatim the ideas or conclusions of earlier researchers when quoting or using them in their own paper. This paper presents an analysis of NLP-based plagiarism detection approaches, which leads to a proposal for an ontology-based solution that detects text plagiarism more meaningfully. We address word-for-word copying and paraphrasing techniques and investigate the use of ontology in detecting idea plagiarism. The main objective is to investigate the exclusion of the ‘Related Work’ section and the use of WordNet for plagiarism detection in research publications.

Keywords: Plagiarism, WordNet, Ontology, Natural
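
The two ideas named in the abstract (leaving out the ‘Related Work’ section and comparing texts at the level of WordNet-style concepts rather than surface words) can be illustrated with a minimal sketch. This is not the paper's method: the `TOY_SYNSETS` table is a hypothetical stand-in for WordNet, the section dictionaries are made-up inputs, and the Jaccard overlap is an assumed similarity measure chosen only for brevity.

```python
import re

# Hypothetical stand-in for WordNet: maps a word to a shared concept label.
# A real system would look these up in WordNet instead of a hand-made table.
TOY_SYNSETS = {
    "car": "vehicle", "automobile": "vehicle",
    "quick": "fast", "rapid": "fast",
    "method": "approach", "technique": "approach",
}

def concepts(text):
    """Lowercase the text, split into words, and map each word to its concept."""
    words = re.findall(r"[a-z]+", text.lower())
    return {TOY_SYNSETS.get(word, word) for word in words}

def body_concepts(sections, excluded=("related work",)):
    """Collect concepts from every section except the excluded headings."""
    collected = set()
    for heading, text in sections.items():
        if heading.strip().lower() in excluded:
            continue  # legitimate summaries of prior work are not compared
        collected |= concepts(text)
    return collected

def jaccard(a, b):
    """Share of concepts the two sets have in common (0.0 when both are empty)."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)

# Made-up example documents, keyed by section heading.
source = {"Method": "a rapid technique"}
suspect = {
    "Related Work": "Smith proposed a car",
    "Method": "a quick approach",
}

score = jaccard(body_concepts(source), body_concepts(suspect))
```

In this toy input the two ‘Method’ sections share no content word, yet they map to the same concepts, so a concept-level comparison flags the paraphrase while the suspect's ‘Related Work’ text is ignored.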
