论文信息 - Plagiarism Alignment Detection by Merging Context Seeds Notebook for PAN at CLEF 2014

Plagiarism Alignment Detection by Merging Context Seeds Notebook for PAN at CLEF 2014

We describe our submitted algorithm to the text alignment sub-task of the plagiarism detection task in the PAN2014 challenge that achieved a plagdet score 0.855. By extracting contextual features for each document character and grouping those that are relevant for a given pair of documents, we generate seeds of atomic plagiarism cases. These are then merged by an agglomerative singlelinkage strategy using a defined distance measure.

Philipp Gross | Pashutan Modaresi

[1] Benno Stein,et al. An Evaluation Framework for Plagiarism Detection , 2010, COLING.

[2] Benno Stein,et al. Recent Trends in Digital Text Forensics and Its Evaluation - Plagiarism Detection, Author Identification, and Author Profiling , 2013, CLEF.