论文信息 - Meta Analysis within Authorship Verification

Meta Analysis within Authorship Verification

In an authorship verification problem one is given writing examples from an author A, and one is asked to determine whether or not each text in fact was written by A. In a more general form of the authorship verification problem one is given a single document d only, and the question is whether or not d contains sections from other authors. The heart of authorship verification is the quantization of an author's writing style along with an outlier analysis to identify anomalies. Human readers are well-versed in detecting such spurious sections since they combine a highly-developed sense for wording with context-dependent meta knowledge in their analysis. The intention of this paper is to compile an overview of the algorithmic building blocks for authorship verification. In particular, we introduce authorship verification problems as decision problems, discuss possibilities for the use of meta knowledge, and apply meta analysis to post- process unreliable style analysis results. Our meta analysis combines a confidence-based majority decision with the unmasking approach of Koppel and Schler. With this strategy we can improve the analysis quality in our experiments by 33% in terms of the F-measure.

Benno Stein | Nedim Lipka | Sven Meyer zu Eissen

[1] Efstathios Stamatatos. Author Identification Using Imbalanced and Limited Training Texts , 2007 .

[2] Benno Stein,et al. Genre classification of Web pages user study and feasibility analysis , 2004 .

[3] Sven Meyer. Genre Classification of Web Pages User Study and Feasibility Analysis , 2004 .

[4] Efstathios Stamatatos,et al. Computer-Based Authorship Attribution Without Lexical Measures , 2001, Comput. Humanit..

[5] Mark Stefik,et al. Introduction to knowledge systems , 1995 .

[6] Mitchell P. Marcus,et al. Topic segmentation: algorithms and applications , 1998 .

[7] Robert P. W. Duin,et al. Combining One-Class Classifiers , 2001, Multiple Classifier Systems.

[8] R. P. Fishburne,et al. Derivation of New Readability Formulas (Automated Readability Index, Fog Count and Flesch Reading Ease Formula) for Navy Enlisted Personnel , 1975 .

[9] Moshe Koppel,et al. Exploiting Stylistic Idiosyncrasies for Authorship Attribution , 2003 .

[10] J. Chall,et al. A FORMULA FOR PREDICTING READABILITY , 1948 .

[11] Benno Stein,et al. Genre Classification of Web Pages , 2004, KI.