AISB 2008 Convention Communication, Interaction and Social Intelligence

Techniques developed for synchronic text classification problems are applied to a significantly diachronic dataset. The scale of the temporal categories appears to matter. The problem addressed is that of using automated text classification methods to temporally locateThe Donation of Constantine. The results reported do not contradict the analysis of Lorenzo Valla from 1440, claiming the document a forgery, but suggest that it is a very good forgery. This contributes to establishing the validity of these classification methods as applied to temporal categories and small datasets.

[1]  Eduard H. Hovy,et al.  Pragmatics and Natural Language Generation , 1990, Artif. Intell..

[2]  David Sharp,et al.  Ngram and Bayesian Classification of Documents for Topic and Authorship , 2003, Lit. Linguistic Comput..

[3]  J. Grieve Quantitative authorship attribution:a history and evaluation of techniques , 2005 .

[4]  Eduard Hovy,et al.  Generating Natural Language Under Pragmatic Constraints , 1988 .

[5]  Ehud Reiter,et al.  Squibs and Discussions: Human Variation and Lexical Choice , 2002, CL.

[6]  H. van Halteren,et al.  Outside the cave of shadows: using syntactic annotation to enhance authorship attribution , 1996 .

[7]  John Burrows,et al.  Word-Patterns and Story-Shapes: The Statistical Analysis of Narrative Style , 1987 .

[8]  M. Pagel,et al.  Frequency of word-use predicts rates of lexical evolution throughout Indo-European history , 2007, Nature.

[9]  Graeme Hirst,et al.  Book Reviews: Longman Grammar of Spoken and Written English , 2001, Computational Linguistics.

[10]  Fuchun Peng,et al.  N-GRAM-BASED AUTHOR PROFILES FOR AUTHORSHIP ATTRIBUTION , 2003 .

[11]  Grzegorz Kondrak,et al.  Computing and Historical Phonology , 2007, SIGMORPHON.

[12]  ENVIR,et al.  FOOD FOR HEALTH. , 1933, California and western medicine.

[13]  Ehud Reiter,et al.  Knowledge Acquisition for Natural Language Generation , 2000, INLG.

[14]  Jan Svartvik,et al.  A __ comprehensive grammar of the English language , 1988 .

[15]  David L. Hoover,et al.  Delta Prime? , 2004, Lit. Linguistic Comput..

[16]  Carl Vogel,et al.  N-gram Distributions in Texts as Proxy for Textual Fingerprints , 2007 .

[17]  Bas Aarts,et al.  Syntactic gradience : the nature of grammatical indeterminacy , 2007 .

[18]  Adam Kilgarriff,et al.  Language is never, ever, ever, random , 2005 .

[19]  Susan Brewer,et al.  Information storage and retrieval , 1959, ACM '59.

[20]  Adam Kilgarriff,et al.  Corpus Similarity and Homogeneity via Word Frequency , 1996 .

[21]  Carole E. Chaski,et al.  Empirical evaluations of language-based author identification techniques , 2001 .

[22]  Frederick Mosteller,et al.  Applied Bayesian and classical inference : the case of the Federalist papers , 1984 .

[23]  Efstathios Stamatatos,et al.  A user-assisted business letter generator dealing with text's stylistic variations , 1997, Proceedings Ninth IEEE International Conference on Tools with Artificial Intelligence.

[24]  Simon Kirby,et al.  Measuring Language Divergence by Intra-Lexical Comparison , 2006, ACL.

[25]  Carl Vogel,et al.  Hearing Voices in the Poetry of Brendan Kennelly , 2007 .

[26]  Ehud Reiter,et al.  Should Corpora Texts Be Gold Standards for NLG? , 2002, INLG.

[27]  Piskorski Jakub,et al.  Mining Massive Data Sets for Security , 2008 .

[28]  Carl Vogel,et al.  Group Dialects in an Online Community , 2007 .

[29]  J. Milton,et al.  Language Independent Authorship Attribution using Character Level Language Models , 2003 .

[30]  Efstathios Stamatatos,et al.  Computer-Based Authorship Attribution Without Lexical Measures , 2001, Comput. Humanit..

[31]  Hichem Frigui,et al.  Simultaneous Clustering and Dynamic Keyword Weighting for Text Documents , 2004 .

[32]  Erez Lieberman,et al.  Quantifying the evolutionary dynamics of language , 2007, Nature.

[33]  C. G. Herbermann Book Review: The Catholic Encyclopedia; An International Work of Reference on the Constitution, Doctrine, Discipline, and History of the Catholic Church , 1914 .