Lachmannian Archetype Reconstruction for Ancient Manuscript Corpora

Two goals are targeted by computer philology for ancient manuscript corpora: firstly, making an edition, that is roughly speaking one text version representing the whole corpus, which contains variety induced through copy errors and other processes and secondly, producing a stemma. A stemma is a graphbased visualization of the copy history with manuscripts as nodes and copy events as edges. Its root, the so-called archetype, is the supposed original text or urtext from which all subsequent copies are made. Our main contribution is to present one of the first computational approaches to automatic archetype reconstruction and to introduce the first textbased evaluation for automatically produced archetypes. We compare a philologically generated archetype with one generated by bioinformatic software.

[1]  M. Spencer,et al.  Phylogenetics of artificial manuscripts. , 2004, Journal of theoretical biology.

[2]  Robert J. O’Hara Trees of History in Systematics and Philology , 1996 .

[3]  Ziheng Yang PAML 4: phylogenetic analysis by maximum likelihood. , 2007, Molecular biology and evolution.

[4]  R. Shamir,et al.  A fast algorithm for joint reconstruction of ancestral amino acid sequences. , 2000, Molecular biology and evolution.

[5]  K. Verstrepen,et al.  Reconstruction of Ancestral Metabolic Enzymes Reveals Molecular Mechanisms Underlying Evolutionary Innovation through Gene Duplication , 2012, PLoS biology.

[6]  M. Nei,et al.  A new method of inference of ancestral nucleotide and amino acid sequences. , 1995, Genetics.

[7]  Eric S. Lander,et al.  Sequencing the nuclear genome of the extinct woolly mammoth , 2008, Nature.

[8]  Martin L. West,et al.  Textual criticism and editorial technique applicable to Greek and Latin texts , 1973 .

[9]  Caroline Macé,et al.  Beyond the tree of texts: Building an empirical model of scribal variation through graph analysis of texts and stemmata , 2013, Lit. Linguistic Comput..

[10]  M. V. Mulken,et al.  Studies in Stemmatology , 1996 .

[11]  C. Randal Linder,et al.  Benchmark datasets and software for developing and testing methods for large-scale multiple sequence alignment and phylogenetic inference , 2010, PLoS currents.

[12]  Giorgio Pasquali,et al.  Storia della tradizione e critica del testo , 1952 .

[13]  Donovan Anderson,et al.  The Genesis of Lachmann’s Method , 2008 .

[14]  Mihai Albu,et al.  Testing methods on an artificially created textual tradition , 2006 .

[15]  P. Robinson,et al.  Cladistic analysis of an Old Norse manuscript tradition , 1996 .

[16]  Tuomas Heikkilä,et al.  Evaluating methods for computer-assisted stemmatology using artificial benchmark data sets , 2009, Lit. Linguistic Comput..

[17]  Christopher J. Howe,et al.  The phylogeny of The Canterbury Tales , 1998, Nature.

[18]  M. V. Mulken,et al.  Studies in Stemmatology II , 2004 .

[19]  Christopher J. Howe,et al.  Responding to Criticisms of Phylogenetic Methods in Stemmatology , 2012 .

[20]  Joseph Bédier,et al.  La tradition manuscrite du Lai de l'Ombre. Réflexions sur l'art d'éditer les anciens textes (deuxième article) , 1928 .

[21]  U. Chatterjee,et al.  Effect of unconventional feeds on production cost, growth performance and expression of quantitative genes in growing pigs , 2022, Journal of the Indonesian Tropical Animal Agriculture.