An open problem in computational stemmatology - a model for contamination

In this contribution, two open problems in computational stemmatology are being considered. The first one is contamination, an umbrella term referring to all phenomena of admixture of text variants resulting from scribes considering more than one manuscript or even memory when copying a text. This problem is one of the biggest to date in stemmatology since it implies an entirely different formal approach to the reconstruction of the copy history of a tradition and in turn to the reconstruction of an urtext. (Maas 1937) famously stated that there is no remedy against contamination and (Pasquali and Pieraccioni 1952) coined the terms 'open' vs. 'closed' recensions to distinguish contaminated from uncontaminated. We present a graph theoretical model which formally accommodates traditions with any degree of contamination while maintaining a temporal ordering and give combinatorial numbers and formula on the implication for numbers of possible scenarios.

[1]  R. Tarrant Texts, Editors, and Readers: Methods and Problems in Latin Textual Criticism , 2016 .

[2]  Albert B. Lord,et al.  The Singer of Tales , 1961 .

[3]  T. Forshaw Everything you always wanted to know , 1977 .

[4]  Joseph Bédier,et al.  La tradition manuscrite du Lai de l'Ombre. Réflexions sur l'art d'éditer les anciens textes (deuxième article) , 1928 .

[5]  Marina Buzzoni,et al.  Open versus closed recensions (Pasquali): Pros and cons of some methods for computer-assisted stemmatology , 2016, Digit. Scholarsh. Humanit..

[6]  Martin L. West,et al.  Textual criticism and editorial technique applicable to Greek and Latin texts , 1973 .

[7]  S. Schwager,et al.  Mathematical Philology: Entropy Information in Refining Classical Texts' Reconstruction, and Early Philologists' Anticipation of Information Theory , 2010, PloS one.

[8]  Caroline Macé,et al.  Beyond the tree of texts: Building an empirical model of scribal variation through graph analysis of texts and stemmata , 2013, Lit. Linguistic Comput..

[9]  Donovan Anderson,et al.  The Genesis of Lachmann’s Method , 2008 .

[10]  Tandy J. Warnow,et al.  Analyzing the Order of Items in Manuscripts of The Canterbury Tales , 2003, Computers and the Humanities.

[11]  Caroline Macé,et al.  Parvum lexicon stemmatologicum. A brief lexicon of stemmatology , 2015 .

[12]  Fernando Báez,et al.  A Universal History of the Destruction of Books: From Ancient Sumer to Modern Iraq , 2010 .

[13]  Luay Nakhleh,et al.  Phylogenetic networks , 2004 .

[14]  R. Gregory Textual Criticism , 1994, Perception.

[15]  H. Hoenigswald,et al.  12 The Upside-down Cladogram: Problems in Manuscript Affiliation , 1987 .

[16]  N. Saitou,et al.  The neighbor-joining method: a new method for reconstructing phylogenetic trees. , 1987, Molecular biology and evolution.

[17]  Vinton A. Dearing,et al.  Principles and practice of textual analysis , 1977 .

[18]  W. Greg,et al.  The calculus of variants : an essay on textual criticism , 1928 .

[19]  C. A. Trypanis,et al.  The Making of Homeric Verse: The Collected Papers of Milman Parry , 1971 .

[20]  Takeo Yamada,et al.  Listing all the minimum spanning trees in an undirected graph , 2010, Int. J. Comput. Math..

[21]  A. Silvas The Rule of St. Basil in Latin and English: A Revised Critical Edition , 2013 .

[22]  Tuomas Heikkilä,et al.  Evaluating methods for computer-assisted stemmatology using artificial benchmark data sets , 2009, Lit. Linguistic Comput..

[23]  Odd Einar Haugen The silva portentosa of stemmatology: Bifurcation in the recension of Old Norse manuscripts , 2016, Digit. Scholarsh. Humanit..

[24]  A. A. den Hollander How shock waves revealed successive contamination: A cardiogram of early sixteenth-century printed Dutch Bibles , 2004 .

[25]  Giorgio Pasquali,et al.  Storia della tradizione e critica del testo , 1952 .

[26]  J. Foley How to read an oral poem , 2002 .

[27]  Manuel Bodirsky,et al.  Generating Labeled Planar Graphs Uniformly at Random , 2003, ICALP.

[28]  H. Quentin Essais de critique textuelle (ecdotique) , 2022 .

[29]  Armin Hoenen Tools, evaluation and preprocessing for stemmatology , 2018 .

[30]  Ali Pinar,et al.  On Clustering on Graphs with Multiple Edge Types , 2011, Internet Math..

[31]  Armin Hoenen From Manuscripts to Archetypes through Iterative Clustering , 2018, LREC.

[32]  P. Wright Counting and Constructing Minimal Spanning Trees , 2000 .

[33]  A. Cayley A theorem on trees , 2009 .

[34]  A. Dress,et al.  Split decomposition: a new and useful approach to phylogenetic analysis of distance data. , 1992, Molecular phylogenetics and evolution.

[35]  Armin Hoenen,et al.  How Many Stemmata with Root Degree k? , 2017, MOL.