Biclustering Readings and Manuscripts via Non-negative Matrix Factorization, with Application to the Text of Jude

The text-critical practice of grouping witnesses into families or texttypes often faces two obstacles: Contamination in the manuscript tradition, and co-dependence in identifying characteristic readings and manuscripts. We introduce non-negative matrix factorization (NMF) as a simple, unsupervised, and efficient way to cluster large numbers of manuscripts and readings simultaneously while summarizing contamination using an easy-to-interpret mixture model. We apply this method to an extensive collation of the New Testament epistle of Jude and show that the resulting clusters correspond to human-identified textual families from existing research.

[1]  Kenneth Keumsang Yoo The Classification of the Greek Manuscripts of 1 Peter with Special Emphasis on Methodology , 2001 .

[2]  E. C. Colwell Is there a Lectionary Text of the Gospels? , 1932, Harvard Theological Review.

[3]  Stephen E. Robertson,et al.  Understanding inverse document frequency: on theoretical arguments for IDF , 2004, J. Documentation.

[4]  W. L. Richards A Critique of a New Testament Text-Critical Methodology: The Claremont Profile Method , 1977 .

[5]  Frederik Wisse The profile method for the classification and evaluation of manuscript evidence : as applied to the continuous Greek text of the Gospel of Luke , 1984 .

[6]  Karthik Devarajan,et al.  Nonnegative Matrix Factorization: An Analytical and Interpretive Tool in Computational Biology , 2008, PLoS Comput. Biol..

[7]  W. L. Richards The Classification of the Greek Manuscripts of the Johannine Epistles , 1977 .

[8]  Guillaume Bouchard,et al.  Fast and Efficient Estimation of Individual Ancestry Coefficients , 2014, Genetics.

[9]  Patrik O. Hoyer,et al.  Non-negative Matrix Factorization with Sparseness Constraints , 2004, J. Mach. Learn. Res..

[10]  B. Ehrman The Use of Group Profiles for the Classification of New Testament Documentary Evidence , 1987 .

[11]  Matthew Spencer,et al.  Representing Multiple Pathways of Textual Flow in the Greek Manuscripts of the Letter of James Using Reduced Median Networks , 2004, Comput. Humanit..

[12]  C. Baldwin The so-called mixed text : an examination of the non-Alexandrian and non-Byzantine text-type in the Catholic Epistles , 2007 .

[13]  C. Niccum The Voice of the Manuscripts on the Silence of Women: The External Evidence for 1 Cor 14.34–5 , 1997, New Testament Studies.

[14]  K. Wachtel 7. Towards a Redefinition of External Criteria: The Role of Coherence in Assessing the Origin of Variants , 2008 .

[15]  E. C. Colwell Genealogical Method: Its Achievements and Its Limitations , 1947 .

[16]  Christos Boutsidis,et al.  SVD based initialization: A head start for nonnegative matrix factorization , 2008, Pattern Recognit..

[17]  Karen Spärck Jones A statistical interpretation of term specificity and its application in retrieval , 2021, J. Documentation.

[18]  H. Houghton Recent Developments in New Testament Textual Criticism , 2011 .

[19]  Marinka Zitnik,et al.  NIMFA: A Python Library for Nonnegative Matrix Factorization , 2012, J. Mach. Learn. Res..

[20]  Jing Zhao,et al.  Document Clustering Based on Nonnegative Sparse Matrix Factorization , 2005, ICNC.

[21]  K. Junack ZU DEN GRIECHISCHEN LEKTIONAREN UND IHRER ÜBERLIEFERUNG DER KATHOLISCHEN BRIEFE , 1972 .

[22]  T. Robertson Relationships among the non-Byzantine manuscripts of 2 Peter / Terry Robertson. , 2001 .

[23]  K. W. Kim Codices 1582, 1739, and Origen , 1950 .

[24]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[25]  Luigi Grippo,et al.  On the convergence of the block nonlinear Gauss-Seidel method under convex constraints , 2000, Oper. Res. Lett..

[26]  S. Sra Nonnegative Matrix Approximation: Algorithms and Applications , 2006 .

[27]  W. J. Hickie The New Testament in the Original Greek , 2022 .

[28]  Chih-Jen Lin,et al.  Projected Gradient Methods for Nonnegative Matrix Factorization , 2007, Neural Computation.

[29]  H. V. Soden Die Schriften des Neuen Testaments in ihrer ältesten erreichbaren Textgestalt , 1902 .

[30]  M. Robinson,et al.  The New Testament in the Original Greek, Byzantine Textform 2005 , 2013 .

[31]  Ernest C. Colwell,et al.  Method in Locating a Newly-Discovered Manuscript , 1969 .

[32]  Gerd Mink,et al.  Problems of a highly contaminated tradition: the New Testament: Stemmata of variants as a source of a genealogy for witnesses , 2004 .