A Method for Scribe Distinction in Medieval Manuscripts Using Page Layout Features

In palaeography, digital image processing techniques have received increasing attention in recent years, giving rise to a new research field commonly called "digital palaeography". In this field, pattern recognition and feature extraction methods play a key role, providing quantitative evidence to support expert deductions. In this paper, we present a pattern recognition system that addresses a typical palaeographic problem: distinguishing the different scribes who worked together on the transcription of a single medieval book. For a highly standardized book typology (the so-called Latin "Giant Bible"), we wished to verify whether extracting a set of specifically devised features describing the page layout yields satisfactory results. To this end, we also performed a statistical analysis of these features to characterize their discriminant power. Experiments on a large dataset of digital images from the so-called "Avila Bible", a giant Latin copy of the whole Bible produced during the 12th century between Italy and Spain, confirmed the effectiveness of the proposed method.
