Medieval manuscript layout model

Medieval manuscript layouts are quite complex. Additionally to their main text flow, which can spread over one or several columns, such manuscripts contain also other textual elements such as insertions, annotations, and corrections. They are often richly decorated with ornaments, illustrations, and drop capitals making their layout even more complex. In this paper we propose a generic layout model to represent their physical structure. To achieve this goal we propose to use four layers in order to distinguish between the different graphical elements. In this paper we show how this model is used to represent automatic segmentation results and how it allows a quantitative measure of their accuracy.

[1]  R. Manmatha,et al.  Word spotting for historical documents , 2007, International Journal of Document Analysis and Recognition (IJDAR).

[2]  Laurence Likforman-Sulem,et al.  Text line segmentation of historical documents: a survey , 2007, International Journal of Document Analysis and Recognition (IJDAR).

[3]  Volkmar Frinken,et al.  Automatic Transcription of Handwritten Medieval Documents , 2009, 2009 15th International Conference on Virtual Systems and Multimedia.

[4]  Jean-Yves Ramel,et al.  AGORA: the interactive document image analysis tool of the BVH project , 2006, Second International Conference on Document Image Analysis for Libraries (DIAL'06).

[5]  Jean-Yves Ramel,et al.  User-driven page layout analysis of historical printed books , 2007, International Journal of Document Analysis and Recognition (IJDAR).

[6]  Simone Calderara,et al.  "Inside the bible": segmentation, annotation and retrieval for a new browsing experience , 2008, MIR '08.

[7]  Lambert Schomaker,et al.  Layout Analysis of Handwritten Historical Documents for Searching the Archive of the Cabinet of the Dutch Queen , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[8]  Josep Lladós,et al.  HistoSketch: A Semi-Automatic Annotation Tool for Archival Documents , 2008, 2008 The Eighth IAPR International Workshop on Document Analysis Systems.

[9]  Philippe Régnier,et al.  Extraction automatisée de lignes et de fragments textuels dans les images de manuscrits d'auteur du XIXe siècle , 2009 .

[10]  Frank Lebourgeois,et al.  DEBORA: Digital AccEss to BOoks of the RenAissance , 2006, International Journal of Document Analysis and Recognition (IJDAR).