Harmonizing and publishing heterogeneous premodern manuscript metadata as Linked Open Data

Manuscripts are a crucial form of evidence for research into all aspects of premodern European history and culture, and there are numerous databases devoted to describing them in detail. This descriptive information, however, is typically available only in separate data silos based on incompatible data models and user interfaces. As a result, it has been difficult to study manuscripts comprehensively across these various platforms. To address this challenge, a team of manuscript scholars and computer scientists worked to create “Mapping Manuscript Migrations” (MMM), a semantic portal, and a Linked Open Data service. MMM stands as a successful proof of concept for integrating distinct manuscript datasets into a shared platform for research and discovery with the potential for future expansion. This paper will discuss the major products of the MMM project: a unified data model, a repeatable data transformation pipeline, a Linked Open Data knowledge graph, and a Semantic Web portal. It will also examine the crucial importance of an iterative process of multidisciplinary collaboration embedded throughout the project, enabling humanities researchers to shape the development of a digital platform and tools, while also enabling the same researchers to ask more sophisticated and comprehensive research questions of the aggregated data.

[1]  Florian Berger,et al.  Generating Structured Data by Nontechnical Experts in Research Settings , 2018, i-com.

[2]  Hanno Wijsman The Bibale Database at the IRHT: A Digital Tool for Researching Manuscript Provenance , 2016 .

[3]  Toby Burrows Connecting Medieval and Renaissance Manuscript Collections , 2018 .

[4]  Colin Clark,et al.  ZEN AND THE ART , 2009 .

[6]  S. W. Holman A challenge. , 1955, Medical technicians bulletin.

[7]  M. Kestemont,et al.  The Digital Middle Ages: An Introduction , 2017, Speculum.

[8]  W. P. Stoneman Titulus. Identifying Medieval Latin Texts: An Evidence-Based Approach , 2005 .

[9]  David Lewis,et al.  Mapping Manuscript Migrations Knowledge Graph: Data for Tracing the History and Provenance of Medieval and Renaissance Manuscripts , 2020 .

[10]  Stefanie Gehrke,et al.  Biblissima's Prototype on Medieval Manuscript Illuminations and their Context , 2015, SW4SHD@ESWC.

[11]  Daniel Tunkelang,et al.  Faceted Search , 2009, Synthesis Lectures on Information Concepts, Retrieval, and Services.

[12]  Eero Hyvönen,et al.  A Linked Open Data Service and Portal for Pre-modern Manuscript Research , 2019, DHN.

[13]  Martin Doerr,et al.  FRBRoo: Enabling a Common View of Information from Memory Institutions , 2009 .

[14]  Martin Doerr,et al.  The CIDOC Conceptual Reference Module: An Ontological Approach to Semantic Interoperability of Metadata , 2003, AI Mag..

[15]  Yitzchak Miller,et al.  Ontology‐based analysis of the large collection of historical Hebrew manuscripts , 2018 .

[16]  Frank van Harmelen,et al.  Semantic technologies for historical research: A survey , 2014, Semantic Web.

[17]  Tom Heath,et al.  Linked Data: Evolving the Web into a Global Data Space , 2011, Linked Data.

[18]  S. Peroni,et al.  Development of an ontology for modelling medieval manuscripts: the case of Progetto IRNERIO , 2020 .

[19]  Eero Hyvönen,et al.  Sampo-UI: A full stack JavaScript framework for developing semantic portal user interfaces , 2021, Semantic Web.

[20]  Kai Eckert,et al.  DM2E: A Linked Data source of Digitised Manuscripts for the Digital Humanities , 2017, Semantic Web.

[21]  György Fazekas,et al.  Realising a Layered Digital Library: Exploration and Analysis of the Live Music Archive through Linked Data , 2017, 2017 ACM/IEEE Joint Conference on Digital Libraries (JCDL).

[22]  Eero Hyvönen,et al.  Preventing ontology interoperability problems instead of solving them , 2010, Semantic Web.

[23]  H. Hotson,et al.  Reassembling the Republic of Letters in the Digital Age , 2019 .

[24]  Birthe Christensen Care and Conservation of Manuscripts 10 – proceedings of the tenth international seminar held at the University of Copenhagen , 2010 .

[25]  Gary Marchionini,et al.  Exploratory search , 2006, Commun. ACM.

[26]  Patrick Le Bœuf,et al.  Modeling Rare and Unique Documents: Using FRBROO/CIDOC CRM , 2012 .

[27]  Eero Hyvönen,et al.  Linked Data Finland: A 7-star Model and Platform for Publishing and Re-using Linked Datasets , 2014, ESWC.

[28]  Jouni Tuominen,et al.  Linked Open Data Vocabularies and Identifiers for Medieval Studies , 2019, DHN.

[29]  Anna Bellotto,et al.  Medieval manuscript descriptions and the Semantic Web: analysing the impact of CIDOC CRM on Italian codicological-paleographical data , 2020, Digit. Humanit. Q..

[30]  Martin Doerr,et al.  Implementing the CIDOC Conceptual Reference Model in RDF , 2021 .