Semantic enrichment for enhancing LAM data and supporting digital humanities. Review article

With the rapid development of the digital humanities (DH) field, demand for historical and cultural heritage data has generated deep interest in the data provided by libraries, archives, and museums (LAMs). To enhance the quality and discoverability of LAM data while enabling a self-sustaining ecosystem, “semantic enrichment” has become a strategy increasingly used by LAMs in recent years. This article introduces a number of semantic enrichment methods and efforts that can be applied to LAM data at various levels, aiming to support deeper and wider exploration and use of LAM data in DH research. The real cases, research projects, experiments, and pilot studies shared in this article demonstrate the broad potential of LAM data, whether structured, semi-structured, or unstructured, and regardless of the type of original artifact that carries the data. Following their roadmaps would encourage more effective initiatives and strengthen the effort to maximize the discoverability, usability, and reusability of LAM data, and their value in the mainstream of DH and the Semantic Web.
