Realising a Layered Digital Library: Exploration and Analysis of the Live Music Archive through Linked Data

Building upon a collection with functionality for discovery and analysis has been described by Lynch as a 'layered' approach to digital libraries. Meanwhile, as digital corpora have grown in size, their analysis is necessarily supplemented by automated application of computational methods, which can create layers of information as intricate and complex as those within the content itself. This combination of layers - aggregating homogeneous collections, specialised analyses, and new observations - requires a flexible approach to systems implementation which enables pathways through the layers via common points of understanding, while simultaneously accommodating the emergence of previously unforeseen layers. In this paper we follow a Linked Data approach to build a layered digital library based on content from the Internet Archive Live Music Archive. Starting from the recorded audio and basic information in the Archive, we first deploy a layer of catalogue metadata which allows an initial - if imperfect - consolidation of performer, song, and venue information. A processing layer extracts audio features from the original recordings and captures workflow provenance and summary feature metadata. A further analysis layer provides tools for the user to combine audio and feature data, discovered and reconciled using interlinked catalogue and feature metadata from the layers below. Finally, we demonstrate the feasibility of the system through an investigation of 'key typicality' across performances. This highlights the need to incorporate robustness to inevitable 'imperfections' when undertaking scholarship within the digital library, whether these arise from mislabelling, poor quality audio, or intrinsic limitations of computational methods. We do so not on the assumption that a 'perfect' version can be reached, but in the belief that a key benefit of a layered approach is to allow accurate representations of information to be discovered, combined, and investigated for informed interpretation.
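The abstract does not define 'key typicality', so the following is only a minimal sketch of one possible reading: for a single song, take the key estimated most often across its performances and report the share of performances that agree with it. The function name, the key labels, and the sample data are all hypothetical and are not taken from the paper or from the Live Music Archive.

```python
from collections import Counter


def key_typicality(estimated_keys):
    """Return (most common key, share of performances estimated in that key).

    Assumed reading of 'key typicality', not the paper's definition.
    `estimated_keys` holds one key label per performance of a single song.
    """
    if not estimated_keys:
        raise ValueError("need at least one key estimate")
    counts = Counter(estimated_keys)
    typical_key, n = counts.most_common(1)[0]
    return typical_key, n / len(estimated_keys)


# Hypothetical key estimates for five performances of one song.
performances = ["A major", "A major", "E major", "A major", "A minor"]
typical_key, share = key_typicality(performances)
print(typical_key, share)  # A major 0.6
```

Under this reading, a low share could indicate genuine variation between performances, but it could equally reflect the mislabelling, poor audio, or estimator limitations that the abstract warns about, so the number calls for interpretation rather than acceptance at face value.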

[1]  Michalis Stefanidakis,et al.  Linked Data URIs and Libraries: The Story So Far , 2015, D Lib Mag.

[2]  David De Roure.  Executable Music Documents , 2014, DLfM '14.

[3]  Getaneh Alemu,et al.  Linked data for libraries: benefits of a conceptual shift from library-specific record structures to RDF-based data models , 2012 .

[4]  Thierry Bertin-Mahieux,et al.  Large-Scale Cover Song Recognition Using the 2D Fourier Transform Magnitude , 2012, ISMIR.

[5]  Timothy W. Cole,et al.  The HathiTrust Research Center Workset Ontology: A Descriptive Framework for Non-Consumptive Research Collections , 2016 .

[6]  J. Stephen Downie,et al.  Enhancing scholarly use of digital libraries: A comparative survey and review of bibliographic metadata ontologies , 2016, 2016 IEEE/ACM Joint Conference on Digital Libraries (JCDL).

[7]  Kevin R. Page,et al.  Explorations in Linked Data practice for early music corpora , 2014, IEEE/ACM Joint Conference on Digital Libraries.

[8]  Alan J. Dix,et al.  In Collaboration with In Concert: Reflecting a Digital Library as Linked Data for Performance Ephemera , 2016, DLfM.

[9]  J. Stephen Downie,et al.  Dynamic classification explorer for music digital libraries , 2008, JCDL '08.

[10]  Thierry Bertin-Mahieux,et al.  The Million Song Dataset , 2011, ISMIR.

[11]  Mark B. Sandler,et al.  An Ecosystem for Transparent Music Similarity in an Open World , 2009, ISMIR.

[12]  Egon Willighagen,et al.  Accessing biological data in R with semantic web technologies , 2014 .

[13]  Simon Dixon,et al.  Using Musical Structure to Enhance Automatic Chord Transcription , 2009, ISMIR.

[14]  György Fazekas,et al.  The Effects of Reverberation on Onset Detection Tasks , 2010 .

[15]  Hadley Wickham,et al.  ggplot2 - Elegant Graphics for Data Analysis (2nd Edition) , 2017 .

[16]  Juan Trujillo,et al.  Current state of Linked Data in digital libraries , 2016, J. Inf. Sci.

[17]  David De Roure,et al.  Semantics for Music Analysis through Linked Data: How Country is My Country? , 2010, 2010 IEEE Sixth International Conference on e-Science.

[18]  Mark P. J. van der Loo,et al.  The stringdist Package for Approximate String Matching , 2014, R J.

[19]  Deborah L. McGuinness,et al.  PROV-O: The PROV Ontology , 2013 .

[20]  Ian H. Witten,et al.  Towards the digital music library: tune retrieval from acoustic input , 1996, DL '96.

[21]  Simon Dixon,et al.  Approximate Note Transcription for the Improved Identification of Difficult Chords , 2010, ISMIR.

[22]  Tim Berners-Lee,et al.  Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst.

[23]  Andreas Rauber,et al.  Facilitating Comprehensive Benchmarking Experiments on the Million Song Dataset , 2012, ISMIR.

[24]  Xavier Serra,et al.  AcousticBrainz: A Community Platform for Gathering Music Information Obtained from Audio , 2015, ISMIR.

[25]  J. Stephen Downie,et al.  A Musical Progression with Greenstone: How Music Content Analysis and Linked Data is Helping Redefine the Boundaries to a Music Digital Library , 2014, DLfM '14.

[26]  Clifford A. Lynch.  Digital Collections, Digital Libraries and the Digitization of Cultural Heritage Information , 2002, First Monday.

[27]  Mark d'Inverno,et al.  Linked Data and You: Bringing Music Research Software into the Semantic Web , 2010 .

[28]  Jordan B. L. Smith,et al.  Design and creation of a large-scale database of structural annotations , 2011, ISMIR.

[29]  Mark B. Sandler,et al.  The Music Ontology , 2007, ISMIR.

[30]  David De Roure,et al.  Music and Science: Parallels in Production , 2015, DLfM@JCDL.

[31]  Avery Wang,et al.  An Industrial Strength Audio Search Algorithm , 2003, ISMIR.

[32]  Daniel Wolff,et al.  Big Chord Data Extraction and Mining , 2014 .

[33]  Ichiro Fujinaga,et al.  SALAMI: Structural Analysis of Large Amounts of Music Information , 2010 .

[34]  Ichiro Fujinaga,et al.  Discovering Metadata Inconsistencies , 2010, ISMIR.

[35]  Victor Henning,et al.  Mendeley - A Last.fm For Research? , 2008, 2008 IEEE Fourth International Conference on eScience.

[36]  J. Stephen Downie,et al.  A Comparative Analysis of Bibliographic Ontologies: Implications for Digital Humanities , 2016, DH.

[37]  J. Stephen Downie,et al.  Capturing the workflows of music information retrieval for repeatability and reuse , 2013, Journal of Intelligent Information Systems.

[38]  Angela Kroeger.  The Road to BIBFRAME: The Evolution of the Idea of Bibliographic Transition into a Post-MARC Future , 2013 .

[39]  Sean Bechhofer,et al.  Hello cleveland! Linked data publication of live music archives , 2013, 2013 14th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS).