A Musical Progression with Greenstone: How Music Content Analysis and Linked Data is Helping Redefine the Boundaries to a Music Digital Library

Despite the recasting of the web's technical capabilities through Web 2.0, conventional digital library software architectures---from which many of our leading Music Digital Libraries (MDLs) are formed---result in digital resources that are, surprisingly, disconnected from other online sources of information, and embody a "read-only" mindset. Leveraging from Music Information Retrieval (MIR) techniques and Linked Open Data (LOD), in this paper we demonstrate a new form of music digital library that encompasses management, discovery, delivery, and analysis of the musical content it contains. Utilizing open source tools such as Greenstone, audioDB, Meandre, and Apache Jena we present a series of transformations to a musical digital library sourced from audio files that steadily increases the level of support provided to the user for musicological study. While the seed for this work was motivated by better supporting musicologists in a digital library, the developed software architecture alters the boundaries to what is conventionally thought of as a digital library---and in doing so challenges core assumptions made in mainstream digital library software design.

[1]  Emilia Gómez,et al.  Tonality Visualization of Polyphonic audio , 2005, ICMC.

[2]  Ian H. Witten,et al.  Stress-Testing General Purpose Digital Library Software , 2009, ECDL.

[3]  J. Stephen Downie,et al.  How People Describe Their Music Information Needs: A Grounded Theory Analysis Of Music Queries , 2003 .

[4]  Marcel Worring,et al.  Where Is the User in Multimedia Retrieval? , 2012, IEEE Multim..

[5]  Xiao Hu,et al.  Evaluation of Music Information Retrieval: Towards a User-Centered Approach , 2010 .

[6]  David Tcheng,et al.  A general approach to data-intensive computing using the Meandre component-based framework , 2010, Wands '10.

[7]  Masataka Goto,et al.  Multimodal Music Processing (Dagstuhl Seminar 11041) , 2011, Dagstuhl Reports.

[8]  Panos Constantopoulos,et al.  Research and Advanced Technology for Digital Libraries , 2001, Lecture Notes in Computer Science.

[9]  Brian Christopher Smith,et al.  Query by humming: musical information retrieval in an audio database , 1995, MULTIMEDIA '95.

[10]  George Buchanan,et al.  A new framework for building digital library collections , 2005, Proceedings of the 5th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL '05).

[11]  Ian H. Witten,et al.  How to Build a Digital Library, Second Edition , 2009 .

[12]  Meinard Müller,et al.  Information retrieval for music and motion , 2007 .

[13]  Katherine M. Wisser,et al.  Library Resources & Technical Services , 2003 .

[14]  Christopher Strachey,et al.  Fundamental Concepts in Programming Languages , 2000, High. Order Symb. Comput..

[15]  George Tzanetakis,et al.  A comparative evaluation of search techniques for query-by-humming using the MUSART testbed , 2007 .

[16]  Markus Schedl,et al.  The neglected user in music information retrieval research , 2013, Journal of Intelligent Information Systems.

[17]  Michael A. Casey,et al.  Investigating Music Collections at Different Scales with AudioDB , 2010 .

[18]  C. Lagoze,et al.  The making of the Open Archives Initiative Protocol for Metadata Harvesting , 2003 .

[19]  Ian H. Witten,et al.  The Greenstone Digital Library Software , 2009, Handbook of Research on Digital Libraries.

[20]  George Buchanan,et al.  Dynamic Digital Library Construction and Configuration , 2004, ECDL.

[21]  Casey A. Mullin International Music Score Library Project/Petrucci Music Library (review) , 2010 .

[22]  Sandra Payette,et al.  The Fedora Project: An Open-source Digital Object Repository Management System , 2003, D Lib Mag..

[23]  David Bainbridge,et al.  AUTOMATIC READING OF MUSIC NOTATION , 1997 .

[24]  Steffen Pauws,et al.  CubyHum: a fully operational "query by humming" system , 2002, ISMIR.

[25]  David Bainbridge MELDEX: A web-based melodic index service , 1998 .

[26]  Brewster Kahle,et al.  Preserving the Internet , 1997 .

[27]  MacKenzie Smith,et al.  DSpace: An Open Source Dynamic Digital Repository , 2003, D Lib Mag..

[28]  Ian H. Witten,et al.  Managing Gigabytes: Compressing and Indexing Documents and Images , 1999 .

[29]  David Heckerman,et al.  Empirical Analysis of Predictive Algorithms for Collaborative Filtering , 1998, UAI.

[30]  Jordan B. L. Smith,et al.  Design and creation of a large-scale database of structural annotations , 2011, ISMIR.

[31]  Ian H. Witten,et al.  Managing gigabytes (2nd ed.): compressing and indexing documents and images , 1999 .

[32]  David Bainbridge,et al.  A Musical Web Mining and Audio Feature Extraction Extension to The Greenstone Digital Library Software , 2011, ISMIR.

[33]  Carlos Guedes,et al.  Optical music recognition: state-of-the-art and open issues , 2012, International Journal of Multimedia Information Retrieval.

[34]  Annika Hinze,et al.  Tipple: location-triggered mobile access to a digital library for audio books , 2013, JCDL '13.

[35]  Avery Wang,et al.  An Industrial Strength Audio Search Algorithm , 2003, ISMIR.

[36]  Ian H. Witten,et al.  How to Build a Digital Library , 2002 .

[37]  Horst Bunke,et al.  Handbook of Character Recognition and Document Image Analysis , 1997 .

[38]  Jonathan Foote,et al.  Audio Retrieval by Rhythmic Similarity , 2002, ISMIR.