A Framework for Managing Multimodal Digitized Music Collections

In this paper, we present a framework for managing heterogeneous, multimodal digitized music collections containing visual music representations (scanned sheet music) as well as acoustic music material (audio recordings). As a first contribution, we propose a preprocessing workflow comprising feature extraction, audio indexing, and music synchronization (linking the visual with the acoustic data). Then, as a second contribution, we introduce novel user interfaces for multimodal music presentation, navigation, and content-based retrieval. In particular, our system offers high quality audio playback with time-synchronous display of the digitized sheet music. Furthermore, our system allows a user to select regions within the scanned pages of a musical score in order to search for musically similar sections within the audio documents. Our novel user interfaces and search functionalities will be integrated into the library service system of the Bavarian State Library as part of the Probado project.

[1]  J. Adachi,et al.  Retrieval methods for English-text with missrecognized OCR characters , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[2]  Hans-Jürgen Appelrath,et al.  PROBADO - A Generic Repository Integration Framework , 2007, ECDL.

[3]  Stuart Macdonald,et al.  User Engagement in Research Data Curation , 2009, ECDL.

[4]  Meinard Müller,et al.  Automatic synchronization of music data in score-, MIDI- and PCM-format , 2003, ISMIR.

[5]  Donald Byrd,et al.  Prospects for Improving OMR with Multiple Recognizers , 2006, ISMIR.

[6]  Ian H. Witten,et al.  Managing gigabytes 2nd edition , 1999 .

[7]  D. Blostein,et al.  Handbook on Optical Character Recognition and Document Image Analysis, Pp. 000-000 Recognition of Mathematical Notation * , 1996 .

[8]  Jenn Riley,et al.  Variations2: retrieving and using music in an academic setting , 2006, CACM.

[9]  Horst Bunke,et al.  Handbook of Character Recognition and Document Image Analysis , 1997 .

[10]  Gregory H. Wakefield,et al.  Audio thumbnailing of popular music using chroma-based representations , 2005, IEEE Transactions on Multimedia.

[11]  Meinard Müller,et al.  Automated Synchronization of Scanned Sheet Music with Audio Recordings , 2007, ISMIR.

[12]  Meinard Müller,et al.  Efficient Index-Based Audio Matching , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[13]  Ichiro Fujinaga,et al.  Optical Music Recognition System within a Large-Scale Digitization Project , 2000, ISMIR.

[14]  Frank Kurth,et al.  The Probado Music Repository at the Bavarian State Library , 2007, ISMIR.

[15]  Meinard Müller,et al.  Information retrieval for music and motion , 2007 .

[16]  W. Bruce Croft,et al.  Probabilistic Retrieval of OCR Degraded Text Using N-Grams , 1997, ECDL.

[17]  Patrick Le Boeuf,et al.  Functional Requirements for Bibliographic Records , 2005 .

[18]  George Tzanetakis,et al.  Polyphonic audio matching and alignment for music retrieval , 2003, 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (IEEE Cat. No.03TH8684).