Lyric Extraction and Recognition on Digital Images of Early Music Sources

Optical music recognition (OMR) is one of the most promising tools for generating large-scale, distributable libraries of musical data. Much OMR work has focussed on instrumental music, avoiding a special challenge vocal music poses for OMR: lyric recognition. Lyrics complicate the page layout, making it more difficult to identify the regions of the page that carry musical notation. Furthermore, users expect a complete OMR process for vocal music to include recognition of the lyrics, reunification of syllables when they have been separated, and alignment of these lyrics with the recognised music. Unusual layouts and inconsistent practises for syllabification, however, make lyric recognition more challenging than traditional optical character recognition (OCR). This paper surveys historical approaches to lyric recognition, outlines open challenges, and presents a new approach to extracting text lines in medieval manuscripts, one of the frontiers of OMR research today.

[1]  Bernadette Dorizzi,et al.  A Hidden Markov Models combination framework for handwriting recognition , 2003, Document Analysis and Recognition.

[2]  Tim DiLauro,et al.  Enhancing access to the levy sheet music collection: reconstructing full-text lyrics from syllables , 2002, JCDL '02.

[3]  Michael Droettboom,et al.  Beyond transcription : Case studies in special document analysis requirements , 2003 .

[4]  Laurent Pugin,et al.  Optical Music Recognition of Early Typographic Prints using Hidden Markov Models , 2006 .

[5]  Klaus D. Tönnies,et al.  Line detection and segmentation in historical church registers , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[6]  Ichiro Fujinaga,et al.  A Robust Border Detection Algorithm with Application to Medieval Music Manuscripts , 2009, ICMC.

[7]  Samy Bengio,et al.  Offline recognition of unconstrained handwritten texts using HMMs and statistical language models , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Ichiro Fujinaga,et al.  Strike Up the Score: Deriving Searchable and Playable Digital Formats from Sheet Music , 2001, D Lib Mag..

[9]  João Rogério Caldas Pinto,et al.  Ancient Music Recovery for Digital Libraries , 2000, ECDL.

[10]  Frank Kurth,et al.  The Probado Music Repository at the Bavarian State Library , 2007, ISMIR.

[11]  Julia Craig-McFeely Digital Image Archive of Medieval Music: The evolution of a digital resource , 2008 .

[12]  Laurent Pugin,et al.  Optical Music Recognitoin of Early Typographic Prints using Hidden Markov Models , 2006, ISMIR.

[13]  Zhixin Shi,et al.  A natural learning algorithm based on Hough transform for text lines extraction in handwritten documents , 1999 .

[14]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[15]  Ehud Rivlin,et al.  Offline cursive script word recognition – a survey , 1999, International Journal on Document Analysis and Recognition.

[16]  Laurence Likforman-Sulem,et al.  Text line segmentation of historical documents: a survey , 2007, International Journal of Document Analysis and Recognition (IJDAR).

[17]  Norbert Fuhr,et al.  Retrieval in text collections with historic spelling using linguistic and spelling variants , 2007, JCDL '07.