Handwritten Musical Document Retrieval Using Music-Score Spotting

In this paper, we present a novel approach for retrieval of handwritten musical documents using a query sequence/word of musical scores. In our algorithm, the musical score-words are described as sequences of symbols generated from a universal codebook vocabulary of musical scores. Staff lines are removed first from musical documents using structural analysis of staff lines and symbol codebook vocabulary is created in offline. Next, using this symbol codebook the music symbol information in each document image is encoded. Given a query sequence of musical symbols in a musical score-line, the symbols in the query are searched in each of these encoded documents. Finally, a sub-string matching algorithm is applied to find query words. For codebook, two different feature extraction methods namely: Zernike Moments and 400 dimensional gradient features are tested and two unsupervised classifiers using SOM and K-Mean are evaluated. The results are compared with a baseline approach of DTW. The performance is measured on a collection of handwritten musical documents and results are promising.

[1]  Ichiro Fujinaga,et al.  A Comparative Study of Staff Removal Algorithms , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Henry S. Baird,et al.  A Critical Survey of Music Image Analysis , 1992 .

[3]  Shan Li,et al.  Complex Zernike Moments Features for Shape-Based Image Retrieval , 2009, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[4]  Tetsushi Wakabayashi,et al.  Handwritten Numeral Recognition of Six Popular Indian Scripts , 2007 .

[5]  Kazuhiko Yamamoto,et al.  Structured Document Image Analysis , 1992, Springer Berlin Heidelberg.

[6]  J. W. Roach,et al.  Using domain knowledge in low-level visual processing to interpret handwritten music: An experiment , 1988, Pattern Recognit..

[7]  David S. Doermann,et al.  On musical score recognition using probabilistic reasoning , 1999, Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318).

[8]  Alicia Fornés,et al.  CVC-MUSCIMA: a ground truth of handwritten music score images for writer identification and staff removal , 2012, International Journal on Document Analysis and Recognition (IJDAR).

[9]  Alicia Fornés,et al.  A combination of features for symbol-independent writer identification in old music scores , 2010, International Journal on Document Analysis and Recognition (IJDAR).

[10]  R. Manmatha,et al.  Features for word spotting in historical manuscripts , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[11]  Alicia Fornés,et al.  The ICDAR 2011 Music Scores Competition: Staff Removal and Writer Identification , 2011, 2011 International Conference on Document Analysis and Recognition.

[12]  Jean-Yves Ramel,et al.  Word Retrieval in Historical Document Using Character-Primitives , 2011, 2011 International Conference on Document Analysis and Recognition.