Automatic Stave Discovery for Musical Facsimiles

Lately, there is an increased interest in the analysis of music score facsimiles, aiming at automatic digitization and recognition. Noise, corruption, variations in handwriting, non-standard page layouts and notations are common problems affecting especially the centuries-old manuscripts. Starting from a facsimile, the current state-of-the-art methods binarize the image, detect and group the staff lines, then remove the staff lines and classify the remaining symbols imposing rules and prior knowledge to obtain the final digital representation. The first steps are critical for the performance of the overall system. Here we propose to handle binarization, staff detection and noise removal by means of dynamic programming (DP) formulations. Our main insights are: a) the staves (the 5-groups of staff lines) are represented by repetitive line patterns, are more constrained and informative, and thus we propose direct optimization over such patterns instead of first spotting single staff lines, b) the optimal binarization threshold also is the one giving the maximum evidence for the presence of staves, c) the noise, or background, is given by the regions where there is insufficient stave pattern evidence. We validate our techniques on the CVC-MUSCIMA(2011) staff removal benchmark, achieving the best error rates (1.7%), as well as on various, other handwritten score facsimiles from the Renaissance.

[1]  Henry S. Baird,et al.  A Critical Survey of Music Image Analysis , 1992 .

[2]  Ichiro Fujinaga,et al.  A Comparative Study of Staff Removal Algorithms , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Alicia Fornés,et al.  The ICDAR 2011 Music Scores Competition: Staff Removal and Writer Identification , 2011, 2011 International Conference on Document Analysis and Recognition.

[4]  Hidetoshi Miyao,et al.  Stave Extraction for Printed Music Scores , 2002, IDEAL.

[5]  Gilson A. Giraldi,et al.  Music Score Binarization Based on Domain Knowledge , 2011, IbPRIA.

[6]  Ichiro Fujinaga,et al.  Staff Detection and Removal , 2004 .

[7]  Sylvie Philipp-Foliguet,et al.  Printed music recognition , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[8]  M. Kasiran,et al.  An information framework for a merchant trust agent in electronic commerce , 2002 .

[9]  Jaime S. Cardoso,et al.  Robust Staffline Thickness and Distance Estimation in Binary and Gray-Level Music Scores , 2010, 2010 20th International Conference on Pattern Recognition.

[10]  Alicia Fornés,et al.  CVC-MUSCIMA: a ground truth of handwritten music score images for writer identification and staff removal , 2012, International Journal on Document Analysis and Recognition (IJDAR).

[11]  Carlos Guedes,et al.  Staff Detection with Stable Paths , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Susan Ella George,et al.  Visual Perception of Music Notation: On-Line and Off-Line Recognition , 2004 .

[13]  Umapada Pal,et al.  An Efficient Staff Removal Approach from Printed Musical Documents , 2010, 2010 20th International Conference on Pattern Recognition.

[14]  David Bainbridge,et al.  Dealing with superimposed objects in optical music recognition , 1997 .

[15]  Shijian Lu,et al.  An Effective Staff Detection and Removal Technique for Musical Documents , 2012, 2012 10th IAPR International Workshop on Document Analysis Systems.