Writer identification in handwritten musical scores with bags of notes

Writer identification is an important task in the automatic processing of documents, yet identifying the writer of a graphical document remains challenging. In this work, we adapt the Bag of Visual Words framework to writer identification in handwritten musical scores. A vanilla implementation of this method already performs comparably to the state of the art. We then analyze two improvements to the representation: a Bhattacharyya embedding, which improves the results at virtually no extra cost, and a Fisher Vector representation, which improves the results substantially at the price of a more complex and costly representation. Experimental evaluation on a new, challenging dataset shows results more than 20 points above the state of the art.
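As an illustration of the first two ideas in the abstract, the following is a minimal sketch of a Bag of Visual Words histogram followed by a Bhattacharyya embedding, here taken to be the element-wise square root of the L1-normalized histogram, so that a plain dot product between two embedded histograms equals their Bhattacharyya coefficient. The abstract does not specify the local descriptors, the codebook size, or the classifier, so the random descriptors, the 8-word codebook, and the function names below are all assumptions made for illustration only.

```python
import numpy as np

def bovw_histogram(descriptors, codebook):
    """Hard-assign each local descriptor to its nearest codeword and count."""
    # Squared Euclidean distances between descriptors (N, D) and codewords (K, D).
    d2 = ((descriptors[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    nearest = d2.argmin(axis=1)
    hist = np.bincount(nearest, minlength=codebook.shape[0]).astype(float)
    return hist / hist.sum()  # L1 normalization

def bhattacharyya_embedding(hist):
    """Element-wise square root of an L1-normalized histogram."""
    return np.sqrt(hist)

# Hypothetical data: random vectors stand in for real local descriptors.
rng = np.random.default_rng(0)
codebook = rng.normal(size=(8, 16))   # assumed: 8 visual words, 16-D descriptors
page_a = rng.normal(size=(200, 16))   # assumed: descriptors from one score page
page_b = rng.normal(size=(150, 16))   # assumed: descriptors from another page

h_a = bovw_histogram(page_a, codebook)
h_b = bovw_histogram(page_b, codebook)
e_a = bhattacharyya_embedding(h_a)
e_b = bhattacharyya_embedding(h_b)

# The linear kernel on the embedded vectors is the Bhattacharyya coefficient.
similarity = float(e_a @ e_b)
print(similarity)
```

Because the embedding is a single square root per histogram bin, a linear classifier can be trained on the embedded vectors without changing anything else in the pipeline, which is consistent with the "virtually no extra cost" remark in the abstract.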
