Human-Interactive Optical Music Recognition

We propose a human-driven Optical Music Recognition (OMR) system that creates symbolic music data from common Western notation scores. Despite decades of development, OMR still remains largely unsolved as state-ofthe-art automatic systems are unable to give reliable and useful results on a wide range of documents. For this reason our system, Ceres, combines human input and machine recognition to efficiently generate high-quality symbolic data. We propose a scheme for human-in-the-loop recognition allowing the user to constrain the recognition in two ways. The human actions allow the user to impose either a pixel labeling or model constraint, while the system rerecognizes subject to these constraints. We present evaluation based on different users’ log data using both Ceres and Sibelius software to produce the same music documents. We conclude that our system shows promise for transcribing complicated music scores with high accuracy.

[1]  Ichiro Fujinaga Exemplar-based Learning in Adaptive Optical Music Recognition System , 1996, ICMC.

[2]  Kim C. Ng,et al.  Recognition and reconstruction of primitives in music scores , 1996, Image Vis. Comput..

[3]  Rainer Stiefelhagen,et al.  Computers in the Human Interaction Loop , 2009, Human-Computer Interaction Series.

[4]  Ichiro Fujinaga,et al.  Optical Music Recognition System within a Large-Scale Digitization Project , 2000, ISMIR.

[5]  Gautham J. Mysore,et al.  Source Separation of Polyphonic Music with Interactive User-Feedback on a Piano Roll Display , 2013, ISMIR.

[6]  Henry S. Baird,et al.  A Critical Survey of Music Image Analysis , 1992 .

[7]  Christopher Raphael,et al.  Interpreting Rhythm in Optical Music Recognition , 2012, ISMIR.

[8]  David A. Maltz,et al.  Markov source model for printed music decoding , 1995, Electronic Imaging.

[9]  Donald Byrd,et al.  Prospects for Improving OMR with Multiple Recognizers , 2006, ISMIR.

[10]  Mabry Tyson,et al.  A Mixed-initiative Framework for Robust Plan Sketching , 2003, ICAPS.

[11]  Isabelle Bloch,et al.  Robust and Adaptive OMR System Including Fuzzy Modeling, Fusion of Musical Rules, and Possible Error Detection , 2007, EURASIP J. Adv. Signal Process..

[12]  Pietro Perona,et al.  Visual Recognition with Humans in the Loop , 2010, ECCV.

[13]  Vladimir Viro,et al.  Peachnote: Music Score Search and Analysis Platform , 2011, ISMIR.

[14]  Bertrand Coüasnon,et al.  Using a Grammar for a Reliable Full Score Recognition System , 1995, ICMC.

[15]  Ivan Bruno,et al.  Optical Music Imaging: Music Document Digitisation, Recognition, Evaluation, and Restoration , 2008 .

[16]  Pierfrancesco Bellini,et al.  Assessing Optical Music Recognition Tools , 2007, Computer Music Journal.

[17]  Alejandro Héctor Toselli,et al.  Computer Assisted Transcription for Ancient Text Images , 2007, ICIAR.

[18]  Jaime S. Cardoso,et al.  Optical recognition of music symbols , 2010, International Journal on Document Analysis and Recognition (IJDAR).

[19]  Dorothea Blostein,et al.  A Graph-Rewriting Paradigm for Discrete Relaxation: Application to Sheet-Music Recognition , 1998, Int. J. Pattern Recognit. Artif. Intell..

[20]  Christopher Raphael,et al.  New Approaches to Optical Music Recognition , 2011, ISMIR.

[21]  Ichiro Fujinaga,et al.  USING HIDDEN MARKOV MODELS , 2007 .

[22]  Manuela M. Veloso,et al.  Supporting Combined Human and Machine Planning: An Interface for Planning by Analogical Reasoning , 1997, ICCBR.

[23]  Ichiro Fujinaga Optical music recognition system which learns , 1993, Other Conferences.

[24]  Dan Klein,et al.  Improved Typesetting Models for Historical OCR , 2014, ACL.

[25]  Liang Chen,et al.  Human-Directed Optical Music Recognition , 2016, Document Recognition and Retrieval.