Understanding Optical Music Recognition

For over 50 years, researchers have been trying to teach computers to read music notation, referred to as Optical Music Recognition (OMR). However, this field is still difficult to access for new researchers, especially those without a significant musical background: Few introductory materials are available, and, furthermore, the field has struggled with defining itself and building a shared terminology. In this work, we address these shortcomings by (1) providing a robust definition of OMR and its relationship to related fields, (2) analyzing how OMR inverts the music encoding process to recover the musical notation and the musical semantics from documents, and (3) proposing a taxonomy of OMR, with most notably a novel taxonomy of applications. Additionally, we discuss how deep learning affects modern OMR research, as opposed to the traditional pipeline. Based on this work, the reader should be able to attain a basic understanding of OMR: its objectives, its inherent structure, its relationship to other fields, the state of the art, and the research opportunities it affords.

[1]  Rahul Roy,et al.  Graphics Recognition. Current Trends and Evolutions , 2017, Lecture Notes in Computer Science.

[2]  Jakob Grue Simonsen,et al.  Towards a Standard Testbed for Optical Music Recognition: Definitions, Metrics, and Page Images , 2015 .

[3]  Gerhard Widmer,et al.  Towards Full-Pipeline Handwritten OMR with Musical Symbol Detection by U-Nets , 2018, ISMIR.

[4]  Sargur N. Srihari,et al.  On-Line and Off-Line Handwriting Recognition: A Comprehensive Survey , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Ichiro Fujinaga,et al.  Developing an environment for teaching computers to read music , 2018 .

[6]  Ichiro Fujinaga,et al.  Optical music recognition and manuscript chant sources , 2014 .

[7]  Alicia Fornés,et al.  Primitive Segmentation in Old Handwritten Music Scores , 2005, GREC.

[8]  Carlos Guedes,et al.  Optical music recognition: state-of-the-art and open issues , 2012, International Journal of Multimedia Information Retrieval.

[9]  D. Thesis,et al.  A System for Optical Music Recognition and Audio Synthesis , 2014 .

[10]  Alicia Fornés,et al.  On the Use of Textural Features for Writer Identification in Old Handwritten Music Scores , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[11]  S. Schreibman,et al.  A new companion to digital humanities , 2016 .

[12]  Christoph Dalitz,et al.  Optical recognition of psaltic Byzantine chant notation , 2008, International Journal of Document Analysis and Recognition (IJDAR).

[13]  Kimmo Kettunen,et al.  Digitisation and Digital Library Presentation System – Sheet Music to the Mix , 2018 .

[14]  Louis W. G. Barton The NEUMES Project: digital transcription of medieval chant manuscripts , 2002, Second International Conference on Web Delivering of Music, 2002. WEDELMUSIC 2002. Proceedings..

[15]  Kumiko Kushiyama,et al.  Gocen: a handwritten notational interface for musical performance and learning music , 2012, SIGGRAPH '12.

[16]  Alicia Fornés,et al.  Towards the Recognition of Compound Music Notes in Handwritten Music Scores , 2016, 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR).

[17]  Muriel Visani,et al.  DocCreator: A New Software for Creating Synthetic Ground-Truthed Document Images , 2017, J. Imaging.

[18]  Kevin R. Page,et al.  Proceedings of the 4th International Workshop on Digital Libraries for Musicology , 2015 .

[19]  Atul K. Chhabra,et al.  Symbol Recognition : An Overview , 2005 .

[20]  Liang Chen,et al.  Renotation from Optical Music Recognition , 2015, MCM.

[21]  Jorge Calvo-Zaragoza,et al.  End-to-End Neural Optical Music Recognition of Monophonic Scores , 2018 .

[22]  Lippold Haken,et al.  Justification of printed music , 1991, CACM.

[23]  Donald Byrd,et al.  A Music Representation Requirement Specification for Academia , 2003, Computer Music Journal.

[24]  Julián Urbano,et al.  MIREX 2010 SYMBOLIC MELODIC SIMILARITY: LOCAL ALIGNMENT WITH GEOMETRIC REPRESENTATIONS , 2010 .

[25]  Alicia Fornés,et al.  Writer Identification in Old Handwritten Music Scores , 2008, 2008 The Eighth IAPR International Workshop on Document Analysis Systems.

[26]  Luc Van Gool,et al.  Automatic Handwritten Mensural Notation Interpreter: From Manuscript to MIDI Performance , 2015, ISMIR.

[27]  Alicia Fornés,et al.  CVC-MUSCIMA: a ground truth of handwritten music score images for writer identification and staff removal , 2012, International Journal on Document Analysis and Recognition (IJDAR).

[28]  Susan E. George,et al.  Online Pen-Based Recognition of Music Notation with Artificial Neural Networks , 2003, Computer Music Journal.

[29]  Jan Hajič A Case for Intrinsic Evaluation of Optical Music Recognition , 2018 .

[30]  Stephan Baumann A simplified attributed graph grammar for high-level music recognition , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[31]  George Carayannis,et al.  An Optical Notation Recognition System for Printed Music based on template matching and high level reasoning , 2000, RIAO.

[32]  Gerhard Widmer,et al.  Computational Models of Expressive Music Performance: A Comprehensive and Critical Review , 2018, Front. Digit. Humanit..

[33]  Pierfrancesco Bellini,et al.  Optical music sheet segmentation , 2001, Proceedings First International Conference on WEB Delivering of Music. WEDELMUSIC 2001.

[35]  Ichiro Fujinaga,et al.  Enhanced Bleedthrough Correction for Early Music Documents with Recto-Verso Registration , 2008, ISMIR.

[36]  OMRJX: a framework for piano scores optical music recognition , 2011 .

[37]  Alejandro Héctor Toselli,et al.  Probabilistic Music-Symbol Spotting in Handwritten Scores , 2018, 2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR).

[38]  ÁlvaroFrancisco,et al.  An integrated grammar-based approach for mathematical expression recognition , 2016 .

[39]  Martin Blenkle,et al.  Staats- und Universitätsbibliothek Bremen , 2002 .

[40]  Heinz Roggenkemper,et al.  How can Machine Learning make Optical Music Recognition more relevant for practicing musicians , 2018 .

[41]  Thomas Kieninger,et al.  An open approach towards the benchmarking of table structure recognition systems , 2010, DAS '10.

[42]  Roger B. Dannenberg,et al.  Improvised duet interaction: learning improvisation techniques for automatic accompaniment , 2017, NIME.

[43]  Bertrand Coüasnon,et al.  Bootstrapping Samples of Accidentals in Dense Piano Scores for CNN-Based Detection , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).

[44]  Kia Ng,et al.  Big Data Optical Music Recognition with Multi Images and Multi Recognisers , 2014, EVA.

[45]  Pavel Pecina,et al.  Groundtruthing (Not Only) Music Notation with MUSICMarker: A Practical Overview , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).

[46]  Chih-Ta Yen,et al.  Optical music recognition of the singer using formant frequency estimation of vocal fold vibration and lip motion with interpolated GMM classifiers , 2014 .

[47]  Alicja Wieczorkowska,et al.  Music Information Retrieval , 2009, Encyclopedia of Data Warehousing and Mining.

[48]  Soo-Hyung Kim,et al.  Recognition of Music Scores with Non-Linear Distortions in Mobile Devices , 2017, Multimedia Tools and Applications.

[49]  Chakarida Nukoolkit,et al.  Optical Music Recognition on Android Platform , 2012, IAIT 2012.

[50]  Jaroslav Pokorný,et al.  Further Steps Towards a Standard Testbed for Optical Music Recognition , 2016, ISMIR.

[51]  Andrew Hankinson,et al.  Keynote Speech SIMSSA: SINGLE INTERFACE FOR MUSIC SCORE SEARCHING AND ANALYSIS , 2014 .

[52]  Umapada Pal,et al.  HMM-based writer identification in music score documents without staff-line removal , 2017, Expert Syst. Appl..

[53]  Meinard Müller,et al.  Automatic Mapping of Scanned Sheet Music to Audio Recordings , 2008, ISMIR.

[54]  Robert M. Haralick,et al.  Format of Ground Truth Data Used in the Evaluation of the Results of an Optical Music Recognition System , 2019 .

[55]  Jan Hajic,et al.  How current optical music recognition systems are becoming useful for digital libraries , 2018, DLfm.

[56]  Farshad Tajeripour,et al.  A music symbols recognition method using pattern matching along with integrated projection and morphological operation techniques , 2017, Multimedia Tools and Applications.

[57]  Isabelle Bloch,et al.  A fuzzy model for optical recognition of musical scores , 2004, Fuzzy Sets Syst..

[58]  George Heussenstamm The Norton manual of music notation , 1987 .

[59]  Ivan Bruno,et al.  Optical Music Imaging: Music Document Digitisation, Recognition, Evaluation, and Restoration , 2008 .

[60]  Timothy C. Bell,et al.  A music notation construction engine for optical music recognition , 2003, Softw. Pract. Exp..

[61]  Jürgen Schmidhuber,et al.  DeepScores-A Dataset for Segmentation, Detection and Classification of Tiny Objects , 2018, 2018 24th International Conference on Pattern Recognition (ICPR).

[62]  Ewa Lukasik,et al.  Mobile System for Optical Music Recognition and Music Sound Generation , 2015, CISIM.

[63]  Liang Chen,et al.  Human-Guided Recognition of Music Score Images , 2017, DLfM.

[64]  Alejandro Héctor Toselli,et al.  Handwritten Music Recognition for Mensural Notation: Formulation, Data and Baseline Results , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).

[65]  David Bainbridge,et al.  Dealing with superimposed objects in optical music recognition , 1997 .

[66]  Wladyslaw Homenda Automatic recognition of printed music and its conversion into playable music data , 1996 .

[67]  Andreas Dengel,et al.  TRANSFORMING PRINTED PIANO MUSIC INTO MIDI , 1993 .

[68]  Pierfrancesco Bellini,et al.  Assessing Optical Music Recognition Tools , 2007, Computer Music Journal.

[69]  Apurva A. Mehta,et al.  Optical music notes recognition for printed piano music score sheet , 2015, 2015 International Conference on Computer Communication and Informatics (ICCCI).

[70]  Jorge Calvo-Zaragoza,et al.  Camera-PrIMuS: Neural End-to-End Optical Music Recognition on Realistic Monophonic Scores , 2018, ISMIR.

[71]  Ichiro Fujinaga,et al.  Optical Music Recognition System within a Large-Scale Digitization Project , 2000, ISMIR.

[72]  Mandi Goodsett Choral Public Domain Library , 2014 .

[73]  Horst M. Eidenberger,et al.  Handwritten Music Object Detection: Open Issues and Baseline Results , 2018, 2018 13th IAPR International Workshop on Document Analysis Systems (DAS).

[74]  Wazir Zada Khan,et al.  [COMSCAN]: an optical music recognition system , 2009, FIT.

[75]  Ichiro Fujinaga,et al.  Reducing Costs for Digitising Early Music with Dynamic Adaptation , 2007, ECDL.

[76]  Alicia Fornés,et al.  Old Handwritten Musical Symbol Classification by a Dynamic Time Warping Based Method , 2008, GREC.

[77]  Jan Hajic,et al.  A Baseline for General Music Object Detection with Deep Learning , 2018, Applied Sciences.

[78]  Shiliang Zhang,et al.  Watch, attend and parse: An end-to-end neural network based approach to handwritten mathematical expression recognition , 2017, Pattern Recognit..

[79]  Laurent Pugin,et al.  Optical Music Recognitoin of Early Typographic Prints using Hidden Markov Models , 2006, ISMIR.

[80]  Jorge Calvo-Zaragoza,et al.  Staff-line removal with selectional auto-encoders , 2017, Expert Syst. Appl..

[81]  Julia Craig-McFeely,et al.  The art of virtual restoration: creating the digital image archive of medieval music (DIAMM) , 2001 .

[82]  Tara N. Sainath,et al.  State-of-the-Art Speech Recognition with Sequence-to-Sequence Models , 2017, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[83]  Minh-Triet Tran,et al.  Virtual Music Teacher for New Music Learners with Optical Music Recognition , 2015, HCI.

[84]  Liang Chen,et al.  Optical Music Recognition and Human-in-the-loop Computation , 2018 .

[85]  Timothy C. Bell,et al.  The Challenge of Optical Music Recognition , 2001, Comput. Humanit..

[86]  Ichiro Fujinaga,et al.  Introduction to SIMSSA (Single Interface for Music Score Searching and Analysis) , 2014, DLfM '14.

[87]  O. Jenkins,et al.  MusicHand : A Handwritten Music Recognition System , 2005 .

[88]  Juan Ramón Rico-Juan,et al.  HISPAMUS: Handwritten Spanish Music Heritage Preservation by Automatic Transcription , 2018 .

[89]  Peter Jonas,et al.  Scores of scores: an openscore project to encode and share sheet music , 2018, DLfm.

[90]  Lovekesh Vig,et al.  An Efficient End-to-End Neural Model for Handwritten Text Recognition , 2018, BMVC.

[91]  Jan Hajic,et al.  Discussion Group Summary: Optical Music Recognition , 2017, GREC.

[92]  Kia Ng,et al.  Improving OMR for Digital Music Libraries with Multiple Recognisers and Multiple Sources , 2014, DLfM '14.

[93]  Adria Rico Blanes,et al.  Camera-Based Optical Music Recognition Using a Convolutional Neural Network , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).

[94]  Michael Kassler,et al.  Optical Character-Recognition of Printed Music: A Review of Two Dissertations@@@Automatic Recognition of Sheet Music@@@Computer Pattern Recognition of Standard Engraved Music Notation , 1972 .

[95]  Alicia Fornés,et al.  Towards the Alignment of Handwritten Music Scores , 2015, GREC.

[96]  Gueesang Lee,et al.  A Lightweight and Effective Music Score Recognition on Mobile Phones , 2015, J. Inf. Process. Syst..

[97]  Julia Craig-McFeely Digital Image Archive of Medieval Music: The evolution of a digital resource , 2008 .

[98]  David Lewis,et al.  Searching Page-Images of Early Music Scanned with OMR: A Scalable Solution Using Minimal Absent Words , 2018, ISMIR.

[99]  Ichiro Fujinaga,et al.  Automatic Pitch Detection in Printed Square Notation , 2011, ISMIR.

[100]  Ichiro Fujinaga,et al.  Correcting Large-Scale OMR Data with Crowdsourcing , 2014, DLfM '14.

[101]  Ernest Valveny,et al.  Writer identification in handwritten musical scores with bags of notes , 2013, Pattern Recognit..

[102]  José Oncina,et al.  Recognition of Pen-Based Music Notation: The HOMUS Dataset , 2014, 2014 22nd International Conference on Pattern Recognition.

[103]  Xiang Bai,et al.  An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[104]  Jorge Morato,et al.  MIREX 2011 Symbolic Melodic Similarity: Sequence Alignment with Geometric Representations , 2011 .

[105]  Fumitaka Kimura,et al.  Handwritten Musical Document Retrieval Using Music-Score Spotting , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[106]  Michael Good,et al.  Using MusicXML for file interchange , 2003, Proceedings Third International Conference on WEB Delivering of Music.

[107]  Andreas Dengel,et al.  Table Recognition in Heterogeneous Documents Using Machine Learning , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).

[108]  Liang Chen,et al.  MIDI-assisted egocentric optical music recognition , 2016, 2016 IEEE Winter Conference on Applications of Computer Vision (WACV).

[109]  Donna Harman,et al.  Information Retrieval Evaluation , 2011, Synthesis Lectures on Information Concepts, Retrieval, and Services.

[110]  Meinard Müller,et al.  Matching Musical Themes based on noisy OCR and OMR input , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[111]  Jia-Shing Sheu,et al.  An optical music recognition system for traditional Chinese Kunqu Opera scores written in Gong-Che Notation , 2014, EURASIP J. Audio Speech Music. Process..

[112]  Dorothea Blostein,et al.  Recognition of Music Notation: SSPR’90 Working Group Report , 1992 .

[113]  Robert Piéchaud,et al.  STANDARD MUSIC FONT LAYOUT (SMuFL) , 2015 .

[114]  Linn Saxrud Johansen Optical Music Recognition , 2009 .

[115]  Klaus Keil,et al.  Applications of RISM data in digital libraries and digital musicology , 2017, International Journal on Digital Libraries.

[116]  Julián Urbano,et al.  MIREX 2012 Symbolic Melodic Similarity : Hybrid Sequence Alignment with Geometric Representations , 2012 .

[117]  Julián Urbano,et al.  MIREX 2013 Symbolic Melodic Similarity: A Geometric Model supported with Hybrid Sequence Alignment , 2013 .

[118]  Gerhard Widmer,et al.  Learning Audio-Sheet Music Correspondences for Cross-Modal Retrieval and Piece Identification , 2018, Trans. Int. Soc. Music. Inf. Retr..

[119]  Mert Bay,et al.  Evaluation of Multiple-F0 Estimation and Tracking Systems , 2009, ISMIR.

[120]  Eleanor Selfridge-Field,et al.  Beyond MIDI: the handbook of musical codes , 1997 .

[121]  Timothy C. Bell,et al.  Identifying music documents in a collection of images , 2006, ISMIR.

[122]  Susan E. George Wavelets for Dealing with Super-Imposed Objects in Recognition of Music Notation , 2004 .

[123]  Horst M. Eidenberger,et al.  Towards a Universal Music Symbol Classifier , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).

[124]  Daniel P. Lopresti,et al.  Issues in Ground-Truthing Graphic Documents , 2001, GREC.

[125]  Ke Zhang,et al.  Pixel.js: Web-Based Pixel Classification Correction Platform for Ground Truth Creation , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).

[126]  Karen Ullrich,et al.  Optical Music Recognition with Convolutional Sequence-to-Sequence Models , 2017, ISMIR.

[127]  Joan-Andreu Sánchez,et al.  An integrated grammar-based approach for mathematical expression recognition , 2016, Pattern Recognit..

[128]  Nicholas P. Carter A NEW EDITION OF WALTON'S FAÇADE USING AUTOMATIC SCORE RECOGNITION , 1993 .

[129]  Jorge Calvo-Zaragoza,et al.  MuRET: a music recognition, encoding, and transcription tool , 2018, DLfm.

[130]  Alicia Fornés,et al.  A Starting Point for Handwritten Music Recognition , 2018 .

[131]  Donald Byrd,et al.  Prospects for Improving OMR with Multiple Recognizers , 2006, ISMIR.

[132]  Alicia Fornés,et al.  Optical Music Recognition by Recurrent Neural Networks , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).

[133]  Matthew J. Dovey Overview of the OMRAS project: Online music retrieval and searching , 2004, J. Assoc. Inf. Sci. Technol..

[134]  Alicia Fornés,et al.  The ICDAR 2011 Music Scores Competition: Staff Removal and Writer Identification , 2011, 2011 International Conference on Document Analysis and Recognition.

[135]  Dorothea Blostein,et al.  A graph grammar programming style for recognition of music notation , 1993, Machine Vision and Applications.

[136]  Ichlro FuJinaga,et al.  Optical Music Recognition Using Projections , 1988 .

[137]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[138]  Basilios Gatos,et al.  A survey of document image word spotting techniques , 2017, Pattern Recognit..

[139]  Jaroslav Pokorný,et al.  Introduction to Optical Music Recognition: Overview and Practical Challenges , 2015, DATESO.

[140]  Jorge Calvo-Zaragoza,et al.  Optical Music Recognition in Mensural Notation with Region-based Convolutional Neural Networks , 2018, ISMIR.

[141]  Perry Roland,et al.  The Music Encoding Initiative ( MEI ) , 2002 .

[142]  Tim Crawford,et al.  Evaluating OMR on the Early Music Online Collection , 2013, International Society for Music Information Retrieval Conference.

[143]  Michael Good MusicXML: An internet-friendly format for sheet music , 2001 .

[144]  Björn W. Schuller,et al.  The SEILS Dataset: Symbolically Encoded Scores in Modern-Early Notation for Computational Musicology , 2017, ISMIR.

[145]  Sanu Pulimootil Achankunju Music Search Engine from Noisy OMR Data , 2018 .

[146]  Alberto Ciampa,et al.  On Automatic Pattern Recognition and Acquisition of Printed Music , 1982, ICMC.

[147]  David S. Prerau,et al.  Computer pattern recognition of printed music , 1899, AFIPS '71 (Fall).

[148]  Pavel Pecina,et al.  The MUSCIMA++ Dataset for Handwritten Optical Music Recognition , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).

[149]  Henry S. Baird,et al.  A Critical Survey of Music Image Analysis , 1992 .

[150]  Brianne Ross,et al.  Behind bars. , 2011, Nursing times.