Refined Spectral Template Models for Score Following

Score followers often use spectral templates for notes and chords to estimate the similarity between positions in the score and the incoming audio stream. Here, we propose two methods on different modelling levels to improve the quality of these templates, and subsequently the quality of the alignment. The first method focuses on creating more informed templates for individual notes. This is achieved by estimating the template based on synthesised sounds rather than generic Gaussian mixtures, as used in current state-of-theart systems. The second method introduces an advanced approach to aggregate individual note templates into spectral templates representing a specific score position. In contrast to score chordification, the common procedure used by score followers to deal with polyphonic scores, we use weighting functions to weight notes, observing their temporal relationships. We evaluate both methods against a dataset of classical piano music to show their positive impact on the alignment quality.

[1]  Barry Vercoe,et al.  The Synthetic Performer in The Context of Live Performance , 1984, International Conference on Mathematics and Computing.

[2]  Pedro Cano,et al.  Low-Delay Singing Voice Alignment to Text , 1999, ICMC.

[3]  Christopher Raphael,et al.  Aligning music audio with symbolic scores using a hybrid graphical model , 2006, Machine Learning.

[4]  Roger B. Dannenberg,et al.  Understanding Features and Distance Functions for Music Sequence Alignment , 2010, ISMIR.

[5]  A. Doucet,et al.  A Tutorial on Particle Filtering and Smoothing: Fifteen years later , 2008 .

[6]  Roger B. Dannenberg,et al.  Enhanced Vocal Performance Tracking Using Multiple Information Sources , 1998, ICMC.

[7]  Nicola Orio,et al.  Robust Polyphonic Midi Score Following with Hidden Markov Models , 2004, ICMC.

[8]  Tetsuya Ogata,et al.  Real-Time Audio-to-Score Alignment Using Particle Filter for Coplayer Music Robots , 2011, EURASIP J. Adv. Signal Process..

[9]  Gerhard Widmer,et al.  Adaptive distance normalization for real-time music tracking , 2012, 2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO).

[10]  Christopher Raphael,et al.  Evaluation of Real-Time Audio-to-Score Alignment , 2007, ISMIR.

[11]  Christopher Raphael,et al.  Automatic Segmentation of Acoustic Musical Signals Using Hidden Markov Models , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Christopher Raphael,et al.  Music Plus One and Machine Learning , 2010, ICML.

[13]  Nicola Orio,et al.  Score Following Using Spectral Analysis and Hidden Markov Models , 2001, ICMC.

[14]  Arshia Cont,et al.  A Coupled Duration-Focused Architecture for Real-Time Music-to-Score Alignment , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Gerhard Widmer,et al.  Automatic Page Turning for Musicians via Real-Time Machine Listening , 2008, ECAI.

[16]  Pedro Cano,et al.  Score-Performance Matching Using HMMs , 1999, ICMC.

[17]  Arshia Cont Realtime Audio to Score Alignment for Polyphonic Music Instruments, using Sparse Non-Negative Constraints and Hierarchical HMMS , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[18]  Roger B. Dannenberg,et al.  An On-Line Algorithm for Real-Time Accompaniment , 1984, ICMC.

[19]  Miller Puckette,et al.  Score Following Using the Sung Voice , 1995, ICMC.

[20]  Bernhard Niedermayer Non-Negative Matrix Division for the Automatic Transcription of Polyphonic Music , 2008, ISMIR.

[21]  Florian Krebs,et al.  Tracking rests and Tempo changes: Improved Score following with Particle filters , 2013, ICMC.