Approximate Note Transcription for the Improved Identification of Difficult Chords

The automatic detection and transcription of musical chords from audio is an established music computing task. The choice of chord profiles and higher-level time-series modelling have received a lot of attention, resulting in methods with an overall performance of more than 70% in the MIREX Chord Detection task 2009. Research on the front end of chord transcription algorithms has often concentrated on finding good chord templates to fit the chroma features. In this paper we reverse this approach and seek to find chroma features that are more suitable for usage in a musically-motivated model. We do so by performing a prior approximate transcription using an existing technique to solve non-negative least squares problems (NNLS). The resulting NNLS chroma features are tested by using them as an input to an existing state-of-the-art high-level model for chord transcription. We achieve very good results of 80% accuracy using the song collection and metric of the 2009 MIREX Chord Detection tasks. This is a significant increase over the top result (74%) in MIREX 2009. The nature of some chords makes their identification particularly susceptible to confusion between fundamental frequency and partials. We show that the recognition of these diffcult

[1]  Anssi Klapuri,et al.  Multiple Fundamental Frequency Estimation by Summing Harmonic Amplitudes , 2006, ISMIR.

[2]  Takuya Yoshioka,et al.  Automatic Chord Transcription with Concurrent Recognition of Chord Symbols and Boundaries , 2004, ISMIR.

[3]  Kevin Murphy,et al.  Bayes net toolbox for Matlab , 1999 .

[4]  Malcolm Slaney,et al.  Acoustic Chord Transcription and Key Extraction From Audio Using Key-Dependent HMMs Trained on Synthesized Audio , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[5]  Daniel P. W. Ellis,et al.  Structured Prediction Models for Chord Transcription of Music Audio , 2009, 2009 International Conference on Machine Learning and Applications.

[6]  Charles L. Lawson,et al.  Solving least squares problems , 1976, Classics in applied mathematics.

[7]  Simon Dixon,et al.  Simultaneous Estimation of Chords and Musical Context From Audio , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[8]  Geoffroy Peeters Chroma-based estimation of musical key from audio-signal analysis , 2006, ISMIR.

[9]  Matthew E. P. Davies,et al.  Towards a musical beat emphasis function , 2009, 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[10]  Marc Leman,et al.  A Probabilistic Framework for Audio-Based Tonal Key and Chord Recognition , 2006, GfKl.

[11]  Simon Dixon,et al.  10 th International Society for Music Information Retrieval Conference ( ISMIR 2009 ) USING MUSICAL STRUCTURE TO ENHANCE AUTOMATIC CHORD TRANSCRIPTION , 2009 .

[12]  Mark D. Plumbley,et al.  Polyphonic music transcription by non-negative sparse coding of power spectra , 2004 .

[13]  Anssi Klapuri,et al.  Automatic Transcription of Melody, Bass Line, and Chords in Polyphonic Music , 2008, Computer Music Journal.

[14]  Matthias Mauch,et al.  Automatic chord transcription from audio using computational models of musical context , 2010 .

[15]  Jean-Pierre Martens,et al.  A novel chroma representation of polyphonic music based on multiple pitch tracking techniques , 2008, ACM Multimedia.

[16]  Geoffroy Peeters,et al.  Simultaneous estimation of chord progression and downbeats from an audio file , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[17]  Namunu Chinthaka Maddage Automatic structure detection for popular music , 2006, IEEE Multimedia.

[18]  Marc Leman,et al.  Content-Based Music Information Retrieval: Current Directions and Future Challenges , 2008, Proceedings of the IEEE.

[19]  Yves Grenier,et al.  Template-based Chord Recognition : Influence of the Chord Types , 2009, ISMIR.