Using Musical Relationships Between Chord Labels in Automatic Chord Extraction Tasks

Recent research on Automatic Chord Extraction (ACE) has focused on improving models based on machine learning. However, most models still fail to take into account the prior knowledge underlying the labeling alphabets (chord labels). Furthermore, recent work has shown that ACE performance is converging towards a glass ceiling. This prompts the need to focus on other aspects of the task, such as introducing musical knowledge into the representation, extending models to more complex chord alphabets, and developing better-adapted evaluation methods. In this paper, we propose to exploit specific properties of, and relationships between, chord labels in order to improve the learning of statistical ACE models. To this end, we analyze the interdependence of chord representations and their associated distances, the precision of the chord alphabets, and the impact of reducing the alphabet before or after training the model. Furthermore, we propose new training losses based on music theory. We show that these improve the results of ACE systems based on Convolutional Neural Networks. Through an in-depth analysis of our results, we uncover a set of related insights on ACE tasks based on statistical models, and also formalize the musical meaning of some classification errors.
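
As an illustration of what reducing a chord alphabet can look like in practice, the following minimal sketch maps labels written in a `root:quality` syntax onto a smaller major/minor alphabet. The mapping table, the function names, and the choice to send unlisted qualities to the no-chord symbol `N` are assumptions made for this example; they are not the alphabets or reduction rules used in the paper.

```python
# Hypothetical mapping from chord qualities to a major/minor alphabet.
# This table is an assumption for illustration, not the paper's alphabet.
QUALITY_TO_MAJMIN = {
    "maj": "maj", "maj7": "maj", "7": "maj", "maj6": "maj",
    "min": "min", "min7": "min", "min6": "min", "minmaj7": "min",
}
NO_CHORD = "N"


def reduce_label(label: str) -> str:
    """Reduce a 'root:quality' chord label to the major/minor alphabet."""
    if label == NO_CHORD:
        return NO_CHORD
    root, _, quality = label.partition(":")
    # Drop any bass annotation ("/3") and treat a bare root as major.
    quality = quality.split("/")[0] or "maj"
    reduced = QUALITY_TO_MAJMIN.get(quality)
    # Assumption: qualities outside the table fall back to no-chord.
    return f"{root}:{reduced}" if reduced else NO_CHORD


def reduce_sequence(labels):
    """Apply the reduction to every label of an annotation sequence."""
    return [reduce_label(label) for label in labels]


if __name__ == "__main__":
    print(reduce_sequence(["C:maj7", "A:min7/b3", "G:7", "D:dim", "N"]))
```

Applied to the reference annotations, such a function corresponds to reducing the alphabet before training; applied to the predictions of a model trained on the larger alphabet, it corresponds to reducing after training.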
