Genre classification of symbolic music with SMBGT

Automatic music genre classification is a task that has attracted the interest of the music community for more than two decades. Music can be of high importance within the area of assistive technologies as it can be seen as an assistive technology with high therapeutic and educational functionality for children and adults with disabilities. Several similarity methods and machine learning techniques have been applied in the literature to deal with music genre classification, and as a result data mining and Music Information Retrieval (MIR) are strongly interconnected. In this paper, we deal with music genre classification for symbolic music, and specifically MIDI, by combining the recently proposed novel similarity measure for sequences, SMBGT, with the k-Nearest Neighbor (k-NN) classifier. For all MIDI songs we first extract all of their channels and then transform each channel into a sequence of 2D points, providing information for pitch and duration of their music notes. The similarity between two songs is found by computing the SMBGT for all pairs of the songs' channels and getting the maximum pairwise channel score as their similarity. Each song is treated as a query to which k-NN is applied, and the returned genre of the classifier is the one with the majority of votes in the k neighbors. Classification accuracy results indicate that there is room for improvement, especially due to the ambiguous definitions of music genres that make it hard to clearly discriminate them. Using this framework can also help us analyze and understand potential disadvantages of SMBGT, and thus identify how it can be improved when used for classification of real-time sequences.

[1]  Roger B. Dannenberg,et al.  Melody Matching Directly From Audio , 2001 .

[2]  Ichiro Fujinaga,et al.  Automatic music classification and the importance of instrument identification , 2005 .

[3]  William P. Birmingham,et al.  Encoding Timing Information for Musical Query Matching , 2002, ISMIR.

[4]  Andreas Rauber,et al.  Evaluation of Feature Extractors and Psycho-Acoustic Transformations for Music Genre Classification , 2005, ISMIR.

[5]  Dennis Shasha,et al.  Warping indexes with envelope transforms for query by humming , 2003, SIGMOD '03.

[6]  François Pachet,et al.  Representing Musical Genre: A State of the Art , 2003 .

[7]  F. Fabbri A Theory Of Musical Genres. Two Applications , 1982 .

[8]  Costas S. Iliopoulos,et al.  String Matching with Gaps for Musical Melodic Recognition , 2002, Stringology.

[9]  Douglas Eck,et al.  Aggregate features and ADABOOST for music classification , 2006, Machine Learning.

[10]  William P. Birmingham,et al.  HMM-based musical query retrieval , 2002, JCDL '02.

[11]  George Tzanetakis,et al.  Musical genre classification of audio signals , 2002, IEEE Trans. Speech Audio Process..

[12]  Daniel P. W. Ellis,et al.  Song-Level Features and Support Vector Machines for Music Classification , 2005, ISMIR.

[13]  Steffen Pauws,et al.  CubyHum: a fully operational "query by humming" system , 2002, ISMIR.

[14]  Ichiro Fujinaga,et al.  Musical genre classification: Is it worth pursuing and how can it be improved? , 2006, ISMIR.

[15]  Daniel P. W. Ellis,et al.  Anchor space for classification and similarity measurement of music , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[16]  Jyh-Shing Roger Jang,et al.  A Query-by-Singing System based on Dynamic Programming , 2000 .

[17]  William P. Birmingham,et al.  A Comprehensive Trainable Error Model for Sung Music Queries , 2004, J. Artif. Intell. Res..

[18]  J. Stephen Downie,et al.  Survey Of Music Information Needs, Uses, And Seeking Behaviours: Preliminary Findings , 2004, ISMIR.

[19]  Elaine Chew,et al.  Challenging Uncertainty in Query by Humming Systems: A Fingerprinting Approach , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[20]  David Sankoff,et al.  Comparison of musical sequences , 1990, Comput. Humanit..

[21]  Dimitrios Gunopulos,et al.  A subsequence matching with gaps-range-tolerances framework , 2011, Proc. VLDB Endow..

[22]  Zehra Cataltepe,et al.  Music Genre Classification Using MIDI and Audio Features , 2007, EURASIP J. Adv. Signal Process..

[23]  Gregory H. Wakefield,et al.  Time Series Alignment for Music Information Retrieval , 2004, ISMIR.

[24]  Christos Faloutsos,et al.  Stream Monitoring under the Time Warping Distance , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[25]  L. Bergroth,et al.  A survey of longest common subsequence algorithms , 2000, Proceedings Seventh International Symposium on String Processing and Information Retrieval. SPIRE 2000.

[26]  Zhouyu Fu,et al.  A Survey of Audio-Based Music Classification and Annotation , 2011, IEEE Transactions on Multimedia.

[27]  Ning Hu,et al.  A Probabilistic Model of Melodic Similarity , 2002, ICMC.

[28]  Ian H. Witten,et al.  Towards the digital music library: tune retrieval from acoustic input , 1996, DL '96.

[29]  François Pachet,et al.  A taxonomy of musical genres , 2000, RIAO.

[30]  Dimitrios Gunopulos,et al.  A survey of query-by-humming similarity methods , 2012, PETRA '12.

[31]  N. Scaringella,et al.  Automatic genre classification of music content: a survey , 2006, IEEE Signal Process. Mag..

[32]  Dimitrios Gunopulos,et al.  Hum-a-song: A Subsequence Matching with Gaps-Range-Tolerances Query-By-Humming System , 2012, Proc. VLDB Endow..

[33]  Ronald de Wolf,et al.  Algorithmic Clustering of Music Based on String Compression , 2004, Computer Music Journal.

[34]  Justin Zobel,et al.  Melodic matching techniques for large music databases , 1999, MULTIMEDIA '99.

[35]  Hagen Soltau,et al.  Recognition of music types , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[36]  Esko Ukkonen,et al.  Including Interval Encoding into Edit Distance Based Music Comparison and Retrieval , 2003 .

[37]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[38]  David S. Watson,et al.  A Machine Learning Approach to Musical Style Recognition , 1997, ICMC.

[39]  Jean-Pierre Martens,et al.  A comparison of human and automatic musical genre classification , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.