Improving Genre Classification by Combination of Audio and Symbolic Descriptors Using a Transcription Systems

Recent research in music genre classification hints at a glass ceiling being reached using timbral audio features. To overcome this, the combination of multiple different feature sets bearing diverse characteristics is needed. We propose a new approach to extend the scope of the features: We transcribe audio data into a symbolic form using a transcription system, extract symbolic descriptors from that representation and combine them with audio features. With this method, we are able to surpass the glass ceiling and to further improve music genre classification, as shown in the experiments through three reference music databases and comparison to previously published performance results.

[1]  François Pachet,et al.  Improving Timbre Similarity : How high’s the sky ? , 2004 .

[2]  C. G. Fenwick How High is the Sky? , 1958, American Journal of International Law.

[3]  P.P. de Leon,et al.  Pattern Recognition Approach for Music Style Identification Using Shallow Statistical Descriptors , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[4]  George Tzanetakis,et al.  Music analysis and retrieval systems for audio signals , 2004, J. Assoc. Inf. Sci. Technol..

[5]  Anssi Klapuri,et al.  Multiple Fundamental Frequency Estimation by Summing Harmonic Amplitudes , 2006, ISMIR.

[6]  Gerhard Widmer,et al.  Towards Characterisation of Music via Rhythmic Patterns , 2004, ISMIR.

[7]  Xavier Serra,et al.  ISMIR 2004 Audio Description Contest , 2006 .

[8]  Gerhard Widmer,et al.  Probabilistic Combination of Features for Music Classification , 2006, ISMIR.

[9]  Elias Pampalk,et al.  Please Scroll down for Article Journal of New Music Research the Som-enhanced Jukebox: Organization and Visualization of Music Collections Based on Perceptual Models , 2022 .

[10]  Tao Li,et al.  A comparative study on content-based music genre classification , 2003, SIGIR.

[11]  Elias Pampalk,et al.  Computational Models of Music Similarity and their Application in Music Information Retrieval , 2006 .

[12]  Pedro Ponce de Len,et al.  Pattern Recognition Approach for Music Style Identification Using Shallow Statistical Descriptors , 2007, IEEE Trans. Syst. Man Cybern. Part C.

[13]  Xavier Rodet Musical Sound Signal Analysis/Synthesis: Sinusoidal+Residual and Elementary Waveform Models , 1997 .

[14]  Anssi Klapuri,et al.  Recognition of Note Onsets in Digital Music Using Semitone Bands , 2005, CIARP.

[15]  Andreas Rauber,et al.  Evaluation of Feature Extractors and Psycho-Acoustic Transformations for Music Genre Classification , 2005, ISMIR.

[16]  Ichiro Fujinaga,et al.  Automatic Genre Classification Using Large High-Level Musical Feature Sets , 2004, ISMIR.

[17]  Gerhard Widmer,et al.  Improvements of Audio-Based Music Similarity and Genre Classificaton , 2005, ISMIR.

[18]  José Manuel Iñesta Quereda,et al.  A Pattern Recognition Approach for Melody Track Selection in MIDI Files , 2006, ISMIR.

[19]  Anssi Klapuri,et al.  Signal Processing Methods for Music Transcription , 2006 .

[20]  Daniel P. W. Ellis,et al.  Song-Level Features and Support Vector Machines for Music Classification , 2005, ISMIR.

[21]  George Tzanetakis,et al.  Manipulation, analysis and retrieval systems for audio signals , 2002 .

[22]  Tao Li,et al.  Factors in automatic musical genre classification of audio signals , 2003, 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (IEEE Cat. No.03TH8684).

[23]  Douglas Eck,et al.  Aggregate features and ADABOOST for music classification , 2006, Machine Learning.