Percussion-related Semantic Descriptors of Music Audio Files

Automatic extraction of semantic music content descriptors has traditionally focused on melodic, rhythmic and harmonic aspects. In the present paper, we will present several music content descriptors that are related to percussion instrumentation. The “percussion index” estimates the amount of percussion that can be found in a music audio file and yields a (numerical or categorical) value that represents the amount of percussion detected in the file. A further refinement is the “percussion profile”, which roughly indicates the existing balance between drums and cymbals. We finally present the percussivity descriptor, which represents the overall impulsiveness or abruptness of the percussive events. Data from initial evaluations, both objective and subjective will also be presented and discussed.

[1]  Anssi Klapuri,et al.  MODEL-BASED EVENT LABELING IN THE TRANSCRIPTION OF PERCUSSIVE AUDIO SIGNALS , 2003 .

[2]  Anssi Klapuri,et al.  Sound onset detection by applying psychoacoustic knowledge , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[3]  Marc Leman,et al.  Automatic Harmonic Description of Musical Signals Using Schema-based Chord Decomposition , 1999 .

[4]  Tuomas Virtanen,et al.  Sound Source Separation Using Sparse Coding with Temporal Continuity Objective , 2003, ICMC.

[5]  Christian Uhle,et al.  EXTRACTION OF DRUM TRACKS FROM POLYPHONIC MUSIC USING INDEPENDENT SUBSPACE ANALYSIS , 2003 .

[6]  Jian Tang,et al.  Parametric vector quantization for coding percussive sounds in music , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[7]  Derry Fitzgerald,et al.  SUB-BAND INDEPENDENT SUBSPACE ANALYSIS FOR DRUM TRANSCRIPTION , 2002 .

[8]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[9]  Michael A. Casey,et al.  Separation of Mixed Audio Sources By Independent Subspace Analysis , 2000, ICMC.

[10]  Eric D. Scheirer,et al.  Music Content Analysis through Models of Audition , 1998 .

[11]  Anssi Klapuri,et al.  Conventional and periodic N-grams in the transcription of drum sequences , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[12]  Eugene Coyle,et al.  Prior Subspace Analysis for Drum Transcription , 2003 .

[13]  Marc Leman,et al.  Classification of Percussive Sounds using Support Vector Machines , 2004 .

[14]  François Pachet,et al.  Automatic extraction of drum tracks from polyphonic music signals , 2002, Second International Conference on Web Delivering of Music, 2002. WEDELMUSIC 2002. Proceedings..

[15]  Robert E. Schapire,et al.  A Brief Introduction to Boosting , 1999, IJCAI.

[16]  Yoichi Muraoka,et al.  A Real-Time Beat Tracking System for Audio Signals , 1996, ICMC.

[17]  Fabien Gouyon,et al.  Automatic labeling of unpitched percussion sounds , 2003 .

[18]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[19]  Terrence J. Sejnowski,et al.  An Information-Maximization Approach to Blind Separation and Blind Deconvolution , 1995, Neural Computation.

[20]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[21]  Derry Fitzgerald,et al.  Drum Transcription in the presence of pitched instruments using Prior Subspace Analysis , 2003 .