An open-source drum transcription system for Pure Data and Max MSP

This paper presents a drum transcription algorithm adjusted to the constraints of real-time audio. We introduce an instance filtering (IF) method using sub-band onset detection, which improves the performance of a system having at its core a feature-based K-nearest neighbor classifier (KNN). The architecture proposed allows for adapting different parts of the algorithm for either bass drum, snare drum or hi-hat cymbals. The open-source system is implemented in the graphic programming languages Pure Data (PD) and Max MSP, and aims to work with a large variety of drum sets. We evaluated its performance on a database of audio samples generated from a well known collection of midi drum loops randomly matched with a diverse collection of drum sets. Both of the evaluation stages, testing and validation, show an improvement in the performance when using the instance filtering algorithm.

[1]  Masataka Goto,et al.  Drum Sound Recognition for Polyphonic Audio Signals by Adaptation and Matching of Spectrogram Templates With Harmonic Structure Suppression , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[2]  David Wessel,et al.  Live Drum Separation Using Probabilistic Spectral Clustering Based on the Itakura-Saito Divergence , 2011 .

[3]  Ajay Kapur,et al.  Subband-based Drum Transcription for Audio Signals , 2005, 2005 IEEE 7th Workshop on Multimedia Signal Processing.

[4]  D. FITZGERALD,et al.  Drum Transcription using Automatic Grouping of Events and Prior Subspace Analysis , 2003 .

[5]  W. Brent Cepstral Analysis Tools for Percussive Timbre Identification , 2011 .

[6]  Bernard De Baets,et al.  AN ALGORITHM FOR DETECTING AND LABELING DRUM EVENTS IN POLYPHONIC MUSIC , 2005 .

[7]  Paul M. Brossier,et al.  Automatic annotation of musical audio for interactive applications , 2006 .

[8]  Jouni Paulus,et al.  Drum transcription with non-negative spectrogram factorisation , 2005, 2005 13th European Signal Processing Conference.

[9]  Fabien Gouyon,et al.  Automatic labeling of unpitched percussion sounds , 2003 .

[10]  Marco Marchini,et al.  An Unsupervised System for the Synthesis of Variations from Audio Percussion Patterns , 2010, CMMR 2010.

[11]  Gaël Richard,et al.  Supervised and Unsupervised Sequence Modelling for Drum Transcription , 2007, ISMIR.

[12]  Gaël Richard,et al.  Automatic transcription of drum loops , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[13]  Heikki Mannila,et al.  Principles of Data Mining , 2001, Undergraduate Topics in Computer Science.

[14]  N. M. Collins,et al.  Towards autonomous agents for live computer music : realtime machine listening and interactive music systems , 2007 .

[15]  Gaël Richard,et al.  Drum Track Transcription of Polyphonic Music Using Noise Subspace Projection , 2005, ISMIR.