Real-Time Transcription and Separation of Drum Recordings Based on NMF Decomposition

This paper proposes a real-time capable method for transcribing and separating occurrences of single drum instruments in polyphonic drum recordings. Both the detection and the decomposition are based on Non-Negative Matrix Factorization and can be implemented with very small systemic delay. We propose a simple modification to the update rules that allows to capture timedynamic spectral characteristics of the involved drum sounds. The method can be applied in music production and music education software. Performance results with respect to drum transcription are presented and discussed. The evaluation data-set consisting of annotated drum recordings is published for use in further studies in the field. Index Terms drum transcription, source separation, nonnegative matrix factorization, spectral processing, audio plug-in, music production, music education

[1]  Jan Larsen,et al.  Single-channel source separation using non-negative matrix factorization , 2009 .

[2]  Gaël Richard,et al.  Automatic transcription of drum loops , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  Michael I. Jordan,et al.  Advances in Neural Information Processing Systems 30 , 1995 .

[4]  Jozef Baruník Diploma thesis , 1999 .

[5]  Christian Uhle,et al.  Further Steps towards Drum Transcription of Polyphonic Music , 2004 .

[6]  François Pachet,et al.  Automatic extraction of drum tracks from polyphonic music signals , 2002, Second International Conference on Web Delivering of Music, 2002. WEDELMUSIC 2002. Proceedings..

[7]  Jakob Abeßer,et al.  Modeling Musical Attributes to Characterize Two-Track Recordings with Bass and Drums , 2011, ISMIR.

[8]  Christian Dittmar,et al.  Drumloop Separation using adaptive Spectrogram Templates , 2010 .

[9]  Christian Uhle,et al.  EXTRACTION OF DRUM TRACKS FROM POLYPHONIC MUSIC USING INDEPENDENT SUBSPACE ANALYSIS , 2003 .

[10]  Mark D. Plumbley Algorithms for nonnegative independent component analysis , 2003, IEEE Trans. Neural Networks.

[11]  Anssi Klapuri,et al.  Conventional and periodic N-grams in the transcription of drum sequences , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[12]  Eugene Coyle,et al.  Prior Subspace Analysis for Drum Transcription , 2003 .

[13]  Mark D. Plumbley Algorithms for Non-Negative Independent Component Analysis , 2002 .

[14]  Michael N. Vrahatis,et al.  Real-time drums transcription with characteristic bandpass filtering , 2012, AM '12.

[15]  Andrew J. Viterbi,et al.  Error bounds for convolutional codes and an asymptotically optimum decoding algorithm , 1967, IEEE Trans. Inf. Theory.

[16]  Guillermo Sapiro,et al.  Online Learning for Matrix Factorization and Sparse Coding , 2009, J. Mach. Learn. Res..

[17]  H. Sebastian Seung,et al.  Algorithms for Non-negative Matrix Factorization , 2000, NIPS.

[18]  Anssi Klapuri,et al.  Drum Sound Detection in Polyphonic Music with Hidden Markov Models , 2009, EURASIP J. Audio Speech Music. Process..

[19]  Peter Grosche,et al.  Extracting Predominant Local Pulse Information From Music Recordings , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[20]  Augusto Sarti,et al.  DRUM MUSIC TRANSCRIPTION USING PRIOR SUBSPACE ANALYSIS AND PATTERN RECOGNITION , 2010 .

[21]  Morten Mørup,et al.  Nonnegative Matrix Factor 2-D Deconvolution for Blind Single Channel Source Separation , 2006, ICA.

[22]  Mark B. Sandler,et al.  Sonic visualiser: an open source application for viewing, analysing, and annotating music audio files , 2010, ACM Multimedia.

[23]  Bernard De Baets,et al.  AN ALGORITHM FOR DETECTING AND LABELING DRUM EVENTS IN POLYPHONIC MUSIC , 2005 .

[24]  Tuomas Virtanen,et al.  Separation of drums from polyphonic music using non-negative matrix factorization and support vector machine , 2005, 2005 13th European Signal Processing Conference.

[25]  Paris Smaragdis,et al.  Non-negative Matrix Factor Deconvolution; Extraction of Multiple Sound Sources from Monophonic Inputs , 2004, ICA.

[26]  Meinard Müller,et al.  Score-Informed Voice Separation For Piano Recordings , 2011, ISMIR.

[27]  Masataka Goto,et al.  Automatic Drum Sound Description for Real-World Music Using Template Adaptation and Matching Methods , 2004, ISMIR.

[28]  David Wessel,et al.  Live Drum Separation Using Probabilistic Spectral Clustering Based on the Itakura-Saito Divergence , 2011 .

[29]  Mark D. Plumbley,et al.  Automatic Music Transcription and Audio Source Separation , 2002, Cybern. Syst..

[30]  Derry Fitzgerald,et al.  Automatic Drum Transcription and Source Separation , 2004 .

[31]  Qiang Yang,et al.  Detect and Track Latent Factors with Online Nonnegative Matrix Factorization , 2007, IJCAI.

[32]  Jouni Paulus,et al.  Signal Processing Methods for Drum Transcription and Music Structure Analysis , 2010 .

[33]  Michael A. Casey,et al.  Separation of Mixed Audio Sources By Independent Subspace Analysis , 2000, ICMC.