论文信息 - GENERALISED PRIOR SUBSPACE ANALYSIS FOR POLYPHONIC PITCH TRANSCRIPTION

GENERALISED PRIOR SUBSPACE ANALYSIS FOR POLYPHONIC PITCH TRANSCRIPTION

A reformulation of Prior Subspace Analysis (PSA) is presented, which restates the problem as that of fitting an undercomplete signal dictionary to a spectrogram. Further, a generalization of PSA is derived which allows the transcription of polyphonic pitched instruments. This involves the translation of a single frequency prior subspace of a note to approximate other notes, overcoming the problem of needing a separate basis function for each note played by an instrument. Examples are then demonstrated which show the utility of the generalised PSA algorithm for the purposes of polyphonic pitch transcription.

[1] Jouni Paulus,et al. Drum transcription with non-negative spectrogram factorisation , 2005, 2005 13th European Signal Processing Conference.

[2] Barak A. Pearlmutter,et al. Monaural Source Separation Using Spectral Cues , 2004, ICA.

[3] Hiroki Asari,et al. Non-negative Matrix Factorization: A possible way to learn sound dictionaries , 2005 .

[4] H. Sebastian Seung,et al. Algorithms for Non-negative Matrix Factorization , 2000, NIPS.

[5] Patrik O. Hoyer,et al. Non-negative sparse coding , 2002, Proceedings of the 12th IEEE Workshop on Neural Networks for Signal Processing.

[6] Mark D. Plumbley,et al. Polyphonic transcription by non-negative sparse coding of power spectra , 2004, ISMIR.

[7] Tuomas Virtanen,et al. Sound Source Separation Using Sparse Coding with Temporal Continuity Objective , 2003, ICMC.

[8] Petri Toiviainen,et al. MIR In Matlab: The MIDI Toolbox , 2004, ISMIR.

[9] Michael A. Casey,et al. Separation of Mixed Audio Sources By Independent Subspace Analysis , 2000, ICMC.

[10] Tuomas Virtanen,et al. Separation of sound sources by convolutive sparse coding , 2004, SAPA@INTERSPEECH.

[11] Derry Fitzgerald,et al. Automatic Drum Transcription and Source Separation , 2004 .

[12] Barak A. Pearlmutter,et al. Blind Source Separation by Sparse Decomposition in a Signal Dictionary , 2001, Neural Computation.

[13] P. Smaragdis,et al. Non-negative matrix factorization for polyphonic music transcription , 2003, 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (IEEE Cat. No.03TH8684).

[14] Xavier Rodet,et al. Music Transcription with ISA and HMM , 2004, ICA.

[15] D. Fitzgerald,et al. Shifted non-negative matrix factorisation for sound source separation , 2005, IEEE/SP 13th Workshop on Statistical Signal Processing, 2005.

[16] Mark D. Plumbley,et al. Polyphonic music transcription by non-negative sparse coding of power spectra , 2004 .

[17] Pierre Comon,et al. Independent component analysis, A new concept? , 1994, Signal Process..