Mono-To-Stereo Blind Upmix Using Non-Negative Matrix Factorization and Decorrelator

This paper presents a new method for upmixing mono signal to stereo signal with guaranteeing high stereophonic image quality (SIQ) and large apparent source width (ASW). The proposed method consists of analysis phase and synthesis phase. In analysis phase, a mono signal is first decomposed into multiple sound sources by the use of high-rank nonnegative matrix factorization. Then the multiple sources are clustered into two groups based on tonality criterion. In synthesis phase, one group is directly fed into left and right channels while the other group is decorrelated before being fed into each channel. Subjective tests reveals that the proposed method gives listener high SIQ and large ASW with minimizing timbral distortions.

[1]  Mathieu Lagrange,et al.  Semi-Automatic Mono to Stereo Up-Mixing Using Sound Source Formation , 2007 .

[2]  H. Sebastian Seung,et al.  Algorithms for Non-negative Matrix Factorization , 2000, NIPS.

[3]  Francis Rumsey,et al.  Spatial audio and sensory evaluation techniques - context, history and aims , 2006 .

[4]  Malcolm J. Hawksford,et al.  Diffuse Signal Processing and Acoustic Source Characterization for Applications in Synthetic Loudspeaker Arrays , 2002 .

[5]  Constantine Kotropoulos,et al.  Musical Instrument Classification using Non-Negative Matrix Factorization Algorithms and Subset Feature Selection , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[6]  Tuomas Virtanen,et al.  Separation of drums from polyphonic music using non-negative matrix factorization and support vector machine , 2005, 2005 13th European Signal Processing Conference.

[7]  Manfred R. Schroeder,et al.  -Colorless- Artificial Reverberation , 1960 .

[8]  Francis Rumsey,et al.  On the relative importance of spatial and timbral fidelities in judgments of degraded multichannel audio quality. , 2005, The Journal of the Acoustical Society of America.

[9]  P. Smaragdis,et al.  Non-negative matrix factorization for polyphonic music transcription , 2003, 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (IEEE Cat. No.03TH8684).

[10]  Juergen Herre,et al.  Ambience Separation from Mono Recordings Using Non-Negative Matrix Factorization , 2007 .