Dissonance Reduction In Polyphonic Audio Using Harmonic Reorganization

In this paper, a method for automatic reduction of dissonance in recorded isolated chords is proposed. Previous approaches address this problem using source separation and note-level processing. In our approach, we manipulate the harmonic structure as a whole in order to avoid beating partials which, according to prior research on dissonance perception, typically produce an unpleasant sound. The proposed system firstly performs a sinusoidal plus residual modeling of the input and analyses the various fundamental frequencies present in the chord. This information is used to create a symbolic representation of the in-tune version of the input according to some musical rules. Then, the partials of the signals are shifted in order to fit the in-tune harmonic structure of the input chord. The input is assumed to contain one isolated chord, with relatively stable fundamental frequencies belonging to the Western chromatic scale. The evaluation has been performed by 31 expert musicians, which have quantified the perceived consonance of six varied, out-of-tune chords in three variants: unprocessed, processed with our system and processed by a state-of-the-art commercial tool (Melodyne Editor). The proposed approach attains an important reduction of the perceived dissonance, showing better performance than Melodyne Editor for most of the cases evaluated.

[1]  Ana M. Barbancho,et al.  Inharmonicity-Based Method for the Automatic Generation of Guitar Tablature , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[2]  O. R. Gurney,et al.  An Old Babylonian Treatise on the Tuning of the Harp , 1968, Iraq.

[3]  John A. Swets,et al.  On the Width of Critical Bands , 1962 .

[4]  R. Plomp,et al.  Tonal consonance and critical bandwidth. , 1965, The Journal of the Acoustical Society of America.

[5]  Xavier Rodet,et al.  Tracking of partials for additive sound synthesis using hidden Markov models , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6]  B. Moore Frequency difference limens for short-duration tones. , 1973, The Journal of the Acoustical Society of America.

[7]  Axel Röbel,et al.  Multiple Fundamental Frequency Estimation and Polyphony Inference of Polyphonic Music Signals , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[8]  Guido Torelli,et al.  New Polyphonic Sound Generator Chip with Integrated Microprocessor-Programmable ADSR Envelope Shaper , 1983, IEEE Transactions on Consumer Electronics.

[9]  P. Depalle,et al.  Spectral Envelopes and Inverse FFT Synthesis , 1992 .

[10]  Norman Cazden Sensory Theories of Musical Consonance , 1962 .

[11]  Mototsugu Abe,et al.  Design Criteria for the Quadratically Interpolated FFT Method ( I ) : Bias due to Interpolation October 13 , 2004 , .

[12]  S. Zabell,et al.  On Student's 1908 Article “The Probable Error of a Mean” , 2008 .

[13]  Dan Tidhar,et al.  Estimation of harpsichord inharmonicity and temperament from musical recordings. , 2012, The Journal of the Acoustical Society of America.

[14]  A.P. Klapuri,et al.  A perceptually motivated multiple-F0 estimation method , 2005, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2005..

[15]  Kristoffer Jensen,et al.  ENVELOPE MODEL OF ISOLATED MUSICAL SOUNDS , 1999 .

[16]  Malcolm Slaney,et al.  An Efficient Implementation of the Patterson-Holdsworth Auditory Filter Bank , 1997 .

[17]  A. W. M. van den Enden,et al.  Discrete Time Signal Processing , 1989 .

[18]  Xavier Serra,et al.  A system for sound analysis/transformation/synthesis based on a deterministic plus stochastic decomposition , 1989 .

[19]  Anssi Klapuri,et al.  Multiple fundamental frequency estimation based on harmonicity and spectral smoothness , 2003, IEEE Trans. Speech Audio Process..

[20]  Masataka Goto,et al.  RWC Music Database: Music genre database and musical instrument sound database , 2003, ISMIR.

[21]  Matti Karjalainen,et al.  A computationally efficient multipitch analysis model , 2000, IEEE Trans. Speech Audio Process..

[22]  W. B. Haas Music information retrieval based on tonal harmony , 2012 .

[23]  J. Barbour Tuning and Temperament: A Historical Survey , 2004 .

[24]  Robert C. Maher,et al.  An approach for the separation of voices in composite musical signals , 1989 .

[25]  H. Helmholtz Die Lehre Von Den Tonempfindungen ALS Physiologische Grundlage Fur Die Theorie Der Musik , 2013 .