Navigating descriptive sub-representations of musical timbre

Musicians, audio engineers and producers often make use of common timbral adjectives to describe musical signals and transformations. However, the subjective nature of these terms, and the variability with respect to musical context often leads to inconsistencies in their definition. In this study, a model is proposed for controlling an equaliser by navigating clusters of datapoints, which represent grouped parameter settings with the same timbral description. The associated interface allows users to identify the nearest cluster to their current parameter setting and recommends changes based on its relationship to a cluster centroid. To do this, we apply dimensionality reduction to a dataset of equaliser curves described as warm and bright using a stacked autoencoder, then group the entries using an agglomerative clustering algorithm with a coherence-based distance criterion. To test the efficacy of the system, we implement listening tests and show that subjects are able to match datapoints to their respective sub-representations with 93.75% mean accuracy.

[1]  W. Marsden I and J , 2012 .

[2]  Joshua D. Reiss,et al.  Semantic Description of Timbral Transformations in Music Production , 2016, ACM Multimedia.

[3]  Andrew J. Hunt,et al.  Timbral description of musical instruments , 2006 .

[4]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[5]  Sinan Saraçli,et al.  Comparison of hierarchical cluster analysis methods by cophenetic correlation , 2013, Journal of Inequalities and Applications.

[6]  Joshua D. Reiss,et al.  Automatic Equalization of Multichannel Audio Using Cross-Adaptive Methods , 2009 .

[7]  Spyridon Stasis,et al.  Semantically Controlled Adaptive Equalisation in Reduced Dimensionality Parameter Space , 2016 .

[8]  Brecht De Man,et al.  Web Audio Evaluation Tool: A framework for subjective assessment of audio , 2016 .

[9]  Tim Brookes,et al.  Perceptually-Motivated Audio Morphing: Warmth , 2010 .

[10]  P. Legendre,et al.  Comparison tests for dendrograms: A comparative evaluation , 1995 .

[11]  György Fazekas,et al.  SAFE: A System for the Extraction and Retrieval of Semantic Audio Descriptors , 2014 .

[12]  Bryan Pardo,et al.  Social-EQ: Crowdsourcing an Equalization Descriptor Map , 2013, ISMIR.

[13]  Joshua D. Reiss,et al.  An Additive Synthesis Technique for Independent Modification of the Auditory Perceptions of Brightness and Warmth , 2011 .

[14]  Joshua D. Reiss,et al.  Autonomous Multitrack Equalization Based on Masking Reduction , 2015 .

[15]  J. G. Skellam,et al.  A New Method for determining the Type of Distribution of Plant Individuals , 1954 .

[16]  Jörn Loviscach,et al.  subjEQt: controlling an equalizer through subjective terms , 2006, CHI EA '06.

[17]  Tim Brookes,et al.  Perceptually-Motivated Audio Morphing: Brightness , 2007 .

[18]  James W. Beauchamp,et al.  Synthesis by Spectral Amplitude and "Brightness" Matching of Analyzed Musical Instrument Tones , 1981 .

[19]  Graham Darke,et al.  Assessment of Timbre Using Verbal Attributes , 2005 .

[20]  Joshua D. Reiss,et al.  Proximity Effect Detection for Directional Microphones , 2011 .

[21]  Tim Brookes,et al.  Perceptually-Motivated Audio Morphing: Softness , 2009 .

[22]  Vesa Välimäki,et al.  All About Audio Equalization: Solutions and Frontiers , 2016 .

[23]  Esben Skovenborg Development of Semantic Scales for Music Mastering , 2016 .

[24]  Razvan Pascanu,et al.  Theano: A CPU and GPU Math Compiler in Python , 2010, SciPy.

[25]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[26]  Bryan Pardo,et al.  Reverbalize: A Crowdsourced Reverberation Controller , 2014, ACM Multimedia.

[27]  J. Grey Multidimensional perceptual scaling of musical timbres. , 1977, The Journal of the Acoustical Society of America.