论文信息 - Navigating descriptive sub-representations of musical timbre

Navigating descriptive sub-representations of musical timbre

Musicians, audio engineers and producers often make use of common timbral adjectives to describe musical signals and transformations. However, the subjective nature of these terms, and the variability with respect to musical context often leads to inconsistencies in their deﬁnition. In this study, a model is proposed for controlling an equaliser by navigating clusters of datapoints, which represent grouped parameter settings with the same timbral description. The associated interface allows users to identify the nearest cluster to their current parameter setting and recommends changes based on its relationship to a cluster centroid. To do this, we apply dimensionality reduction to a dataset of equaliser curves described as warm and bright using a stacked autoencoder, then group the entries using an agglomerative clustering algorithm with a coherence-based distance criterion. To test the eﬃcacy of the system, we implement listening tests and show that subjects are able to match datapoints to their respective sub-representations with 93.75% mean accuracy.

Spyridon Stasis | Jason Hockman | Ryan Stables

[1] W. Marsden. I and J , 2012 .

[2] Joshua D. Reiss,et al. Semantic Description of Timbral Transformations in Music Production , 2016, ACM Multimedia.

[3] Andrew J. Hunt,et al. Timbral description of musical instruments , 2006 .

[4] Geoffrey E. Hinton,et al. Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[5] Sinan Saraçli,et al. Comparison of hierarchical cluster analysis methods by cophenetic correlation , 2013, Journal of Inequalities and Applications.

[6] Joshua D. Reiss,et al. Automatic Equalization of Multichannel Audio Using Cross-Adaptive Methods , 2009 .

[7] Spyridon Stasis,et al. Semantically Controlled Adaptive Equalisation in Reduced Dimensionality Parameter Space , 2016 .

[8] Brecht De Man,et al. Web Audio Evaluation Tool: A framework for subjective assessment of audio , 2016 .

[9] Tim Brookes,et al. Perceptually-Motivated Audio Morphing: Warmth , 2010 .

[10] P. Legendre,et al. Comparison tests for dendrograms: A comparative evaluation , 1995 .

[11] György Fazekas,et al. SAFE: A System for the Extraction and Retrieval of Semantic Audio Descriptors , 2014 .