Social-EQ: Crowdsourcing an Equalization Descriptor Map

We seek to simplify audio production interfaces (such as those for equalization) by letting users communicate their audio production objectives with descriptive language (e.g. “Make the violin sound ‘warmer.’”). To achieve this goal, a system must be able to tell whether the stated goal is appropriate for the selected tool (e.g. making the violin “warmer” with a panning tool does not make sense). If the goal is appropriate for the tool, it must know what actions need to be taken. Further, the tool should not impose a vocabulary on users, but rather understand the vocabulary users prefer. In this work, we describe SocialEQ, a webbased project for learning a vocabulary of actionable audio equalization descriptors. Since deployment, SocialEQ has learned 324 distinct words in 731 learning sessions. Data on these terms is made available for download. We examine terms users have provided, exploring which ones map well to equalization, which ones have broadly-agreed upon meaning, which term have meanings specific small groups, and which terms are synonymous.

[1]  Brian R Glasberg,et al.  Derivation of auditory filter shapes from notched-noise data , 1990, Hearing Research.

[2]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[3]  Bryan Pardo,et al.  Weighted-Function-Based Rapid Mapping of Descriptors to Audio Processing Parameters , 2011 .

[4]  H. Helmholtz,et al.  Book Reviews: On the Sensations of Tone as a Physiological Basis for the Theory of Music , 1954 .

[5]  R. Shepard Geometrical approximations to the structure of musical pitch. , 1982, Psychological review.

[6]  G. Soete,et al.  Perceptual scaling of synthesized musical timbres: Common dimensions, specificities, and latent subject classes , 1995, Psychological research.

[7]  Dale Reed Capturing perceptual expertise: a sound equalization expert system , 2001, Knowl. Based Syst..

[8]  J. Grey Multidimensional perceptual scaling of musical timbres. , 1977, The Journal of the Acoustical Society of America.

[9]  Joshua D. Reiss,et al.  An Investigation of Musical Timbre: Uncovering Salient Semantic Descriptors and Perceptual Dimensions , 2011, ISMIR.

[10]  Jörn Loviscach,et al.  subjEQt: controlling an equalizer through subjective terms , 2006, CHI EA '06.

[11]  Lawrence N. Solomon Search for Physical Correlates to Psychological Dimensions of Sounds , 1959 .

[12]  P. Groenen,et al.  Modern Multidimensional Scaling: Theory and Applications , 1999 .

[13]  John R. Hershey,et al.  Approximating the Kullback Leibler Divergence Between Gaussian Mixture Models , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[14]  David Miles Huber,et al.  Modern Recording Techniques , 1974 .

[15]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[16]  H. Helmholtz,et al.  On the Sensations of Tone as a Physiological Basis for the Theory of Music , 2005 .

[17]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[18]  Ewa Lukasik,et al.  Towards timbre-driven semantic retrieval of violins , 2005, 5th International Conference on Intelligent Systems Design and Applications (ISDA'05).

[19]  Jeffrey J. Scott,et al.  An Audio Processing Library for MIR Application Development in Flash , 2010, ISMIR.

[20]  Denis Smalley,et al.  Spectromorphology: explaining sound-shapes , 1997, Organised Sound.

[21]  Alastair C. Disley,et al.  Quarterly Progress and Status Report Spectral correlates of timbral semantics relating to the pipe organ , 2007 .