MUSICALLY EXPRESSIVE SOUND TEXTURES FROM GENERALIZED AUDIO

We present a method of musically expressive synthesis-by-analysis that takes advantage of recent advancements in auditory scene analysis and sound separation algorithms. Our model represents incoming audio as a sub-conceptual model using statistical decorrelation techniques that abstract away individual auditory events, leaving only the gross parameters of the sound‐ the “eigensound” or generalized spectral template. Using these approaches we present various optimization guidelines and musical enhancements, specifically with regards to the beat and temporal nature of the sounds, with an eye towards real-time effects and synthesis. Our model results in completely novel and pleasing sound textures that can be varied with parameter tuning of the “unmixing” weight matrix.

[1]  Daniel P. W. Ellis,et al.  Sound texture modelling with linear prediction in both time and frequency domains , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[2]  Gene H. Golub,et al.  Matrix computations , 1983 .

[3]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[4]  Dani Lischinski,et al.  Texture Mixing and Texture Movie Synthesis Using Statistical Learning , 2001, IEEE Trans. Vis. Comput. Graph..

[5]  John C. Stapleton,et al.  Synthesis of musical tones based on the Karhunen-Loeve transform , 1988, IEEE Trans. Acoust. Speech Signal Process..

[6]  Michael R. Casey General sound recognition and similarity tools , 2001 .

[7]  Eric D. Scheirer,et al.  Tempo and beat analysis of acoustic musical signals. , 1998, The Journal of the Acoustical Society of America.

[8]  Kris Popat,et al.  Analysis and synthesis of sound textures , 1998 .

[9]  Agostino Di Scipio,et al.  SYNTHESIS OF ENVIRONMENTAL SOUND TEXTURES BY ITERATED NONLINEAR FUNCTIONS , 1999 .

[10]  Paris Smaragdis,et al.  Combining Musical and Cultural Features for Intelligent Style Detection , 2002, ISMIR.

[11]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.