DESCRIPTOR-BASED SOUND TEXTURE SAMPLING

Existing methods for sound texture synthesis are often concerned with the extension of a given recording, while keeping its overall properties and avoiding artefacts. However, they generally lack controllability of the resulting sound texture. After a review and classification of existing approaches, we propose two methods of statistical modeling of the audio descriptors of texture recordings using histograms and Gaussian mixture models. The models can be interpolated to steer the evolution of the sound texture between different target recordings (e.g. from light to heavy rain). Target descriptor values are stochastically drawn from the statistic models by inverse transform sampling to control corpus-based concatenative synthesis for the final sound generation, that can also be controlled interactively by navigation through the descriptor space. To better cover the target descriptor space, we expand the corpus by automatically generating variants of the source sounds with transformations applied, and storing only the resulting descriptors and the transformation parameters in the corpus.

[1]  Perry R. Cook,et al.  Feature-Based Synthesis: Mapping Acoustic and Perceptual Features onto Synthesis Parameters , 2006, ICMC.

[2]  Diemo Schwarz,et al.  Principles and Applications of Interactive Corpus-Based Concatenative Synthesis , 2008 .

[3]  Ismo Kauppinen,et al.  AN ADAPTIVE TECHNIQUE FOR MODELING AUDIO SIGNALS , 2001 .

[4]  Charles Bascou,et al.  New sound Decomposition Method Applied to Granular synthesis , 2005, ICMC.

[5]  Daniel P. W. Ellis,et al.  Sound texture modelling with linear prediction in both time and frequency domains , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[6]  D. Schwarz,et al.  Corpus-Based Concatenative Synthesis , 2007, IEEE Signal Processing Magazine.

[7]  Daniel Arfib,et al.  Using visual textures for sonir textures production an control , 2006 .

[8]  Daniel Arfib,et al.  Instrumental gestures and sonic textures , 2005 .

[9]  M. Sile O'Modhrain,et al.  PebbleBox and CrumbleBag: Tactile Interfaces for Granular Synthesis , 2004, NIME.

[10]  Sandro Guidati Auralisation and psychoacoustic evaluation of traffic noise scenarios , 2008 .

[11]  Nathaniel Finney,et al.  Autonomous Generation of Soundscapes using Unstructured Sound Databases , 2009 .

[12]  Nicolas Tsingos,et al.  Retargetting Example Sounds to Interactive Physics-driven Animations , 2009 .

[13]  Diemo Schwarz,et al.  Ftm - Complex Data Structures for Max , 2005, ICMC.

[14]  Norbert Schnell,et al.  MnM: a Max/MSP mapping toolbox , 2005, NIME.

[15]  Dani Lischinski,et al.  Synthesizing Sound Textures through Wavelet Tree Learning , 2002, IEEE Computer Graphics and Applications.

[16]  Rabab K. Ward,et al.  (SPECIAL SECTION - SIGNAL PROCESSING FOR SOUND SYNTHESIS ) , 2007 .

[17]  Dani Lischinski,et al.  Synthesis of Sound Textures by Learning and Resampling of Wavelet Trees , .

[18]  Diemo Schwarz Concatenative sound synthesis: The early years , 2006 .

[19]  Andrew Horner,et al.  Sound Texture Synthesis Using an Overlap–Add/Granular Synthesis Approach , 2009 .

[20]  Marc Cardle Automated Sound Editing , 2004 .

[21]  Andrea Valle,et al.  A graph-based system for the dynamic generation of soundscapes , 2009 .

[22]  Tae Hong Park,et al.  Feature modulation synthesis (FMS) , 2007, ICMC.

[23]  Diemo Schwarz,et al.  Scalability in Content-Based Navigation of Sound Databases , 2009, ICMC.

[24]  David Birchfield,et al.  Design of a Generative Model for Soundscape Creation , 2005, ICMC.

[25]  Perry R. Cook,et al.  Toward Synthesized Environments: A Survey of Analysis and Synthesis Methods for Sound Designers and Composers , 2009, ICMC.

[26]  Diemo Schwarz,et al.  A Modular Sound Descriptor Analysis Framework For Relaxed-Real-Time Applications , 2010, ICMC.

[27]  L. Wyse,et al.  SOUND TEXTURE MODELING AND TIME-FREQUENCY LPC , 2004 .

[28]  Agostino Di Scipio,et al.  SYNTHESIS OF ENVIRONMENTAL SOUND TEXTURES BY ITERATED NONLINEAR FUNCTIONS , 1999 .

[29]  SCRIME-LaBRI STATISTICAL APPROACH FOR SOUND MODELING , 2000 .

[30]  Jim R. Parker,et al.  Creating audio textures by example: tiling and stitching , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[31]  Mathieu Lagrange,et al.  Perceptual Evaluation of a Real-time Synthesis Technique for Rolling Sounds , 2007 .

[32]  Dani Lischinski,et al.  Granular Synthesis of Sound Textures Using Statistical Learning , 1999, ICMC.

[33]  Mathieu Lagrange,et al.  Objective quality measurement of the excitation of impact sounds in a source/filter model , 2008 .

[34]  Dinesh K. Pai,et al.  Manipulation and Resynthesis with Natural Grains , 2001, ICMC.

[35]  Chen Shen,et al.  Synthesizing sounds from rigid-body simulations , 2002, SCA '02.

[36]  Diemo Schwarz,et al.  Corpus-Based Transcription as an Approach to the Compositional Control of Timbre , 2009, ICMC.

[37]  Diemo Schwarz,et al.  REAL-TIME CORPUS-BASED CONCATENATIVE SYNTHESIS WITH CATART , 2006 .

[38]  Charles Bascou,et al.  GMU , A FLEXIBLE GRANULAR SYNTHESIS ENVIRONMENT IN MAX / MSP , 2005 .