Latent force models for sound: Learning modal synthesis parameters and excitation functions from audio recordings

Latent force models are a Bayesian learning technique that combine physical knowledge with dimensionality reduction — sets of coupled differential equations are modelled via shared dependence on a low-dimensional latent space. Analogously, modal sound synthesis is a technique that links physical knowledge about the vibration of objects to acoustic phenomena that can be observed in data. We apply latent force modelling to sinusoidal models of audio recordings, simultaneously inferring modal synthesis parameters (stiffness and damping) and the excitation or contact force required to reproduce the behaviour of the observed vibrational modes. Exposing this latent excitation function to the user constitutes a controllable synthesis method that runs in real time and enables sound morphing through interpolation of learnt parameters.

[1]  Jean Marie Adrien,et al.  Dynamic Modeling of Vibrating Structures for Sound Synthesis, Modal Synthesis , 1989 .

[2]  Perry R. Cook,et al.  Real Sound Synthesis for Interactive Applications , 2002 .

[3]  Smith,et al.  Physical audio signal processing : for virtual musical instruments and audio effects , 2010 .

[4]  Ming C. Lin,et al.  Example-guided physically based modal sound synthesis , 2013, ACM Trans. Graph..

[5]  Xavier Rodet,et al.  Musical Instrument Sound Morphing Guided by Perceptually Motivated Features , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[6]  Simo Särkkä,et al.  Sequential Inference for Latent Force Models , 2011, UAI.

[7]  Perry R. Cook,et al.  Physically Informed Sonic Modeling (PhISM): Synthesis of percussive sounds , 1997 .

[8]  Neil D. Lawrence,et al.  Latent Force Models , 2009, AISTATS.

[9]  Richard Kronland-Martinet,et al.  The simulation of piano string vibration: from physical models to finite difference schemes and digital waveguides. , 2003, The Journal of the Acoustical Society of America.

[10]  L. Trautmann,et al.  Digital sound synthesis based on transfer function models , 1999, Proceedings of the 1999 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. WASPAA'99 (Cat. No.99TH8452).

[11]  Jouni Hartikainen,et al.  Kalman filtering and smoothing solutions to temporal Gaussian process regression models , 2010, 2010 IEEE International Workshop on Machine Learning for Signal Processing.

[12]  Michael Klingbeil,et al.  Software for spectral Analysis, Editing, and synthesis , 2005, ICMC.

[13]  Simo Särkkä,et al.  State-Space Inference for Non-Linear Latent Force Models with Application to Satellite Orbit Prediction , 2012, ICML.

[14]  Thomas F. Quatieri,et al.  Speech analysis/Synthesis based on a sinusoidal representation , 1986, IEEE Trans. Acoust. Speech Signal Process..

[15]  Jordi Bonada,et al.  Performance Control Driven Violin Timbre Model Based on Neural Networks , 2012, IEEE Transactions on Audio, Speech, and Language Processing.