Real-Time Modeling of Audio Distortion Circuits with Deep Learning

This paper studies deep neural networks for modeling of audio distortion circuits. The selected approach is blackbox modeling, which estimates model parameters based on the measured input and output signals of the device. Three common audio distortion pedals having a different circuit configuration and their own distinctive sonic character have been chosen for this study: the Ibanez Tube Screamer, the Boss DS-1, and the Electro-Harmonix Big Muff Pi. A feedforward deep neural network, which is a variant of the WaveNet architecture, is proposed for modeling these devices. The size of the receptive field of the neural network is selected based on the measured impulseresponse length of the circuits. A real-time implementation of the deep neural network is presented, and it is shown that the trained models can be run in real time on a modern desktop computer. Furthermore, it is shown that three minutes of audio is a sufficient amount of data for training the models. The deep neural network studied in this work is useful for real-time virtual analog modeling of nonlinear audio circuits.

[1]  T.H. Crystal,et al.  Linear prediction of speech , 1977, Proceedings of the IEEE.

[2]  Jakob Abeßer,et al.  Automatic Tablature Transcription of Electric Guitar Recordings by Estimation of Score- and Instrument-Related Parameters , 2014, DAFx.

[3]  Michael J. Kemp Analysis and Simulation of Non-Linear Audio Processes Using Finite Impulse Responses Derived at Multiple Impulse Amplitudes , 1999 .

[4]  Julius O. Smith,et al.  Resolving Wave Digital Filters with Multiple/Multiport Nonlinearities , 2015 .

[5]  Vesa Välimäki,et al.  Emulation of Operational Amplifiers and Diodes in Audio Distortion Circuits , 2012, IEEE Transactions on Circuits and Systems II: Express Briefs.

[6]  Udo Zölzer,et al.  BLACK-BOX MODELING OF DISTORTION CIRCUITS WITH BLOCK-ORIENTED MODELS Conference , 2016 .

[7]  Daniel P. W. Ellis,et al.  Modeling nonlinear circuits with linearized dynamical models via kernel regression , 2013, 2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[8]  Lauri Juvela,et al.  Deep Learning for Tube Amplifier Emulation , 2018, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[9]  Augusto Sarti,et al.  An Improved and Generalized Diode Clipper Model for Wave Digital Filters , 2015 .

[10]  Pierrick Lotton,et al.  Nonlinear System Identification Using Exponential Swept-Sine Signal , 2010, IEEE Transactions on Instrumentation and Measurement.

[11]  Z Udo DISCRETE-TIME MODELS FOR NONLINEAR AUDIO SYSTEMS , 1999 .

[12]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[13]  Udo Zölzer,et al.  Gray-Box Modeling of Guitar Amplifiers , 2018 .

[14]  Jonathan S. Abel,et al.  SIMULATION OF THE DIODE LIMITER IN GUITAR DISTORTION CIRCUI TS BY NUMERICAL SOLUTION OF ORDINARY DIFFERENTIAL EQUATIONS , 2007 .

[15]  Erich Elsen,et al.  Efficient Neural Audio Synthesis , 2018, ICML.

[16]  Heiga Zen,et al.  WaveNet: A Generative Model for Raw Audio , 2016, SSW.

[17]  Thomas Hélie,et al.  Volterra Series and State Transformation for Real-Time Simulations of Audio Circuits Including Saturations: Application to the Moog Ladder Filter , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[18]  Jakob Abeßer,et al.  PARAMETRIC AUDIO CODING OF BASS GUITAR RECORDINGS USING A TUNED PHYSICAL MODELING ALGORITHM , 2013 .

[19]  Simone Orcioni,et al.  Identification of Volterra Models of Tube Audio Devices using Multiple-Variance Method , 2018, Journal of the Audio Engineering Society.

[20]  Julius O. Smith,et al.  Numerical Methods for Simulation of Guitar Distortion Circuits , 2008, Computer Music Journal.

[21]  Jonathan S. Abel,et al.  SIMPLIFIED, PHYSICALLY-INFORMED MODELS OF DISTORTION ANDOVERDRIVE GUITAR EFFECTS PEDALS , 2007 .

[22]  Angelo Farina,et al.  Simultaneous Measurement of Impulse Response and Distortion with a Swept-Sine Technique , 2000 .