Wave physics as an analog recurrent neural network

Analog machine learning computations are performed passively by propagating light and sound waves through programmed materials. Analog machine learning hardware platforms promise to be faster and more energy efficient than their digital counterparts. Wave physics, as found in acoustics and optics, is a natural candidate for building analog processors for time-varying signals. Here, we identify a mapping between the dynamics of wave physics and the computation in recurrent neural networks. This mapping indicates that physical wave systems can be trained to learn complex features in temporal data, using standard training techniques for neural networks. As a demonstration, we show that an inverse-designed inhomogeneous medium can perform vowel classification on raw audio signals as their waveforms scatter and propagate through it, achieving performance comparable to a standard digital implementation of a recurrent neural network. These findings pave the way for a new class of analog machine learning platforms, capable of fast and efficient processing of information in its native domain.

[1]  Joni Dambre,et al.  Trainable hardware for dynamical computing using error backpropagation through physical media , 2014, Nature Communications.

[2]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[3]  Yi Luo,et al.  All-optical machine learning using diffractive deep neural networks , 2018, Science.

[4]  Luca Antiga,et al.  Automatic differentiation in PyTorch , 2017 .

[5]  Andrea Alù,et al.  Nonlocal Metasurfaces for Optical Signal Processing. , 2018, Physical review letters.

[6]  Shanhui Fan,et al.  A Photonic Crystal Slab Laplace Differentiator , 2017, 2018 Conference on Lasers and Electro-Optics (CLEO).

[7]  Bhavin J. Shastri,et al.  Neuromorphic photonic networks using silicon photonic weight banks , 2016, Scientific Reports.

[8]  Andrea Alù,et al.  Performing Mathematical Operations with Metamaterials , 2014, Science.

[9]  J. Viefhaus,et al.  Attosecond time–energy structure of X-ray free-electron laser pulses , 2018 .

[10]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[11]  Steve Madden,et al.  Supercontinuum generation in dispersion engineered highly nonlinear (gamma = 10 /W/m) As2S3) chalcogenide planar waveguide. , 2008, Optics express.

[12]  Ellen Zhou,et al.  Neuromorphic photonic networks using silicon photonic weight banks , 2017, Scientific Reports.

[13]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[14]  Jelena Vucković,et al.  Inverse design in nanophotonics , 2018, Nature Photonics.

[15]  Les E. Atlas,et al.  Recurrent neural networks and robust time series prediction , 1994, IEEE Trans. Neural Networks.

[16]  G. Trahey,et al.  A heterogeneous nonlinear attenuating full- wave model of ultrasound , 2009, IEEE Transactions on Ultrasonics, Ferroelectrics and Frequency Control.

[17]  Damien Querlioz,et al.  Vowel recognition with four coupled spin-torque nano-oscillators , 2017, Nature.

[18]  Alan Richardson,et al.  Seismic Full-Waveform Inversion Using Deep Learning Tools and Techniques , 2018, 1801.07232.

[19]  Shanhui Fan,et al.  Training of Photonic Neural Networks through In Situ Backpropagation , 2018, 2019 Conference on Lasers and Electro-Optics (CLEO).

[20]  Peter Stagge,et al.  Recurrent neural networks for time series classification , 2003, Neurocomputing.

[21]  Shanhui Fan,et al.  Photonic crystal slab Laplace operator for image differentiation , 2018 .

[22]  J. Hillenbrand,et al.  Acoustic characteristics of American English vowels. , 1994, The Journal of the Acoustical Society of America.

[23]  Ole Sigmund,et al.  Experimental validation of systematically designed acoustic hyperbolic meta material slab exhibiting negative refraction , 2016 .

[24]  Ole Sigmund,et al.  Design of robust and efficient photonic switches using topology optimization , 2012 .

[25]  Steven G. Johnson,et al.  The failure of perfectly matched layers, and towards their redemption by adiabatic absorbers. , 2008, Optics express.

[26]  M. Withford,et al.  Low-dimensional nanomaterial saturable absorbers for ultrashort-pulsed waveguide lasers , 2018, Optical Materials Express.

[27]  Dirk Englund,et al.  Deep learning with coherent nanophotonic circuits , 2017, 2017 Fifth Berkeley Symposium on Energy Efficient Electronic Systems & Steep Transistors Workshop (E3S).

[28]  Yann LeCun,et al.  Tunable Efficient Unitary Neural Networks (EUNN) and their application to RNNs , 2016, ICML.

[29]  N. Engheta,et al.  Inverse-designed metastructures that solve equations , 2019, Science.

[30]  Michael I. Jordan Serial Order: A Parallel Distributed Processing Approach , 1997 .

[31]  Geoffrey Zweig,et al.  Recurrent neural networks for language understanding , 2013, INTERSPEECH.

[32]  Abdoulaye Ba,et al.  Soft porous silicone rubbers with ultra-low sound speeds in acoustic metamaterials , 2017, Scientific Reports.

[33]  Ian A. D. Williamson,et al.  Training of Photonic Neural Networks through In Situ Backpropagation , 2019 .

[34]  Loris,et al.  Numerical demonstration of neuromorphic computing with photonic crystal cavities , 2018 .

[35]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[36]  F. Ursell,et al.  The long-wave paradox in the theory of gravity waves , 1953, Mathematical Proceedings of the Cambridge Philosophical Society.

[37]  Georg Dorffner,et al.  Neural Networks for Time Series Processing , 1996 .

[38]  Lei Ying,et al.  Nanophotonic media for artificial neural inference , 2018, Photonics Research.

[39]  Jeffrey L. Elman,et al.  Finding Structure in Time , 1990, Cogn. Sci..

[40]  Yoshua Bengio,et al.  Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.

[41]  Jacob biamonte,et al.  Quantum machine learning , 2016, Nature.

[42]  Sae Woo Nam,et al.  Superconducting optoelectronic circuits for neuromorphic computing , 2016, ArXiv.

[43]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.