Backpropagation through nonlinear units for all-optical training of neural networks

Backpropagation through nonlinear neurons is an outstanding challenge to the field of optical neural networks and the major conceptual barrier to all-optical training schemes. Each neuron is required to exhibit a directionally dependent response to propagating optical signals, with the backwards response conditioned on the forward signal, which is highly non-trivial to implement optically. We propose a practical and surprisingly simple solution that uses saturable absorption to provide the network nonlinearity. We find that the backward propagating gradients required to train the network can be approximated in a pump-probe scheme that requires only passive optical elements. Simulations show that, with readily obtainable optical depths, our approach can achieve equivalent performance to state-of-the-art computational networks on image classification benchmarks, even in deep networks with multiple sequential gradient approximations. This scheme is compatible with leading optical neural network proposals and therefore provides a feasible path towards end-to-end optical training.

[1]  Weiwen Zou,et al.  High-accuracy optical convolution unit architecture for convolutional neural networks by cascaded acousto-optical modulator arrays. , 2019, Optics express.

[2]  Zhenhua Ni,et al.  Atomic‐Layer Graphene as a Saturable Absorber for Ultrafast Pulsed Lasers , 2009, 0910.5820.

[3]  Yue Jiang,et al.  All-optical neural network with nonlinear activation functions , 2019, Optica.

[4]  George Cybenko,et al.  Approximation by superpositions of a sigmoidal function , 1989, Math. Control. Signals Syst..

[5]  Marco Cococcioni,et al.  Photonic Neural Networks: A Survey , 2019, IEEE Access.

[6]  Steven R. Skinner,et al.  Reinforcement and backpropagation training for an optical neural network using self-lensing effects , 2000, IEEE Trans. Neural Networks Learn. Syst..

[7]  Vladimir Ceperic,et al.  On-Chip Optical Convolutional Neural Networks , 2018, ArXiv.

[8]  Samuel S. Schoenholz,et al.  Neural Message Passing for Quantum Chemistry , 2017, ICML.

[9]  Shanhui Fan,et al.  Training of Photonic Neural Networks through In Situ Backpropagation , 2018, 2019 Conference on Lasers and Electro-Optics (CLEO).

[10]  K Wagner,et al.  Multilayer optical learning networks. , 1987, Applied optics.

[11]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[12]  Erik Cambria,et al.  Jumping NLP Curves: A Review of Natural Language Processing Research [Review Article] , 2014, IEEE Computational Intelligence Magazine.

[13]  D. Conkey,et al.  Atomic spectroscopy on a chip , 2007 .

[14]  Francis T. S. Yu,et al.  Overview of hybrid optical neural networks , 1996 .

[15]  Demetri Psaltis,et al.  Optical Neural Computers , 1987, Topical Meeting on Optical Computing.

[16]  D. Brady,et al.  Adaptive optical networks using photorefractive crystals. , 1988, Applied optics.

[17]  W. Pernice,et al.  Atomic vapor spectroscopy in integrated photonic structures , 2015, 1505.00611.

[18]  James C. Wyant,et al.  Two-Dimensional Matrix Multiplication using Coherent Optical Techniques , 1979 .

[19]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[20]  Laurent Larger,et al.  Reinforcement Learning in a large scale photonic Recurrent Neural Network , 2017, Optica.

[21]  Yi Luo,et al.  All-optical machine learning using diffractive deep neural networks , 2018, Science.

[22]  Matthias Troyer,et al.  Neural-network quantum state tomography , 2018 .

[23]  H. Tsang,et al.  In-Plane Optical Absorption and Free Carrier Absorption in Graphene-on-Silicon Waveguides , 2014, IEEE Journal of Selected Topics in Quantum Electronics.

[24]  Alex Lamb,et al.  Deep Learning for Classical Japanese Literature , 2018, ArXiv.