Hybrid optical-electronic convolutional neural networks with optimized diffractive optics for image classification

Convolutional neural networks (CNNs) excel in a wide variety of computer vision applications, but their high performance also comes at a high computational cost. Despite efforts to increase efficiency both algorithmically and with specialized hardware, it remains difficult to deploy CNNs in embedded systems due to tight power budgets. Here we explore a complementary strategy that incorporates a layer of optical computing prior to electronic computing, improving performance on image classification tasks while adding minimal electronic computational cost or processing time. We propose a design for an optical convolutional layer based on an optimized diffractive optical element and test our design in two simulations: a learned optical correlator and an optoelectronic two-layer CNN. We demonstrate in simulation and with an optical prototype that the classification accuracies of our optical systems rival those of the analogous electronic implementations, while providing substantial savings on computational cost.

[1]  J. Goodman Introduction to Fourier optics , 1969 .

[2]  Steve Serati,et al.  Optical correlator based target detection, recognition, classification, and tracking. , 2012, Applied optics.

[3]  Matthew O'Toole,et al.  Optical computing for fast light transport analysis , 2010, ACM Trans. Graph..

[4]  Federico Capasso,et al.  A broadband achromatic metalens for focusing and imaging in the visible , 2018, Nature Nanotechnology.

[5]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[6]  Lin Zhong,et al.  RedEye: Analog ConvNet Image Sensor Architecture for Continuous Mobile Vision , 2016, 2016 ACM/IEEE 43rd Annual International Symposium on Computer Architecture (ISCA).

[7]  Gordon Wetzstein,et al.  Photonic Multitasking Interleaved Si Nanoantenna Phased Array. , 2016, Nano letters.

[8]  Lin Yang,et al.  On-chip optical matrix-vector multiplier for parallel computation , 2013 .

[9]  P. Hanrahan,et al.  Light Field Photography with a Hand-held Plenoptic Camera , 2005 .

[10]  Michael R. Watts,et al.  Large-scale nanophotonic phased array , 2013, Nature.

[11]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[12]  Chein-I. Chang Hyperspectral Data Exploitation: Theory and Applications , 2007 .

[13]  Geoffrey L. Barrows,et al.  Wide-angle micro sensors for vision on a tight budget , 2011, CVPR 2011.

[14]  Ashok Veeraraghavan,et al.  ASP Vision: Optically Computing the First Layer of Convolutional Neural Networks Using Angle Sensitive Pixels , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Forrest N. Iandola,et al.  SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <1MB model size , 2016, ArXiv.

[16]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Laurent Larger,et al.  Reinforcement Learning in a large scale photonic Recurrent Neural Network , 2017, Optica.

[18]  Dirk Englund,et al.  Deep learning with coherent nanophotonic circuits , 2017, 2017 Fifth Berkeley Symposium on Energy Efficient Electronic Systems & Steep Transistors Workshop (E3S).

[19]  B Javidi,et al.  Optical implementation of neural networks for face recognition by the use of nonlinear joint transform correlators. , 1995, Applied optics.

[20]  D. Brady,et al.  Adaptive optical networks using photorefractive crystals. , 1988, Applied optics.

[21]  Yifan Peng,et al.  The diffractive achromat full spectrum computational imaging with diffractive optics , 2016, ACM Trans. Graph..

[22]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[23]  Alex Krizhevsky,et al.  Learning Multiple Layers of Features from Tiny Images , 2009 .

[24]  M Segev,et al.  Modulation instability and pattern formation in spatially incoherent light beams. , 2000, Science.

[25]  Song Han,et al.  Deep Compression: Compressing Deep Neural Network with Pruning, Trained Quantization and Huffman Coding , 2015, ICLR.

[26]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[27]  Gordon Wetzstein,et al.  Variable Aperture Light Field Photography: Overcoming the Diffraction-Limited Spatio-Angular Resolution Tradeoff , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  M. Segev,et al.  Photonic Floquet topological insulators , 2012, Nature.

[29]  D A Gregory,et al.  Real-time pattern recognition using a modified liquid crystal television in a coherent optical correlator. , 1986, Applied optics.

[30]  Lucien E. Weiss,et al.  Multicolour localization microscopy by point-spread-function engineering , 2016, Nature Photonics.

[31]  Shanhui Fan,et al.  Training of Photonic Neural Networks through In Situ Backpropagation , 2018, 2019 Conference on Lasers and Electro-Optics (CLEO).

[32]  David A. Patterson,et al.  In-datacenter performance analysis of a tensor processing unit , 2017, 2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA).

[33]  Chein-I Chang,et al.  Hyperspectral Data Exploitation , 2007 .

[34]  D Psaltis,et al.  Optical implementation of the Hopfield model. , 1985, Applied optics.

[35]  Pavan Chandra Konda,et al.  Learned sensing: jointly optimized microscope hardware for accurate image classification. , 2019, Biomedical optics express.

[36]  Shree K. Nayar,et al.  Optical Splitting Trees for High-Precision Monocular Imaging , 2007, IEEE Computer Graphics and Applications.

[37]  Yi Luo,et al.  All-optical machine learning using diffractive deep neural networks , 2018, Science.

[38]  E. FieslerIDIAP,et al.  Adaptive Multilayer Optical Neural Network with Optical Thresholding , 1995 .

[39]  F T Yu,et al.  Two-dimensional programmable optical neural network. , 1989, Applied optics.

[40]  Ralph Etienne-Cummings,et al.  Implementation of steerable spatiotemporal image filters on the focal plane , 2002 .

[41]  Yi Luo,et al.  Design of task-specific optical systems using broadband diffractive neural networks , 2019, Light, science & applications.

[42]  Frédo Durand,et al.  Image and depth from a conventional camera with a coded aperture , 2007, ACM Trans. Graph..

[43]  Anschrift Dr. Cornelia Denz Optical Neural Networks , 1998, Vieweg+Teubner Verlag.