Self-optimizing adaptive optics control with reinforcement learning

Current and future high-contrast imaging instruments require extreme Adaptive Optics (XAO) systems to reach contrasts necessary to directly image exoplanets. Telescope vibrations and the temporal error induced by the latency of the control loop limit the performance of these systems. Optimization of the (predictive) control algorithm is crucial in reducing these effects. We describe how model-free Reinforcement Learning can be used to optimize a Recurrent Neural Network controller for closed-loop adaptive optics control. We verify our proposed approach for tip-tilt control in simulations and a lab setup. The results show that this algorithm can effectively learn to suppress a combination of tip-tilt vibrations. Furthermore, we report decreased residuals for power-law input turbulence compared to an optimal gain integrator. Finally, we demonstrate that the controller can learn to identify the parameters of a varying vibration without requiring online updating of the control law. We conclude that Reinforcement Learning is a promising approach towards data-driven predictive control; future research will apply this approach to the control of high-order deformable mirrors.

[1]  Dmitry Savransky,et al.  Performance of the Gemini Planet Imager's adaptive optics system. , 2016, Applied optics.

[2]  R. Paschall,et al.  Linear quadratic Gaussian control of a deformable mirror adaptive optics system with time-delayed measurements. , 1993, Applied optics.

[3]  Jürgen Schmidhuber,et al.  Solving Deep Memory POMDPs with Recurrent Policy Gradients , 2007, ICANN.

[4]  Niek Doelman,et al.  Impact of time-variant turbulence behavior on prediction for adaptive optics systems. , 2019, Journal of the Optical Society of America. A, Optics, image science, and vision.

[5]  J. Milli,et al.  The wind-driven halo in high-contrast images I: analysis from the focal plane images of SPHERE , 2020, 2003.05794.

[6]  Andrew Serio,et al.  On-sky vibration environment for the Gemini Planet Imager and mitigation effort , 2014, Astronomical Telescopes and Instrumentation.

[7]  Eric Gendron,et al.  First on-sky SCAO validation of full LQG control with vibration mitigation on the CANARY pathfinder. , 2014, Optics express.

[8]  Sebastiaan Haffert,et al.  Nonlinear wavefront reconstruction with convolutional neural networks for Fourier-based wavefront sensors. , 2020, Optics express.

[9]  T. Fusco,et al.  SAXO, the eXtreme Adaptive Optics System of SPHERE: overview and calibration procedure , 2010, Astronomical Telescopes + Instrumentation.

[10]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[11]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[12]  P. Welch The use of fast Fourier transform for the estimation of power spectra: A method based on time averaging over short, modified periodograms , 1967 .

[13]  Suresh Sivanandam,et al.  Wavefront reconstruction and prediction with convolutional neural networks , 2018, Astronomical Telescopes + Instrumentation.

[14]  Thierry Fusco,et al.  First laboratory validation of vibration filtering with LQG control law for adaptive optics. , 2008, Optics express.

[15]  T. Fusco,et al.  SPHERE eXtreme AO control scheme: final performance assessment and on sky validation of the first auto-tuned LQG based operational system , 2014, Astronomical Telescopes and Instrumentation.

[17]  Marc Peter Deisenroth,et al.  Deep Reinforcement Learning: A Brief Survey , 2017, IEEE Signal Processing Magazine.

[18]  Eric Gendron,et al.  Astronomical adaptive optics. II. Experimental results of an optimized modal control. , 1995 .

[19]  Eric Gendron,et al.  Astronomical adaptive optics. I. Modal control optimization. , 1994 .

[20]  John E. Krist,et al.  The Vector Vortex Coronagraph: sensitivity to central obscuration, low-order aberrations, chromaticism, and polarization , 2010, Astronomical Telescopes + Instrumentation.

[21]  James P. Lloyd,et al.  Tip-Tilt Error in Lyot Coronagraphs , 2005 .

[22]  G Rousset,et al.  Modal prediction for closed-loop adaptive optics. , 1997, Optics letters.

[23]  Marc Ferrari,et al.  Local ensemble transform Kalman filter, a fast non-stationary control law for adaptive optics on ELTs: theoretical aspects and first simulation results. , 2014, Optics express.

[24]  Jean-Pierre Véran,et al.  Advanced vibration suppression algorithms in adaptive optics systems. , 2012, Journal of the Optical Society of America. A, Optics, image science, and vision.

[25]  Dimitri Mawet,et al.  Demonstrating predictive wavefront control with the Keck II near-infrared pyramid wavefront sensor , 2019, Optical Engineering + Applications.

[26]  Frantz Martinache,et al.  Characterizing vibrations at the Subaru Telescope for the Subaru coronagraphic extreme adaptive optics instrument , 2018 .

[27]  David S. Doelman,et al.  High Contrast Imaging for Python (HCIPy): an open-source adaptive optics and coronagraph simulator , 2018, Astronomical Telescopes + Instrumentation.

[28]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[29]  Olivier Guyon,et al.  Adaptive Optics Predictive Control with Empirical Orthogonal Functions (EOFs) , 2017, 1707.00570.

[30]  He Sun,et al.  Identification and adaptive control of a high-contrast focal plane wavefront correction system , 2018 .

[31]  Guy Lever,et al.  Deterministic Policy Gradient Algorithms , 2014, ICML.

[32]  Christoph U. Keller,et al.  Optimization of contrast in adaptive optics for exoplanet imaging , 2018, Astronomical Telescopes + Instrumentation.

[33]  Alison P. Wong,et al.  An all-photonic focal-plane wavefront sensor , 2020, Nature Communications.

[34]  M. Kenworthy,et al.  Robustness of prediction for extreme adaptive optics systems under various observing conditions , 2020, 2003.10225.

[35]  David Silver,et al.  Memory-based control with recurrent neural networks , 2015, ArXiv.

[36]  Jean-Marc Conan,et al.  On the optimal reconstruction and control of adaptive optical systems with mirror dynamics. , 2010, Journal of the Optical Society of America. A, Optics, image science, and vision.

[37]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.