Parametrized Quantum Policies for Reinforcement Learning

With the advent of real-world quantum computing, the idea that parametrized quantum computations can be used as hypothesis families in a quantum-classical machine learning system is gaining increasing traction. Such hybrid systems have already shown the potential to tackle real-world tasks in supervised and generative learning, and recent works have established their provable advantages in special artificial tasks. Yet, in the case of reinforcement learning, which is arguably most challenging and where learning boosts would be extremely valuable, no proposal has been successful in solving even standard benchmarking tasks, nor in showing a theoretical learning advantage over classical algorithms. In this work, we achieve both. We propose a hybrid quantum-classical reinforcement learning model using very few qubits, which we show can be effectively trained to solve several standard benchmarking environments. Moreover, we demonstrate, and formally prove, the ability of parametrized quantum circuits to solve certain learning tasks that are intractable for classical models, including current state-of-art deep neural networks, under the widely-believed classical hardness of the discrete logarithm problem.

[1]  Sofiène Jerbi,et al.  Quantum agents in the Gym: a variational quantum algorithm for deep Q-learning , 2021, Quantum.

[2]  A. Todri,et al.  Enabling multi-programming mechanism for quantum computing in the NISQ era , 2021, Quantum.

[3]  J. Latorre,et al.  One qubit as a universal approximant , 2021, Physical Review A.

[4]  H. Neven,et al.  Machine learning of high dimensional data on a noisy quantum processor , 2021, npj Quantum Information.

[5]  Sukin Sim,et al.  Noisy intermediate-scale quantum (NISQ) algorithms , 2021, Reviews of Modern Physics.

[6]  Sotiris Kotsiantis,et al.  Explainable AI: A Review of Machine Learning Interpretability Methods , 2020, Entropy.

[7]  Xiaoting Wang,et al.  Quantum reinforcement learning in continuous action space , 2020, ArXiv.

[8]  Keisuke Fujii,et al.  Qulacs: a fast and versatile quantum circuit simulator for research purpose , 2020, Quantum.

[9]  H. Neven,et al.  Power of data in quantum machine learning , 2020, Nature Communications.

[10]  K. Temme,et al.  A rigorous and robust quantum speed-up in supervised machine learning , 2020, Nature Physics.

[11]  Owen Lockwood,et al.  Reinforcement Learning with Quantum Variational Circuit , 2020, AIIDE.

[12]  Lea M. Trenkwalder,et al.  Quantum Enhancements for Deep Reinforcement Learning in Large Spaces , 2020 .

[13]  Kohei Nakajima,et al.  Universal Approximation Property of Quantum Machine Learning Models in Quantum-Enhanced Feature Spaces. , 2020, Physical review letters.

[14]  Maria Schuld,et al.  Effect of data encoding on the expressive power of variational quantum-machine-learning models , 2020, Physical Review A.

[15]  Jens Eisert,et al.  On the Quantum versus Classical Learnability of Discrete Distributions , 2020, Quantum.

[16]  David Von Dollen,et al.  TensorFlow Quantum: A Software Framework for Quantum Machine Learning , 2020, ArXiv.

[17]  Jakub W. Pachocki,et al.  Dota 2 with Large Scale Deep Reinforcement Learning , 2019, ArXiv.

[18]  John C. Platt,et al.  Quantum supremacy using a programmable superconducting processor , 2019, Nature.

[19]  Jiming Liu,et al.  Reinforcement Learning in Healthcare: A Survey , 2019, ACM Comput. Surv..

[20]  Jos'e I. Latorre,et al.  Data re-uploading for a universal quantum classifier , 2019, Quantum.

[21]  Chao-Han Huck Yang,et al.  Variational Quantum Circuits for Deep Reinforcement Learning , 2019, IEEE Access.

[22]  Marcello Benedetti,et al.  Parameterized quantum circuits as machine learning models , 2019, Quantum Science and Technology.

[23]  Travis S. Humble,et al.  Establishing the quantum supremacy frontier with a 281 Pflop/s simulation , 2019, Quantum Science and Technology.

[24]  D Zhu,et al.  Training of quantum circuits on a hybrid quantum computer , 2018, Science Advances.

[25]  C. Gogolin,et al.  Evaluating analytic gradients on quantum hardware , 2018, Physical Review A.

[26]  Kristan Temme,et al.  Supervised learning with quantum-enhanced feature spaces , 2018, Nature.

[27]  Lei Wang,et al.  Differentiable Learning of Quantum Circuit Born Machine , 2018, Physical Review A.

[28]  M. Schuld,et al.  Circuit-centric quantum classifiers , 2018, Physical Review A.

[29]  Raia Hadsell,et al.  Learning to Navigate in Cities Without a Map , 2018, NeurIPS.

[30]  Maria Schuld,et al.  Quantum Machine Learning in Feature Hilbert Spaces. , 2018, Physical review letters.

[31]  Enrique Solano,et al.  Measurement-based adaptation protocol with quantum reinforcement learning , 2018, Quantum Reports.

[32]  Keisuke Fujii,et al.  Quantum circuit learning , 2018, Physical Review A.

[33]  Hartmut Neven,et al.  Classification with Quantum Neural Networks on Near Term Processors , 2018, 1802.06002.

[34]  John Preskill,et al.  Quantum Computing in the NISQ era and beyond , 2018, Quantum.

[35]  Blake R. Johnson,et al.  Unsupervised Machine Learning on a Hybrid Quantum Computer , 2017, 1712.05771.

[36]  Amir Hussain,et al.  Applications of Deep Learning and Reinforcement Learning to Biological Data , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[37]  Vedran Dunjko,et al.  Exponential improvements for quantum-accessible reinforcement learning , 2017, 1710.11160.

[38]  Demis Hassabis,et al.  Mastering the game of Go without human knowledge , 2017, Nature.

[39]  David Von Dollen,et al.  Quantum-Enhanced Reinforcement Learning for Finite-Episode Games with Discrete State Spaces , 2017, Front. Phys..

[40]  J. Gambetta,et al.  Hardware-efficient variational quantum eigensolver for small molecules and quantum magnets , 2017, Nature.

[41]  Anna Levit,et al.  Reinforcement learning using quantum Boltzmann machines , 2016, Quantum Inf. Comput..

[42]  J. S. Oberoi,et al.  Quantum Reinforcement Learning , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[43]  Hans-J. Briegel,et al.  Quantum-enhanced machine learning , 2016, Physical review letters.

[44]  Pieter Abbeel,et al.  Benchmarking Deep Reinforcement Learning for Continuous Control , 2016, ICML.

[45]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[46]  Ronald de Wolf,et al.  Quantum Computing: Lecture Notes , 2015, ArXiv.

[47]  Vedran Dunjko,et al.  Quantum speedup for active learning agents , 2014, 1401.4997.

[48]  Jan Peters,et al.  Reinforcement learning in robotics: A survey , 2013, Int. J. Robotics Res..

[49]  Alán Aspuru-Guzik,et al.  A variational eigenvalue solver on a photonic quantum processor , 2013, Nature Communications.

[50]  Olivier Buffet,et al.  Policy‐Gradient Algorithms , 2013 .

[51]  M. W. Johnson,et al.  Quantum annealing with manufactured spins , 2011, Nature.

[52]  Thierry Paul,et al.  Quantum computation and quantum information , 2007, Mathematical Structures in Computer Science.

[53]  Peter L. Bartlett,et al.  Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning , 2001, J. Mach. Learn. Res..

[54]  Lov K. Grover A fast quantum mechanical algorithm for database search , 1996, STOC '96.

[55]  Peter W. Shor,et al.  Polynomial-Time Algorithms for Prime Factorization and Discrete Logarithms on a Quantum Computer , 1995, SIAM Rev..

[56]  Ronald J. Williams,et al.  Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.

[57]  Manuel Blum,et al.  How to generate cryptographically strong sequences of pseudo random bits , 1982, 23rd Annual Symposium on Foundations of Computer Science (sfcs 1982).

[58]  Jingling Li,et al.  Final Report: Expressive Power of Parametrized Quantum Circuits , 2019 .

[59]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.