Classical-To-Quantum Transfer Learning for Spoken Command Recognition Based on Quantum Neural Networks

This work extends transfer learning, as used in classical machine learning, to an emerging hybrid end-to-end quantum neural network (QNN) for spoken command recognition (SCR). Our QNN-based SCR system has classical and quantum components: (1) the classical part relies mainly on a 1D convolutional neural network (CNN) to extract speech features; (2) the quantum part is built on a variational quantum circuit (VQC) with a few learnable parameters. Because training the hybrid end-to-end QNN from scratch on a noisy intermediate-scale quantum (NISQ) device is inefficient, we put forth a hybrid transfer learning algorithm that transfers a pre-trained classical network to the classical part of the hybrid QNN model. The pre-trained classical network is then modified and augmented by joint fine-tuning with the VQC. The hybrid transfer learning methodology is particularly attractive for QNN-based SCR because low-dimensional classical features are expected to be encoded into quantum states. We evaluate the hybrid transfer learning algorithm for the hybrid QNN on the Google Speech Commands dataset, and our classical simulation results suggest that hybrid transfer learning can improve on our baseline performance on the SCR task.
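As a rough illustration of the quantum part described above, the following pure-Python sketch simulates a small variational circuit on a classical statevector. The gate layout (RY angle encoding of the classical features, a chain of CNOT gates, trainable RY rotations, and Pauli-Z readout) and all function names are assumptions chosen for illustration; the abstract does not specify the circuit used in the paper.

```python
import math

def apply_ry(state, qubit, theta, n_qubits):
    """Apply RY(theta) to one qubit of a real-valued statevector."""
    c, s = math.cos(theta / 2), math.sin(theta / 2)
    bit = 1 << (n_qubits - 1 - qubit)  # qubit 0 is the most significant bit
    new = list(state)
    for i in range(len(state)):
        if i & bit == 0:
            j = i | bit
            new[i] = c * state[i] - s * state[j]
            new[j] = s * state[i] + c * state[j]
    return new

def apply_cnot(state, control, target, n_qubits):
    """Flip the target qubit wherever the control qubit is 1."""
    cbit = 1 << (n_qubits - 1 - control)
    tbit = 1 << (n_qubits - 1 - target)
    new = list(state)
    for i in range(len(state)):
        if (i & cbit) and not (i & tbit):
            j = i | tbit
            new[i], new[j] = state[j], state[i]
    return new

def expval_z(state, qubit, n_qubits):
    """Expectation value of Pauli-Z on one qubit."""
    bit = 1 << (n_qubits - 1 - qubit)
    return sum((-1.0 if i & bit else 1.0) * a * a for i, a in enumerate(state))

def vqc_forward(features, weights):
    """Hypothetical VQC: one qubit per classical feature.

    features: low-dimensional outputs of the classical network (angles).
    weights:  list of layers, each a list of one trainable angle per qubit.
    """
    n = len(features)
    state = [0.0] * (2 ** n)
    state[0] = 1.0                               # start in |0...0>
    for q, x in enumerate(features):             # angle encoding
        state = apply_ry(state, q, x, n)
    for layer in weights:                        # variational layers
        for q in range(n - 1):                   # entangle neighbouring qubits
            state = apply_cnot(state, q, q + 1, n)
        for q, w in enumerate(layer):            # learnable rotations
            state = apply_ry(state, q, w, n)
    return [expval_z(state, q, n) for q in range(n)]

if __name__ == "__main__":
    feats = [0.3, -1.2, 0.8, 0.1]                # stand-in for CNN features
    params = [[0.1, 0.2, 0.3, 0.4], [0.5, 0.6, 0.7, 0.8]]
    print(vqc_forward(feats, params))
```

In a hybrid transfer-learning setup of the kind the abstract describes, `features` would come from the pre-trained classical network and `weights` would be the few quantum parameters updated during joint fine-tuning. The simulation cost grows as 2^n in the number of qubits, which is one reason only low-dimensional features are practical to encode.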
