Quantum probability-inspired graph neural network for document representation and classification

Abstract Recent studies have found that text can be represented in Hilbert space through a neural network driven by quantum probability, which provides a unified representation of texts with different granularities without losing the performance of downstream tasks. However, these quantum probability-inspired methods only focus on intra-document semantics and lack modeling global structural information. In this paper, we explore the potential of combining quantum probability with graph neural network, and propose a quantum probability-inspired graph neural network model to capture global structural information of interaction between documents for document representation and classification. We build document interaction graph for a given corpus based on document word relation and frequency information, then learn a graph neural network driven by quantum probability on the defined graph. First, the proposed model represents each document node in the graph as a superposition state in a Hilbert space. Then the proposed model further computes density matrix representations for nodes to encode document interaction as mixed states. Finally, the model computes classification probability by performing quantum measurement on the mixed states. Experiments on four document classification benchmarks show that the proposed model outperforms a variety of classical neural network models and the previous quantum probability-inspired model with much smaller parameter size. Extended analyses also demonstrate the robustness of the proposed model with limited training data and its ability to learn semantically distinguishable document representation.

[1]  Massimo Melucci,et al.  Semantic Hilbert Space for Text Representation Learning , 2019, WWW.

[2]  Guoyin Wang,et al.  Baseline Needs More Love: On Simple Word-Embedding-Based Models and Associated Pooling Mechanisms , 2018, ACL.

[3]  Joan Bruna,et al.  Deep Convolutional Networks on Graph-Structured Data , 2015, ArXiv.

[4]  Qiaozhu Mei,et al.  PTE: Predictive Text Embedding through Large-scale Heterogeneous Text Networks , 2015, KDD.

[5]  Diederik Aerts,et al.  Concepts and Their Dynamics: A Quantum-Theoretic Modeling of Human Thought , 2012, Top. Cogn. Sci..

[6]  Pengfei Liu,et al.  Heterogeneous Graph Neural Networks for Extractive Document Summarization , 2020, ACL.

[7]  Yidong Chen,et al.  Lattice-to-sequence attentional Neural Machine Translation models , 2018, Neurocomputing.

[8]  Yoshua Bengio,et al.  Learning Concept Embeddings for Query Expansion by Quantum Entropy Minimization , 2014, AAAI.

[9]  Yoshua Bengio,et al.  Modeling term dependencies with quantum language models for IR , 2013, SIGIR.

[10]  Dawei Song,et al.  Exploration of Quantum Interference in Document Relevance Judgement Discrepancy , 2016, Entropy.

[11]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[12]  Tomas Mikolov,et al.  Bag of Tricks for Efficient Text Classification , 2016, EACL.

[13]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[14]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[15]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[16]  Yoon Kim,et al.  Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[17]  Guido Zuccon,et al.  Using the Quantum Probability Ranking Principle to Rank Interdependent Documents , 2010, ECIR.

[18]  Yidong Chen,et al.  Sentiment analysis via integrating distributed representations of variable-length word sequence , 2016, Neurocomputing.

[19]  J. Busemeyer,et al.  Quantum cognition: a new theoretical approach to psychology , 2015, Trends in Cognitive Sciences.

[20]  Dawei Song,et al.  End-to-End Quantum-like Language Models with Application to Question Answering , 2018, AAAI.

[21]  A. Gleason Measures on the Closed Subspaces of a Hilbert Space , 1957 .

[22]  Wanxiang Che,et al.  Document Modeling with Graph Attention Networks for Multi-grained Machine Reading Comprehension , 2020, ACL.

[23]  Changpeng Shao Quantum speedup of training radial basis function networks , 2019, Quantum Inf. Comput..

[24]  Jack Hidary,et al.  Quantum Graph Neural Networks , 2019, ArXiv.

[25]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[26]  Xavier Bresson,et al.  Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering , 2016, NIPS.

[27]  Dawei Song,et al.  A Novel Re-ranking Approach Inspired by Quantum Measurement , 2011, ECIR.

[28]  Joan Bruna,et al.  Spectral Networks and Locally Connected Networks on Graphs , 2013, ICLR.

[29]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[30]  Rahul Goel,et al.  Online Embedding Compression for Text Classification using Low Rank Matrix Factorization , 2018, AAAI.

[31]  Diederik Aerts,et al.  New fundamental evidence of non-classical structure in the combination of natural concepts , 2015, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[32]  Razvan Pascanu,et al.  Relational inductive biases, deep learning, and graph networks , 2018, ArXiv.

[33]  Yuexian Hou,et al.  Exploring Relevance Judgement Inspired by Quantum Weak Measurement , 2018, AAAI.

[34]  YaoChong Li,et al.  A quantum deep convolutional neural network for image recognition , 2020, Quantum Science and Technology.

[35]  C. J. van Rijsbergen,et al.  What can quantum theory bring to information retrieval , 2010, CIKM.

[36]  Yuan Luo,et al.  Graph Convolutional Networks for Text Classification , 2018, AAAI.

[37]  Fabio Tamburini,et al.  Towards Quantum Language Models , 2017, EMNLP.

[38]  Jianxin Li,et al.  Large-Scale Hierarchical Text Classification with Recursively Regularized Deep Graph-CNN , 2018, WWW.

[39]  Peng Zhang,et al.  A quantum-inspired multimodal sentiment analysis framework , 2018, Theor. Comput. Sci..

[40]  Guoyin Wang,et al.  Joint Embedding of Words and Labels for Text Classification , 2018, ACL.

[41]  Dongdong Chen,et al.  Quantum-based subgraph convolutional neural networks , 2019, Pattern Recognit..

[42]  Diederik Aerts,et al.  Quantum-Theoretic Modeling in Computer Science , 2019, International Journal of Theoretical Physics.

[43]  Elham Kashefi,et al.  A Quantum-Theoretic Approach to Distributional Semantics , 2013, NAACL.

[44]  Dawei Song,et al.  Quantum-Inspired Interactive Networks for Conversational Sentiment Analysis , 2019, IJCAI.

[45]  J. Neumann Mathematical Foundations of Quantum Mechanics , 1955 .

[46]  Max Welling,et al.  Semi-Supervised Classification with Graph Convolutional Networks , 2016, ICLR.

[47]  Massimo Melucci,et al.  CNM: An Interpretable Complex-valued Network for Matching , 2019, NAACL.

[48]  Dawei Song,et al.  Modeling Multi-query Retrieval Tasks Using Density Matrix Transformation , 2015, SIGIR.

[49]  Pietro Liò,et al.  Graph Attention Networks , 2017, ICLR.