Quantum Tensor Networks, Stochastic Processes, and Weighted Automata

Modeling joint probability distributions over sequences has been studied from many perspectives. The physics community developed matrix product states, a tensor-train decomposition for probabilistic modeling, motivated by the need to tractably model many-body systems. Similar models have also been studied in the stochastic processes and weighted automata literatures, but little work has examined how these bodies of work relate to one another. We address this gap by showing that stationary, or uniform, versions of popular quantum tensor network models have equivalent representations in the stochastic processes and weighted automata literatures, in the limit of infinitely long sequences. We demonstrate several equivalence results between models used in these three communities: (i) uniform variants of matrix product states, Born machines and locally purified states from the quantum tensor networks literature, (ii) predictive state representations, hidden Markov models, norm-observable operator models and hidden quantum Markov models from the stochastic processes literature, and (iii) stochastic weighted automata, probabilistic automata and quadratic automata from the formal languages literature. Such connections may open the door for results and methods developed in one area to be applied in another.
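
As a rough illustration of the kind of correspondence described above, the following sketch writes a small hidden Markov model as one matrix per symbol, so that a sequence's probability is a product of matrices contracted with two boundary vectors. This is the form shared by uniform matrix product states and weighted automata. It also evaluates a Born-machine-style model, where the same contraction yields an amplitude that is squared and normalized. This is a sketch under stated assumptions and not the construction used in the paper: the parameter values, the function names (`hmm_operators`, `sequence_weight`, `born_probs`), the use of finite sequences with boundary vectors rather than the infinite-length limit, and the brute-force normalization are all illustrative choices.

```python
import itertools
import numpy as np

def hmm_operators(T, O):
    """One matrix per symbol x: A[x][i, j] = P(emit x | state i) * P(i -> j)."""
    return [np.diag(O[:, x]) @ T for x in range(O.shape[1])]

def sequence_weight(alpha, ops, omega, seq):
    """Contract alpha^T A[x1] ... A[xn] omega, as in a weighted automaton."""
    v = alpha
    for x in seq:
        v = v @ ops[x]
    return float(v @ omega)

def born_probs(alpha, ops, omega, length):
    """Born-machine-style model: probability proportional to squared amplitude.

    Normalization is done by brute force over all sequences of the given
    length, which is only feasible for tiny examples.
    """
    seqs = list(itertools.product(range(len(ops)), repeat=length))
    amps = np.array([sequence_weight(alpha, ops, omega, s) for s in seqs])
    sq = amps ** 2
    return dict(zip(seqs, sq / sq.sum()))

# Hypothetical 2-state, 2-symbol HMM (values chosen only for illustration).
T = np.array([[0.9, 0.1], [0.2, 0.8]])   # row-stochastic transition matrix
O = np.array([[0.7, 0.3], [0.1, 0.9]])   # O[i, x] = P(symbol x | state i)
pi = np.array([0.5, 0.5])                # initial state distribution
ops = hmm_operators(T, O)
ones = np.ones(2)

def hmm_prob(seq):
    """Probability of a sequence under the HMM, via its operator form."""
    return sequence_weight(pi, ops, ones, seq)

# Hypothetical real-valued Born machine with random 3x3 operators.
rng = np.random.default_rng(0)
born_ops = [rng.normal(size=(3, 3)) for _ in range(2)]
born = born_probs(rng.normal(size=3), born_ops, rng.normal(size=3), length=3)
```

Because the per-symbol matrices sum to the transition matrix, `hmm_prob` sums to one over all sequences of a fixed length without any explicit normalization, whereas the squared-amplitude model has to be normalized separately.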
