From Probabilistic Graphical Models to Generalized Tensor Networks for Supervised Learning

Tensor networks have found a wide use in a variety of applications in physics and computer science, recently leading to both theoretical insights as well as practical algorithms in machine learning. In this work we explore the connection between tensor networks and probabilistic graphical models, and show that it motivates the definition of generalized tensor networks where information from a tensor can be copied and reused in other parts of the network. We discuss the relationship between generalized tensor network architectures used in quantum physics, such as string-bond states, and architectures commonly used in machine learning. We provide an algorithm to train these networks in a supervised-learning context and show that they overcome the limitations of regular tensor networks in higher dimensions, while keeping the computation efficient. A method to combine neural networks and tensor networks as part of a common deep learning architecture is also introduced. We benchmark our algorithm for several generalized tensor network architectures on the task of classifying images and sounds, and show that they outperform previously introduced tensor-network algorithms. The models we consider also have a natural implementation on a quantum computer and may guide the development of near-term quantum machine learning architectures.

[1]  Geoffrey E. Hinton,et al.  A Learning Algorithm for Boltzmann Machines , 1985, Cogn. Sci..

[2]  Paul Smolensky,et al.  Information processing in dynamical systems: foundations of harmony theory , 1986 .

[3]  Lawrence D. Jackel,et al.  Backpropagation Applied to Handwritten Zip Code Recognition , 1989, Neural Computation.

[4]  A. Gendiar,et al.  Latent heat calculation of the three-dimensional q=3, 4, and 5 Potts models by the tensor product variational approach. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[5]  Geoffrey E. Hinton Training Products of Experts by Minimizing Contrastive Divergence , 2002, Neural Computation.

[6]  Brendan J. Frey,et al.  Extending Factor Graphs so as to Unify Directed and Undirected Graphical Models , 2002, UAI.

[7]  Philipp Slusallek,et al.  Introduction to real-time ray tracing , 2005, SIGGRAPH Courses.

[8]  F. Verstraete,et al.  Matrix product states, projected entangled pair states, and variational renormalization group methods for quantum spin systems , 2008, 0907.2796.

[9]  Yoshua Bengio,et al.  Classification using discriminative restricted Boltzmann machines , 2008, ICML '08.

[10]  Norbert Schuch,et al.  Simulation of quantum many-body systems with strings of operators and Monte Carlo tensor contractions. , 2008, Physical review letters.

[11]  J. Ignacio Cirac,et al.  Ground-state properties of quantum many-body systems: entangled-plaquette states and variational Monte Carlo , 2009, 0905.3898.

[12]  Garnet Kin-Lic Chan,et al.  Approximating strongly correlated wave functions with correlator product states , 2009, 0907.4646.

[13]  Nir Friedman,et al.  Probabilistic Graphical Models: Principles and Techniques - Adaptive Computation and Machine Learning , 2009 .

[14]  Jacob D. Biamonte,et al.  Categorical Tensor Network States , 2010, ArXiv.

[15]  J. Cirac,et al.  Simulating two- and three-dimensional frustrated quantum systems with string-bond states , 2009, 0908.4036.

[16]  Ivan Oseledets,et al.  Tensor-Train Decomposition , 2011, SIAM J. Sci. Comput..

[17]  Razvan Pascanu,et al.  Learning Algorithms for the Classification Restricted Boltzmann Machine , 2012, J. Mach. Learn. Res..

[18]  Roman Orus,et al.  A Practical Introduction to Tensor Networks: Matrix Product States and Projected Entangled Pair States , 2013, 1306.2164.

[19]  Justin Salamon,et al.  A Dataset and Taxonomy for Urban Sound Research , 2014, ACM Multimedia.

[20]  J Eisert,et al.  Matrix-product operators and states: NP-hardness and undecidability. , 2014, Physical review letters.

[21]  Andrew Critch,et al.  Algebraic Geometry of Matrix Product States , 2012, 1210.2812.

[22]  Anima Anandkumar,et al.  Tensor decompositions for learning latent variable models , 2012, J. Mach. Learn. Res..

[23]  E. Farhi,et al.  A Quantum Approximate Optimization Algorithm , 2014, 1411.4028.

[24]  Karol J. Piczak Environmental sound classification with convolutional neural networks , 2015, 2015 IEEE 25th International Workshop on Machine Learning for Signal Processing (MLSP).

[25]  Nadav Cohen,et al.  On the Expressive Power of Deep Learning: A Tensor Analysis , 2015, COLT 2016.

[26]  Alexander Novikov,et al.  Tensorizing Neural Networks , 2015, NIPS.

[27]  Andrzej Cichocki,et al.  Tensor Networks for Dimensionality Reduction and Large-scale Optimization: Part 1 Low-Rank Tensor Decompositions , 2016, Found. Trends Mach. Learn..

[28]  Naonori Ueda,et al.  Polynomial Networks and Factorization Machines: New Insights and Efficient Training Algorithms , 2016, ICML.

[29]  Amnon Shashua,et al.  Convolutional Rectifier Networks as Generalized Tensor Decompositions , 2016, ICML.

[30]  David J. Schwab,et al.  Supervised Learning with Tensor Networks , 2016, NIPS.

[31]  Alán Aspuru-Guzik,et al.  The theory of variational hybrid quantum-classical algorithms , 2015, 1509.04279.

[32]  Yiannis Vlassopoulos,et al.  Language as a matrix product state , 2017, ArXiv.

[33]  Masashi Sugiyama,et al.  Tensor Networks for Dimensionality Reduction and Large-scale Optimization: Part 2 Applications and Future Perspectives , 2017, Found. Trends Mach. Learn..

[34]  Ronen Tamari,et al.  Analysis and Design of Convolutional Networks via Hierarchical Tensor Decompositions , 2017, ArXiv.

[35]  Alexander Novikov,et al.  Exponential Machines , 2017, ICLR.

[36]  D. Deng,et al.  Quantum Entanglement in Neural Network States , 2017, 1701.04844.

[37]  Yiannis Vlassopoulos,et al.  Tensor network language model , 2017, ArXiv.

[38]  Yisong Yue,et al.  Long-term Forecasting using Higher Order Tensor RNNs , 2017 .

[39]  Amnon Shashua,et al.  Inductive Bias of Deep Convolutional Networks through Pooling Geometry , 2016, ICLR.

[40]  S. R. Clark,et al.  Unifying neural-network quantum states and correlator product states via tensor networks , 2017, 1710.03545.

[41]  Roland Vollgraf,et al.  Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms , 2017, ArXiv.

[42]  Hans-J. Briegel,et al.  Machine learning \& artificial intelligence in the quantum domain , 2017, ArXiv.

[43]  J. Chen,et al.  Equivalence of restricted Boltzmann machines and tensor network states , 2017, 1701.04831.

[44]  Keisuke Fujii,et al.  Quantum circuit learning , 2018, Physical Review A.

[45]  Simone Severini,et al.  Compact Neural Networks based on the Multiscale Entanglement Renormalization Ansatz , 2017, BMVC.

[46]  Hartmut Neven,et al.  Classification with Quantum Neural Networks on Near Term Processors , 2018, 1802.06002.

[47]  Jun Wang,et al.  Unsupervised Generative Modeling Using Matrix Product States , 2017, Physical Review X.

[48]  Simone Severini,et al.  Hierarchical quantum classifiers , 2018, npj Quantum Information.

[49]  Lei Wang,et al.  Differentiable Learning of Quantum Circuit Born Machine , 2018, Physical Review A.

[50]  Jing Chen,et al.  Information Perspective to Probabilistic Modeling: Boltzmann Machines versus Born Machines , 2017, Entropy.

[51]  Elina Robeva,et al.  Duality of Graphical Models and Tensor Networks , 2017, Information and Inference: A Journal of the IMA.

[52]  Amnon Shashua,et al.  Deep Learning and Quantum Entanglement: Fundamental Connections with Implications to Network Design , 2017, ICLR.

[53]  Xiao Zhang,et al.  Entanglement-Based Feature Extraction by Tensor Network Machine Learning , 2018, Frontiers in Applied Mathematics and Statistics.

[54]  J. Cirac,et al.  Neural-Network Quantum States, String-Bond States, and Chiral Topological States , 2017, 1710.04045.

[55]  Jens Eisert,et al.  Expressive power of tensor-network factorizations for probabilistic modeling, with applications from hidden Markov models to quantum machine learning , 2019, NeurIPS.

[56]  K. Birgitta Whaley,et al.  Towards quantum machine learning with tensor networks , 2018, Quantum Science and Technology.

[57]  Alejandro Perdomo-Ortiz,et al.  A generative modeling approach for benchmarking and training shallow quantum circuits , 2018, npj Quantum Information.

[58]  Maria Schuld,et al.  Quantum Machine Learning in Feature Hilbert Spaces. , 2018, Physical review letters.

[59]  Masashi Sugiyama,et al.  Learning Efficient Tensor Representations with Ring-structured Networks , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[60]  Gang Su,et al.  Machine learning by unitary tensor network of hierarchical tree structure , 2017, New Journal of Physics.

[61]  Amnon Shashua,et al.  Quantum Entanglement in Deep Learning Architectures. , 2018, Physical review letters.

[62]  Anima Anandkumar,et al.  Tensor Regression Networks , 2017, J. Mach. Learn. Res..

[63]  Ievgeniia Oshurko Quantum Machine Learning , 2020, Quantum Computing.