Learning Restricted Models of Arithmetic Circuits

We present a polynomial time algorithm for learning a large class of algebraic models of computation. We show that any arithmetic circuit whose partial derivatives in- duce a low-dimensional vector space is exactly learnable from membership and equivalence queries. As a consequence, we obtain polynomial-time algorithms for learning restricted algebraic branching programs as well as noncommutative set-multilinear arithmetic formu- lae. In addition, we observe that the algorithms of Bergadano et al. (1996) and Beimel et al. (2000) can be used to learn depth-3 set-multilinear arithmetic circuits. Previously only versions of depth-2 arithmetic circuits were known to be learnable in polynomial time. Our learning algorithms can be viewed as solving a generalization of the well known polyno- mial interpolation problemwhere the unknown polynomial has a succinct representation. We can learn representations of polynomials encoding exponentiallymany monomials. Our techniques combine a careful algebraic analysis of the partial derivatives of arithmetic cir- cuits with "multiplicity automata" learning algorithms due to Bergadano et al. (1997) and Beimel et al. (2000).

[1]  Noam Nisan,et al.  Lower bounds for non-commutative computation , 1991, STOC '91.

[2]  Nader H. Bshouty,et al.  On Learning width Two Branching Programs , 1998, Inf. Process. Lett..

[3]  電子情報通信学会 IEICE transactions on information and systems , 1992 .

[4]  Tadao Kasami,et al.  A Polynomial Time Learning Algorithm for Recognizable Series , 1994 .

[5]  Richard Zippel,et al.  Probabilistic algorithms for sparse polynomials , 1979, EUROSAM.

[6]  D. Angluin Queries and Concept Learning , 1988 .

[7]  Noam Nisan,et al.  Constant depth circuits, Fourier transform, and learnability , 1993, JACM.

[8]  Linda Sellie,et al.  Learning sparse multivariate polynomials over a field with queries and counterexamples , 1993, COLT '93.

[9]  Ran Raz,et al.  Deterministic polynomial identity testing in non-commutative models , 2004, Proceedings. 19th IEEE Annual Conference on Computational Complexity, 2004..

[10]  Ming-Deh A. Huang,et al.  Interpolation of sparse multivariate polynomials over large finite fields with applications , 1996, SODA '96.

[11]  Francesco Bergadano,et al.  Learning Behaviors of Automata from Multiplicity and Equivalence Queries , 1994, SIAM J. Comput..

[12]  Johan Håstad,et al.  Almost optimal lower bounds for small depth circuits , 1986, STOC '86.

[13]  Avi Wigderson,et al.  Depth-3 arithmetic circuits over fields of characteristic zero , 2002, computational complexity.

[14]  Richard M. Wilson,et al.  A course in combinatorics , 1992 .

[15]  Nader H. Bshouty,et al.  On Interpolating Arithmetic Read-Once Formulas with Exponentiation , 1998, J. Comput. Syst. Sci..

[16]  Ran Raz,et al.  Multi-linear formulas for permanent and determinant are of super-polynomial size , 2004, STOC '04.

[17]  Yishay Mansour,et al.  Simple learning algorithms for decision trees and multivariate polynomials , 1995, Proceedings of IEEE 36th Annual Foundations of Computer Science.

[18]  Eyal Kushilevitz,et al.  Learning functions represented as multiplicity automata , 2000, JACM.

[19]  Ran Raz,et al.  Separation of Multilinear Circuit and Formula Size , 2006, Theory Comput..

[20]  Noam Nisan,et al.  Constant depth circuits, Fourier transform, and learnability , 1989, 30th Annual Symposium on Foundations of Computer Science.

[21]  James Aspnes,et al.  The expressive power of voting polynomials , 1991, STOC '91.

[22]  Adam R. Klivans,et al.  Learning Arithmetic Circuits via Partial Derivatives , 2003, COLT.

[23]  Noam Nisan,et al.  Lower bounds on arithmetic circuits via partial derivatives , 2005, computational complexity.

[24]  Marek Karpinski,et al.  An exponential lower bound for depth 3 arithmetic circuits , 1998, STOC '98.

[25]  Lisa Hellerstein,et al.  Learning Arithmetic Read-Once Formulas , 1995, SIAM J. Comput..

[26]  Lisa Hellerstein,et al.  Learning Boolean Read-Once Formulas over Generalized Bases , 1995, J. Comput. Syst. Sci..

[27]  Nader H. Bshouty,et al.  Learning Multivariate Polynomials from Substitution and Equivalence Queries , 1996, Electron. Colloquium Comput. Complex..

[28]  Yishay Mansour,et al.  Simple Learning Algorithms for Decision Trees and Multivariate Polynomials , 2002, SIAM J. Comput..

[29]  Jack W. Carlyle,et al.  Realizations by Stochastic Finite Automata , 1971, J. Comput. Syst. Sci..

[30]  Christino Tamon,et al.  On Learning Branching Programs and Small Depth Circuits , 1996, Electron. Colloquium Comput. Complex..

[31]  Nader H. Bshouty,et al.  On Learning Programs and Small Depth Circuits , 1997, EuroCOLT.

[32]  Alexander A. Razborov,et al.  Exponential complexity lower bounds for depth 3 arithmetic circuits in algebras of functions over finite fields , 1998, Proceedings 39th Annual Symposium on Foundations of Computer Science (Cat. No.98CB36280).

[33]  Luca Trevisan,et al.  Pseudorandom generators without the XOR lemma , 1999, Proceedings. Fourteenth Annual IEEE Conference on Computational Complexity (Formerly: Structure in Complexity Theory Conference) (Cat.No.99CB36317).

[34]  Noam Nisan,et al.  Lower bounds on arithmetic circuits via partial derivatives , 1995, Proceedings of IEEE 36th Annual Foundations of Computer Science.

[35]  Lisa Hellerstein,et al.  Learning read-once formulas over fields and extended bases , 1991, COLT '91.

[36]  Ran Raz,et al.  Deterministic polynomial identity testing in non-commutative models , 2004, Proceedings. 19th IEEE Annual Conference on Computational Complexity, 2004..

[37]  Marek Karpinski,et al.  Computational Complexity of Sparse Rational Interpolation , 1994, SIAM J. Comput..

[38]  Carsten Lund,et al.  Proof verification and the hardness of approximation problems , 1998, JACM.

[39]  Daniel A. Spielman,et al.  Randomness efficient identity testing of multivariate polynomials , 2001, STOC '01.

[40]  Jacob T. Schwartz,et al.  Fast Probabilistic Algorithms for Verification of Polynomial Identities , 1980, J. ACM.