On learning Read-k-Satisfy-j DNF

We study the learnability of Read-<italic>k</italic>-Satisfy-<italic>j</italic> (RkSj) DNF formulae. These are DNF formulae in which the maximal number of occurrences of a variable is bounded by <italic>k</italic>, and the number of terms satisfied by any assignment is at most <italic>j</italic>. We show that this class of functions is learnable in polynomial time, using Equivalence and Membership Queries, as long as <italic>k•j</italic>=<italic>O</italic>(log<italic>n</italic>/loglog<italic>n</italic>). Learnability was previously known only in case that both <italic>k</italic> and <italic>j</italic> are constants. We also present a family of boolean functions that have short (<italic>poly(n)</italic>) Read-2-Satisfy-1 DNF formulae but require CNF formulae of size ><inline-equation> <f> 2<sup><g>W</g><fen lp="par"><rad><rcd><it>n</it></rcd></rad><rp post="par"></fen> </sup></f> </inline-equation>. Therefore, our result does not seem to follow from the recent learnability result of [Bsh93].

[1]  Leonard Pitt,et al.  Exact learning of read-k disjoint DNF and not-so-disjoint DNF , 1992, COLT '92.

[2]  Sleiman Matar Learning with a minimal number of queries , 1993 .

[3]  Hans Ulrich Simon A Tight Omega(loglog n)-Bound on the Time for Parallel Ram's to Compute Nondegenerated Boolean Functions , 1983, FCT.

[4]  Mona Singh,et al.  Learning functions of k terms , 1990, COLT '90.

[5]  Leslie G. Valiant,et al.  A theory of the learnable , 1984, CACM.

[6]  Thomas R. Hancock,et al.  Learning 2µ DNF Formulas and kµ Decision Trees , 1991, COLT.

[7]  Jeffrey C. Jackson,et al.  An efficient membership-query algorithm for learning DNF with respect to the uniform distribution , 1994, Proceedings 35th Annual Symposium on Foundations of Computer Science.

[8]  Vijay Raghavan,et al.  On the Limits of Proper Learnability of Subclasses of DNF Formulas , 1994, COLT '94.

[9]  E. Kushilevitz,et al.  On Learning Visual Concepts and DNF Formulae , 1996 .

[10]  Nader H. Bshouty,et al.  Exact learning via the Monotone theory , 1993, Proceedings of 1993 IEEE 34th Annual Foundations of Computer Science.

[11]  Dana Angluin,et al.  When won't membership queries help? , 1991, STOC '91.

[12]  Leslie G. Valiant,et al.  Computational limitations on learning from examples , 1988, JACM.

[13]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[14]  Marek Karpinski,et al.  Learning read-once formulas with queries , 1993, JACM.

[15]  Thomas R. Hancock,et al.  Learning 2u DNF formulas and ku decision trees , 1991, COLT 1991.

[16]  Roni Khardon On Using the Fourier Transform to Learn Disjoint DNF , 1994, Inf. Process. Lett..

[17]  H. Aizenstein,et al.  Exact learning of read-twice DNF formulas , 1991, [1991] Proceedings 32nd Annual Symposium of Foundations of Computer Science.

[18]  Vijay Raghavan,et al.  Read-Twice DNF Formulas are Properly Learnable , 1994, Inf. Comput..

[19]  Avrim Blum,et al.  Fast learning of k-term DNF formulas with queries , 1992, STOC '92.

[20]  Leonard Pitt,et al.  On the learnability of disjunctive normal form formulas , 2004, Machine Learning.

[21]  Lisa Hellerstein,et al.  Read-thrice DNF is hard to learn with membership and equivalence queries , 1992, Proceedings., 33rd Annual Symposium on Foundations of Computer Science.

[22]  Nader H. Bshouty Exact Learning Boolean Function via the Monotone Theory , 1995, Inf. Comput..

[23]  Leslie G. Valiant,et al.  On the learnability of Boolean formulae , 1987, STOC.

[24]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[25]  Eyal Kushilevitz,et al.  Learning decision trees using the Fourier spectrum , 1991, STOC '91.

[26]  Yishay Mansour,et al.  Weakly learning DNF and characterizing statistical query learning using Fourier analysis , 1994, STOC '94.