Lower Bound Methods and Separation Results for On-Line Learning Models

AbstractWe consider the complexity of concept learning in various common models for on-line learning, focusing on methods for proving lower bounds to the learning complexity of a concept class. Among others, we consider the model for learning with equivalence and membership queries. For this model we give lower bounds on the number of queries that are needed to learn a concept class $$\mathcal{C}$$ in terms of the Vapnik-Chervonenkis dimension of $$\mathcal{C}$$ , and in terms of the complexity of learning $$\mathcal{C}$$ with arbitrary equivalence queries. Furthermore, we survey other known lower bound methods and we exhibit all known relationships between learning complexities in the models considered and some relevant combinatorial parameters. As it turns out, the picture is almost complete. This paper has been written so that it can be read without previous knowledge of Computational Learning Theory.

[1]  Norbert Sauer,et al.  On the Density of Families of Sets , 1972, J. Comb. Theory A.

[2]  Michael Frazier,et al.  Learning conjunctions of Horn clauses , 2004, Machine Learning.

[3]  Dana Angluin,et al.  Learning Regular Sets from Queries and Counterexamples , 1987, Inf. Comput..

[4]  Leslie G. Valiant,et al.  Computational limitations on learning from examples , 1988, JACM.

[5]  H. Ishizaka Polynomial Time Learnability of Simple Deterministic Languages , 1990, Machine Learning.

[6]  Marek Karpinski,et al.  Learning read-once formulas with queries , 1993, JACM.

[7]  Eric B. Baum,et al.  Polynomial time algorithms for learning neural nets , 1990, Annual Conference Computational Learning Theory.

[8]  Piotr Berman,et al.  Learning one-counter languages in polynomial time , 1987, 28th Annual Symposium on Foundations of Computer Science (sfcs 1987).

[9]  David Haussler,et al.  Learnability and the Vapnik-Chervonenkis dimension , 1989, JACM.

[10]  György Turán,et al.  Sorting and Recognition Problems for Ordered Sets , 1985, SIAM J. Comput..

[11]  Dana Angluin,et al.  A Note on the Number of Queries Needed to Identify Regular Languages , 1981, Inf. Control..

[12]  L. Goddard Information Theory , 1962, Nature.

[13]  S. Shelah A combinatorial problem; stability and order for models and theories in infinitary languages. , 1972 .

[14]  Daniel J. Kleitman,et al.  Intersections ofk-element sets , 1981, Comb..

[15]  Dana Angluin,et al.  Queries and concept learning , 1988, Machine Learning.

[16]  A. A. Mullin,et al.  Principles of neurodynamics , 1962 .

[17]  Dana Angluin Negative results for equivalence queries , 1990, Mach. Learn..

[18]  Wolfgang Maass,et al.  On the complexity of learning from counterexamples , 1989, 30th Annual Symposium on Foundations of Computer Science.

[19]  J. Kahn,et al.  Balancing poset extensions , 1984 .

[20]  J. Spencer Probabilistic Methods in Combinatorics , 1974 .

[21]  Wolfgang Maass,et al.  On-line learning with an oblivious environment and the power of randomization , 1991, COLT '91.

[22]  Wolfgang Maass,et al.  On the complexity of learning from counterexamples and membership queries , 1990, Proceedings [1990] 31st Annual Symposium on Foundations of Computer Science.

[23]  Leslie G. Valiant,et al.  A theory of the learnable , 1984, CACM.

[24]  Vladimir Vapnik,et al.  Chervonenkis: On the uniform convergence of relative frequencies of events to their probabilities , 1971 .

[25]  Wolfgang Maass,et al.  How fast can a threshold gate learn , 1994, COLT 1994.

[26]  Wolfgang Maass,et al.  Algorithms and Lower Bounds for On-Line Learning of Geometrical Concepts , 1994, Machine Learning.

[27]  Ronald L. Rivest,et al.  Learning Binary Relations and Total Orders , 1989, COLT 1989.

[28]  N. Littlestone Learning Quickly When Irrelevant Attributes Abound: A New Linear-Threshold Algorithm , 1987, 28th Annual Symposium on Foundations of Computer Science (sfcs 1987).