PAC-Learning from General Examples

Abstract In this paper we study a new view on the PAC-learning model in which the examples are more complicated than in the standard model. There, an example usually is an element of the learning domain and its label indicates whether it belongs to the target concept. Here, the examples can be subsets and their labels indicate some relation to the target concept, e.g., whether they intersect it or not. We show how this setting can be easily transformed into the standard PAC-model however, for an analysis it is much more natural to stick to the original formulation. Then the central notion is that of the relative dimension of a target class with respect to a sample class which replaces the Vapnik-Chervonenkis dimension. The investigation of structural aspects of the relative dimension is followed by its applications to learning environments. It turns out that computing or bounding the relative dimension leads to interesting combinatorial problems even for simple target and sample classes. Sometimes the analysis is easier if one represents the concepts as unions or intersections of simpler ones. We present sharp bounds on the relative and the Vapnik-Chervonenkis dimension of the complicated class in terms of the simpler one.

[1]  Herbert Edelsbrunner,et al.  Algorithms in Combinatorial Geometry , 1987, EATCS Monographs in Theoretical Computer Science.

[2]  John Shawe-Taylor,et al.  Bounding Sample Size with the Vapnik-Chervonenkis Dimension , 1993, Discrete Applied Mathematics.

[3]  John N. Tsitsiklis,et al.  PAC Learning with Generalized Samples and an Applicaiton to Stochastic Geometry , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Norbert Sauer,et al.  On the Density of Families of Sets , 1972, J. Comb. Theory A.

[5]  Shai Ben-David,et al.  Learning with restricted focus of attention , 1993, COLT '93.

[6]  Vladimir Vapnik,et al.  Chervonenkis: On the uniform convergence of relative frequencies of events to their probabilities , 1971 .

[7]  Hanno Lefmann,et al.  An extremal problem for Graham-Rothschild parameter words , 1989, Comb..

[8]  P. Assouad Densité et dimension , 1983 .

[9]  N. Littlestone Learning Quickly When Irrelevant Attributes Abound: A New Linear-Threshold Algorithm , 1987, 28th Annual Symposium on Foundations of Computer Science (sfcs 1987).

[10]  S. Shelah A combinatorial problem; stability and order for models and theories in infinitary languages. , 1972 .

[11]  de Ng Dick Bruijn A combinatorial problem , 1946 .

[12]  David Haussler,et al.  Learnability and the Vapnik-Chervonenkis dimension , 1989, JACM.

[13]  Michael C. Laskowski,et al.  Vapnik-Chervonenkis classes of definable sets , 1992 .

[14]  Steven Skiena,et al.  Probing Convex Polygons with X-Rays , 1988, SIAM J. Comput..

[15]  Leslie G. Valiant,et al.  A theory of the learnable , 1984, STOC '84.