Min-max classifiers: Learnability, design and application

Abstract This paper introduces the class of min-max classifiers. These are binary-valued functions that can be used as pattern classifiers of both real-valued and binary-valued feature vectors. They are also lattice-theoretic generalization of Boolean functions and are also related to feed-forward neural networks and morphological signal operators. We studied supervised learning of these classifiers under the Probably Approximately Correct (PAC) model proposed by Valiant. Several subclasses of thresholded min-max functions are shown to be learnable, generalizing the learnability results for the corresponding classes of Boolean functions. We also propose a LMS algorithm for the practical training of these pattern classifiers. Experimental results using the LMS algorithm for handwritten character recognition are promising. For example, in our experiments the min-max classifiers were able to achieve error rates that are comparable or better than those generated using neural networks. The major advantage of min-max classifiers compared to neural networks is their simplicity and the faster convergence of their training algorithm.

[1]  Dan S. Bloomberg Connectivity-preserving morphological image transformations , 1991, Other Conferences.

[2]  Jean Serra,et al.  Image Analysis and Mathematical Morphology , 1983 .

[3]  Temple F. Smith Occam's razor , 1980, Nature.

[4]  Philippe Salembier Structuring element adaptation for morphological filters , 1992, J. Vis. Commun. Image Represent..

[5]  Xindong Wu,et al.  Lecture Notes in Machine Learning , 1994, Informatica.

[6]  Michael Kearns,et al.  Computational complexity of machine learning , 1990, ACM distinguished dissertations.

[7]  John M. Trenkle,et al.  Morphological feature-set optimization using the genetic algorithm , 1991, Optics & Photonics.

[8]  Leslie G. Valiant,et al.  Learning Disjunction of Conjunctions , 1985, IJCAI.

[9]  R. Schafer,et al.  Morphological systems for multidimensional signal processing , 1990, Proc. IEEE.

[10]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[11]  Petros Maragos,et al.  Energy separation in signal modulations with application to speech analysis , 1993, IEEE Trans. Signal Process..

[12]  G. Kane Parallel Distributed Processing: Explorations in the Microstructure of Cognition, vol 1: Foundations, vol 2: Psychological and Biological Models , 1994 .

[13]  Robert E. Schapire,et al.  Design and analysis of efficient learning algorithms , 1992, ACM Doctoral dissertation award ; 1991.

[14]  Ching Y. Suen,et al.  The State of the Art in Online Handwriting Recognition , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  David Haussler,et al.  Classifying learnable geometric concepts with the Vapnik-Chervonenkis dimension , 1986, STOC '86.

[16]  Leslie G. Valiant,et al.  A theory of the learnable , 1984, STOC '84.

[17]  R.M. McElhaney,et al.  Algorithms for graphics and image processing , 1983, Proceedings of the IEEE.

[18]  Petros Maragos,et al.  Learnability of min-max pattern classifiers , 1991, Other Conferences.

[19]  Leslie G. Valiant,et al.  On the learnability of Boolean formulae , 1987, STOC.

[20]  Petros Maragos,et al.  Pattern Spectrum and Multiscale Shape Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  Lei Xu,et al.  Recent Advances on Techniques of Static Feedforward Networks with Supervised Learning , 1992, Int. J. Neural Syst..

[22]  Petros Maragos,et al.  Morphological filters-Part I: Their set-theoretic analysis and relations to linear shift-invariant filters , 1987, IEEE Trans. Acoust. Speech Signal Process..

[23]  Francesco Palmieri A backpropagation algorithm for multilayer hybrid order statistic filters , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[24]  室 章治郎 Michael R.Garey/David S.Johnson 著, "COMPUTERS AND INTRACTABILITY A guide to the Theory of NP-Completeness", FREEMAN, A5判変形判, 338+xii, \5,217, 1979 , 1980 .

[25]  E. Gilbert Lattice Theoretic Properties of Frontal Switching Functions , 1954 .

[26]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[27]  Francesco Palmieri Adaptive recursive order statistic filters , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[28]  S. Thomas Alexander,et al.  Adaptive Signal Processing , 1986, Texts and Monographs in Computer Science.

[29]  King-Sun Fu,et al.  Shape Discrimination Using Fourier Descriptors , 1977, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30]  Stephen S. Wilson Morphological Networks , 1989, Other Conferences.

[31]  Petros Maragos,et al.  Morphological systems for character image processing and recognition , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[32]  Ralph Roskies,et al.  Fourier Descriptors for Plane Closed Curves , 1972, IEEE Transactions on Computers.

[33]  David Haussler,et al.  Learnability and the Vapnik-Chervonenkis dimension , 1989, JACM.