This paper describes an approach to classification based on a probabilistic clustering method. Most current classifiers perform classification by modeling class conditional densities directly or by modeling class-dependent discriminant functions. The approach described in this paper uses class-independent multilayer perceptrons (MLP) to estimate the probability that two given feature vectors are in the same class. These probability estimates are used to partition the input into separate classes in a probabilistic clustering. Classification by probabilistic clustering potentially offers greater robustness to different compositions of training and test sets than existing classification methods. Experimental results demonstrating the effectiveness of the method are given for an optical character recognition (OCR) problem. The relationship of the current approach to mixture density estimation, mixture discriminant analysis, and other OCR and handwriting recognition techniques is discussed.
[1]
Thomas M. Breuel,et al.
Modeling the sample distribution for clustering OCR
,
2000,
IS&T/SPIE Electronic Imaging.
[2]
Donald Geman,et al.
Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images
,
1984,
IEEE Transactions on Pattern Analysis and Machine Intelligence.
[3]
Joshua B. Tenenbaum,et al.
Learning bilinear models for two-factor problems in vision
,
1997,
Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[4]
George Nagy,et al.
Style consistency in pattern fields
,
2000
.