Quantization complexity and training sample size in detection

For the k -hypothesis detection problem, it is shown that, among the k -classes of probability density functions with m fixed quantiles, histograms achieve the least favorable performance as measured by the probability of correct detection and Chernoff distance. It is assumed that the m cell probabilities are estimated using n training samples per class. With the aid of the estimated cell probabilities, new observations are processed. A distribution-free upper bound to the probability of \epsilon -deviation between the actual probability of correct detection and the theoretical (known quantiles) probability is derived as a function of (m,n,\epsilon,k,u_{o}) , where u_{o} is a uniform upper bound to the true class densities. The bound converges exponentially to zero as n \rightarrow \infty . Exponential convergence is obtained by choosing m = n^{\alpha}, 0 . Hence, the rule m = n^{\alpha} answers the long standing question of how to relate m and n in a distribution-free manner. The question of the optimal choice of a is also discussed.

[1]  Lee D. Davisson,et al.  The prediction error of stationary Gaussian time series of unknown covariance , 1965, IEEE Trans. Inf. Theory.

[2]  Peter J. Huber,et al.  Fisher Information and Spline Interpolation , 1974 .

[3]  Ludwik Kurz,et al.  Nonparametric detectors based on m-interval partitioning , 1972, IEEE Trans. Inf. Theory.

[4]  Irwin M. Jacobs Probability-of-error bounds for binary transmission on the slowly fading Rician channel , 1966, IEEE Trans. Inf. Theory.

[5]  B. Chandrasekaran,et al.  Independence of measurements and the mean recognition accuracy , 1971, IEEE Trans. Inf. Theory.

[6]  D. Lainiotis A class of upper-bounds on probability of error for multi-hypotheses pattern recognition , 1969 .

[7]  N. Glick Sample-Based Classification Procedures Derived from Density Estimators , 1972 .

[8]  R. Douglas Martin,et al.  Robust estimation via stochastic approximation , 1975, IEEE Trans. Inf. Theory.

[9]  B. Chandrasekaran,et al.  Comments on "On the mean accuracy of statistical pattern recognizers" by Hughes, G. F , 1969, IEEE Trans. Inf. Theory.

[10]  King-Sun Fu,et al.  Error estimation in pattern recognition via LAlpha -distance between posterior density functions , 1976, IEEE Trans. Inf. Theory.

[11]  Luc Devroye,et al.  A distribution-free performance bound in error estimation (Corresp.) , 1976, IEEE Trans. Inf. Theory.

[12]  Demetrios G. Lainiotis,et al.  A class of upper bounds on probability of error for multihypotheses pattern recognition (Corresp.) , 1969, IEEE Trans. Inf. Theory.

[13]  M. Rosenblatt Remarks on Some Nonparametric Estimates of a Density Function , 1956 .

[14]  G. F. Hughes,et al.  On the mean accuracy of statistical pattern recognizers , 1968, IEEE Trans. Inf. Theory.

[15]  Thomas M. Cover,et al.  Topics in Statistical Pattern Recognition , 1980 .

[16]  Lee D. Davisson A theory of adaptive filtering , 1966, IEEE Trans. Inf. Theory.

[17]  W. Bushnell The optimization and performance of detectors based on partition tests (Ph.D. Thesis abstr.) , 1975, IEEE Trans. Inf. Theory.

[18]  H. Vincent Poor,et al.  MAXIMUM-DISTANCE QUANTIZATION FOR DETECTION. , 1976 .

[19]  G. Wahba Interpolating Spline Methods for Density Estimation I. Equi-Spaced Knots , 1975 .

[20]  Laveen N. Kanal,et al.  Patterns in pattern recognition: 1968-1974 , 1974, IEEE Trans. Inf. Theory.

[21]  E. Parzen On Estimation of a Probability Density Function and Mode , 1962 .

[22]  T. Kailath The Divergence and Bhattacharyya Distance Measures in Signal Selection , 1967 .

[23]  József Fritz,et al.  Distribution-free exponential error bound for nearest neighbor pattern classification , 1975, IEEE Trans. Inf. Theory.

[24]  Donald H. Foley Considerations of sample and feature size , 1972, IEEE Trans. Inf. Theory.

[25]  Panayota Papantoni-Kazakos,et al.  Small-sample efficiencies of rank tests , 1975, IEEE Trans. Inf. Theory.

[26]  Anil K. Jain,et al.  Independence, Measurement Complexity, and Classification Performance , 1975, IEEE Transactions on Systems, Man, and Cybernetics.