Contrivedness: The boundary between pattern recognition and numerology

Abstract The irreducible informational loss expended in a pattern search procedure is quantified using the concept of contrived entropy. In multivariate analysis this quantity is of value in distinguishing true patterns from statistical noise and in deciding to what depth a search procedure should be conducted. For a specific partition, the contrived entropy is defined as the partition entropy averaged over all possible permutations of event outcomes. The contrived entropy associated with a search procedure or set of attempted partitions is taken to be the expectation value of the minimized partition entropy for each permutation. The behavior of the contrived entropy is illustrated for a simple univariate case.

[1]  Donald H. Foley Considerations of sample and feature size , 1972, IEEE Trans. Inf. Theory.

[2]  E. Pitman Significance Tests Which May be Applied to Samples from Any Populations , 1937 .

[3]  A. Dempster An overview of multivariate data analysis , 1971 .

[4]  B. Chandrasekaran,et al.  On dimensionality and sample size in statistical pattern classification , 1971, Pattern Recognit..

[5]  R Fisher,et al.  Design of Experiments , 1936 .

[6]  Thomas M. Cover,et al.  Geometrical and Statistical Properties of Systems of Linear Inequalities with Applications in Pattern Recognition , 1965, IEEE Trans. Electron. Comput..

[7]  W. W. Bledsoe,et al.  Some Results on Multicategory Pattern Recognition , 1966, J. ACM.

[8]  S. L. Andersen,et al.  Permutation Theory in the Derivation of Robust Criteria and the Study of Departures from Assumption , 1955 .

[9]  Solomon Kullback,et al.  Information Theory and Statistics , 1960 .

[10]  Philip M. Lewis,et al.  The characteristic selection problem in recognition systems , 1962, IRE Trans. Inf. Theory.

[11]  Russ E. Davis,et al.  Predictability of Sea Surface Temperature and Sea Level Pressure Anomalies over the North Pacific Ocean , 1976 .

[12]  M. R. Mickey,et al.  Estimation of Error Rates in Discriminant Analysis , 1968 .

[13]  W. Hoeffding The Large-Sample Power of Tests Based on Permutations of Observations , 1952 .

[14]  George E. P. Box,et al.  Time Series Analysis: Forecasting and Control , 1977 .

[15]  G. F. Hughes,et al.  On the mean accuracy of statistical pattern recognizers , 1968, IEEE Trans. Inf. Theory.