The problem of estimating the probability of unobserved outcomes, sometimes called the conditional probability of a new species, is studied. Good's estimator, which is essentially the same as Robbins' estimator, is the number of singleton species observed divided by the sample size; it is examined here from a decision-theoretic point of view. The results obtained are as follows: (1) When the total number of distinct species is assumed to be bounded by a known number, Good's and Robbins' estimators are inadmissible under squared error loss. (2) If the number of distinct species can be infinite, Good's and Robbins' estimators are admissible under squared error loss. (3) Whereas Robbins' estimator is a UMVUE for the unconditional probability of a new species obtained in one extra sample point, it is not a uniformly minimum mean squared error unbiased estimator of the conditional probability of a new species. This answers a question raised by Robbins. (4) For Robbins' model and squared error loss, there exist admissible Bayes estimators that do not depend only on a minimal sufficient statistic. A discussion of the interpretation and significance of the results is offered.
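As a small illustration of the estimator described above (the number of singleton species divided by the sample size), the following sketch computes it for a list of observed species labels. The function name `singleton_estimate` and the toy sample are assumptions introduced here for illustration only; they do not come from the paper, and the sketch does not distinguish Good's form from Robbins' form of the estimator.

```python
from collections import Counter


def singleton_estimate(sample):
    """Return (number of species seen exactly once) / (sample size).

    `sample` is a sequence of hashable species labels. This is a
    hypothetical helper written for illustration, not code from the paper.
    """
    n = len(sample)
    if n == 0:
        raise ValueError("sample must be non-empty")
    counts = Counter(sample)  # occurrences of each species
    singletons = sum(1 for c in counts.values() if c == 1)
    return singletons / n


# Toy sample: species "c" and "d" each appear exactly once among 7 draws.
sample = ["a", "b", "a", "c", "d", "a", "b"]
estimate = singleton_estimate(sample)
print(estimate)
```

In this toy sample two of the seven observations belong to species seen only once, so the estimated probability that the next draw is a new species is 2/7.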
[1] L. Le Cam. An Extension of Wald's Theory of Statistical Decision Functions, 1955.
[2] Gerald S. Rogers, et al. Mathematical Statistics: A Decision Theoretic Approach, 1967.
[3] Warren W. Esty, et al. The Efficiency of Good's Nonparametric Coverage Estimator, 1986.
[4] E. Lehmann. Testing Statistical Hypotheses, 1960.
[5] Lars Holst. Some Limit Theorems with Applications in Sampling Theory, 1973.
[6] I. Good. The Population Frequencies of Species and the Estimation of Population Parameters, 1953.
[7] Edward W. Frees, et al. Nonparametric Estimation of the Probability of Discovering a New Species, 1987.
[8] N. Starr. Linear Estimation of the Probability of Discovering a New Species, 1979.
[9] H. Robbins. Estimating the Total Probability of the Unobserved Outcomes of an Experiment, 1968.
[10] Abraham Wald, et al. Statistical Decision Functions, 1951.