In pattern recognition, when the ratio of training samples to dimensionality is small, parameter estimates become highly variable and classification performance deteriorates. This problem has become more prevalent in remote sensing with the emergence of a new generation of sensors with as many as several hundred spectral bands. Although the new sensor technology provides higher spectral and spatial resolution, enabling a greater number of spectrally separable classes to be identified, the labeled samples needed to design the classifier remain difficult and expensive to acquire. Better parameter estimates can be obtained by exploiting a large number of unlabeled samples in addition to the training samples, using the expectation-maximization (EM) algorithm under the mixture model. However, this estimation method is sensitive to statistical outliers. In remote sensing data, miscellaneous classes with few samples are often difficult to identify and may constitute statistical outliers. The authors therefore propose a robust parameter-estimation method for the mixture model. The proposed method assigns full weight to training samples but automatically gives reduced weight to unlabeled samples. Experimental results show that, compared with estimates obtained from the direct EM approach, the robust method prevents the performance deterioration caused by statistical outliers in the data.
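The idea of combining labeled and unlabeled samples in EM while down-weighting atypical unlabeled samples can be illustrated with a minimal sketch. This is not the authors' exact estimator: the Gaussian class model, the redescending weight function, the names `robust_weight` and `robust_semisupervised_em`, and the `cutoff` and `decay` defaults are all assumptions chosen for illustration. Labeled samples always enter the update with weight 1, while each unlabeled sample contributes its class posterior multiplied by a weight that shrinks with its Mahalanobis distance from the class mean.

```python
import numpy as np

def robust_weight(d, cutoff, decay=1.25):
    """Illustrative redescending weight: 1 inside the cutoff, decaying beyond it."""
    excess = np.maximum(d - cutoff, 0.0)
    scale = np.where(excess > 0, cutoff / np.maximum(d, 1e-12), 1.0)
    return scale * np.exp(-0.5 * (excess / decay) ** 2)

def robust_semisupervised_em(X_lab, y_lab, X_unl, n_classes, n_iter=20, cutoff=None):
    """EM for a Gaussian mixture using labeled and unlabeled samples.

    Assumes every class has at least two labeled samples.
    Passing cutoff=np.inf disables the down-weighting (plain EM).
    """
    dim = X_lab.shape[1]
    if cutoff is None:
        cutoff = np.sqrt(dim) + 2.0  # hypothetical default, not from the paper
    ridge = 1e-6 * np.eye(dim)       # keeps covariance matrices invertible

    # Initialise class statistics from the labeled samples alone.
    means = np.array([X_lab[y_lab == k].mean(axis=0) for k in range(n_classes)])
    covs = np.array([np.cov(X_lab[y_lab == k].T) + ridge for k in range(n_classes)])
    priors = np.full(n_classes, 1.0 / n_classes)

    for _ in range(n_iter):
        # E-step: posteriors and Mahalanobis distances of the unlabeled samples.
        log_post = np.empty((len(X_unl), n_classes))
        dist = np.empty_like(log_post)
        for k in range(n_classes):
            diff = X_unl - means[k]
            d2 = np.einsum("ij,ij->i", diff, np.linalg.solve(covs[k], diff.T).T)
            dist[:, k] = np.sqrt(d2)
            _, logdet = np.linalg.slogdet(covs[k])
            log_post[:, k] = np.log(priors[k]) - 0.5 * (logdet + d2)
        log_post -= log_post.max(axis=1, keepdims=True)
        post = np.exp(log_post)
        post /= post.sum(axis=1, keepdims=True)

        # Unlabeled samples contribute posterior times robust weight.
        w = post * robust_weight(dist, cutoff)

        # M-step: labeled samples enter with full weight 1.
        for k in range(n_classes):
            Xk = X_lab[y_lab == k]
            total = len(Xk) + w[:, k].sum()
            means[k] = (Xk.sum(axis=0) + w[:, k] @ X_unl) / total
            dl, du = Xk - means[k], X_unl - means[k]
            covs[k] = (dl.T @ dl + (w[:, k, None] * du).T @ du) / total + ridge
        priors = post.sum(axis=0) / len(X_unl)

    return means, covs, priors
```

Calling the function with `cutoff=np.inf` gives every unlabeled sample a weight of 1, which reduces the sketch to a direct EM update and makes it easy to compare the two behaviours on data containing outliers.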