Assessing reference dataset representativeness through confidence metrics based on information density

Abstract Land cover maps obtained from classification of remotely sensed imagery provide valuable information in numerous environmental monitoring and modeling tasks. However, many uncertainties and errors can directly or indirectly affect the quality of derived maps. This work focuses on one key aspect of the supervised classification process of remotely sensed imagery: the quality of the reference dataset used to develop a classifier. More specifically, the representative power of the reference dataset is assessed by contrasting it with the full dataset (e.g. entire image) needing classification. Our method is applicable in several ways: training or testing datasets (extracted from the reference dataset) can be compared with the full dataset. The proposed method moves beyond spatial sampling schemes (e.g. grid, cluster) and operates in the multidimensional feature space (e.g. spectral bands) and uses spatial statistics to compare information density of data to be classified with data used in the reference process. The working hypothesis is that higher information density, not in general but with respect to the entire classified image, expresses higher confidence in obtained results. Presented experiments establish a close link between confidence metrics and classification accuracy for a variety of image classifiers namely maximum likelihood, decision tree, Backpropagation Neural Network and Support Vector Machine. A sensitivity analysis demonstrates that spatially-continuous reference datasets (e.g. a square window) have the potential to provide similar classification confidence as typically-used spatially-random datasets. This is an important finding considering the higher acquisition costs for randomly distributed datasets. Furthermore, the method produces confidence maps that allow spatially-explicit comparison of confidence metrics within a given image for identification of over- and under-represented image portions. The current method is presented for individual image classification but, with sufficient evaluation from the remote sensing community it has the potential to become a standard for reference dataset reporting and thus allowing users to assess representativeness of reference datasets in a consistent manner across different classification tasks.

[1]  Philip H. Swain,et al.  Remote Sensing: The Quantitative Approach , 1981, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Francisco Herrera,et al.  Evolutionary stratified training set selection for extracting classification rules with trade off precision-interpretability , 2007, Data Knowl. Eng..

[3]  T. M. Lillesand,et al.  Remote Sensing and Image Interpretation , 1980 .

[4]  Sucharita Gopal,et al.  Uncertainty and Confidence in Land Cover Classification Using a Hybrid Classifier Approach , 2004 .

[5]  Richard Nock,et al.  Impact of learning set quality and size on decision tree performances , 2000, Int. J. Comput. Syst. Signals.

[6]  Linda C. van der Gaag,et al.  Visual exploration of uncertainty in remote-sensing classification , 1998 .

[7]  Giles M. Foody,et al.  Approaches for the production and evaluation of fuzzy land cover classifications from remotely-sensed data , 1996 .

[8]  Giles M. Foody,et al.  Sample size determination for image classification accuracy assessment and comparison , 2009 .

[9]  Giles M. Foody,et al.  The use of small training sets containing mixed pixels for accurate hard image classification: Training on mixed spectral responses for classification by a SVM , 2006 .

[10]  David A. Landgrebe,et al.  A Binary Tree Feature Selection Technique for Limited Training Sample Size , 1984 .

[11]  John A. Richards,et al.  Remote Sensing Digital Image Analysis: An Introduction , 1999 .

[12]  D. Stow,et al.  THE EFFECT OF TRAINING STRATEGIES ON SUPERVISED CLASSIFICATION AT DIFFERENT SPATIAL RESOLUTIONS , 2002 .

[13]  Giorgos Mountrakis,et al.  Integrating intermediate inputs from partially classified images within a hybrid classification framework: An impervious surface estimation example , 2010 .

[14]  Stephen V. Stehman,et al.  A Critical Evaluation of the Normalized Error Matrix in Map Accuracy Assessment , 2004 .

[15]  Huan Liu,et al.  Instance Selection and Construction for Data Mining , 2001 .

[16]  Stephen V. Stehman,et al.  Sampling designs for accuracy assessment of land cover , 2009 .

[17]  Raymond L. Czaplewski,et al.  Calibration of Remotely Sensed Proportion or Area Estimates for Misclassification Error , 1992 .

[18]  Giles M. Foody,et al.  Toward intelligent training of supervised image classifications: directing training data acquisition for SVM classification , 2004 .

[19]  Elisabetta Binaghi,et al.  Assessing the accuracy of soft thematic maps using fuzzy set-based error matrices , 1999, Remote Sensing.

[20]  Roland L. Redmond,et al.  Estimation and Mapping of Misclassification Probabilities for Thematic Land Cover Maps , 1998 .

[21]  Timothy A. Warner,et al.  The SAGE Handbook of Remote Sensing , 2009 .

[22]  Russell G. Congalton,et al.  Assessing the accuracy of remotely sensed data : principles and practices , 1998 .

[23]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[24]  Elisabetta Binaghi,et al.  A fuzzy set-based accuracy assessment of soft classification , 1999, Pattern Recognit. Lett..

[25]  Jungho Im,et al.  ISPRS Journal of Photogrammetry and Remote Sensing , 2022 .

[26]  Carla E. Brodley,et al.  Generating High-Quality Training Data for Automated Land-Cover Mapping , 2008, IGARSS 2008 - 2008 IEEE International Geoscience and Remote Sensing Symposium.

[27]  David A. Landgrebe,et al.  The effect of unlabeled samples in reducing the small sample size problem and mitigating the Hughes phenomenon , 1994, IEEE Trans. Geosci. Remote. Sens..

[28]  J. Wickham,et al.  Completion of the 2001 National Land Cover Database for the conterminous United States , 2007 .

[29]  Taskin Kavzoglu,et al.  Increasing the accuracy of neural network classification using refined training data , 2009, Environ. Model. Softw..

[30]  Partha Sarathi Roy,et al.  Land use land cover classification of Orissa using multi-temporal IRS-P6 awifs data: A decision tree approach , 2008, Int. J. Appl. Earth Obs. Geoinformation.

[31]  Scott Mitchell,et al.  Distance to second cluster as a measure of classification confidence , 2008 .

[32]  D. Civco,et al.  IMPERVIOUS SURFACE MAPPING FOR THE STATE OF CONNECTICUT 1 , 1997 .

[33]  Anil K. Jain,et al.  Statistical Pattern Recognition: A Review , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[34]  D. H. Card Using known map category marginal frequencies to improve estimates of thematic map accuracy , 1982 .

[35]  Christopher A. Barnes,et al.  Completion of the 2006 National Land Cover Database for the conterminous United States. , 2011 .

[36]  R. G. Pontlus Quantification Error Versus Location Error in Comparison of Categorical Maps , 2006 .

[37]  B. Ripley The Second-Order Analysis of Stationary Point Processes , 1976 .

[38]  Russell G. Congalton,et al.  A review of assessing the accuracy of classifications of remotely sensed data , 1991 .

[39]  B. Datt,et al.  On the relationship between training sample size and data dimensionality: Monte Carlo analysis of broadband multi-temporal classification , 2005 .

[40]  Anil K. Jain,et al.  Small Sample Size Effects in Statistical Pattern Recognition: Recommendations for Practitioners , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[41]  J. Campbell Introduction to remote sensing , 1987 .

[42]  R. Colwell Remote sensing of the environment , 1980, Nature.

[43]  D. R. Cutler,et al.  Effects of sample survey design on the accuracy of classification tree models in species distribution models , 2006 .

[44]  Derek D. Lichti,et al.  ISPRS Journal of Photogrammetry and Remote Sensing theme issue “Terrestrial Laser Scanning” , 2006 .

[45]  Rosa Maria Valdovinos,et al.  The Imbalanced Training Sample Problem: Under or over Sampling? , 2004, SSPR/SPR.

[46]  S. Stehman,et al.  Accuracy Assessment , 2003 .

[47]  Robert P. W. Duin,et al.  STATISTICAL PATTERN RECOGNITION , 2005 .

[48]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[49]  Steven E. Franklin,et al.  Automated training site selection for large-area remote-sensing image analysis , 1993 .

[50]  D. Walburn,et al.  London/New York , 2009 .

[51]  J. Lesparre,et al.  USING MIXED PIXELS FOR THE TRAINING OF A MAXIMUM LIKELIHOOD CLASSIFICATION , 2006 .

[52]  Stephen V. Stehman,et al.  Basic probability sampling designs for thematic map accuracy assessment , 1999 .

[53]  R. Pontius,et al.  Death to Kappa: birth of quantity disagreement and allocation disagreement for accuracy assessment , 2011 .

[54]  Giles M. Foody,et al.  Status of land cover classification accuracy assessment , 2002 .

[55]  M. Batistella,et al.  COMPARISON OF LAND-COVER CLASSIFICATION METHODS IN THE BRAZILIAN AMAZON BASIN , 2004 .