Impacts of sample design for validation data on the accuracy of feedforward neural network classification

Validation data are often used to evaluate the performance of a trained neural network and used in the selection of a network deemed optimal for the task at-hand. Optimality is commonly assessed with a measure, such as overall classification accuracy. The latter is often calculated directly from a confusion matrix showing the counts of cases in the validation set with particular labelling properties. The sample design used to form the validation set can, however, influence the estimated magnitude of the accuracy. Commonly, the validation set is formed with a stratified sample to give balanced classes, but also via random sampling, which reflects class abundance. It is suggested that if the ultimate aim is to accurately classify a dataset in which the classes do vary in abundance, a validation set formed via random, rather than stratified, sampling is preferred. This is illustrated with the classification of simulated and remotely-sensed datasets. With both datasets, statistically significant differences in the accuracy with which the data could be classified arose from the use of validation sets formed via random and stratified sampling (z = 2.7 and 1.9 for the simulated and real datasets respectively, for both p < 0.05%). The accuracy of the classifications that used a stratified sample in validation were smaller, a result of cases of an abundant class being commissioned into a rarer class. Simple means to address the issue are suggested.

[1]  Gerald Tesauro,et al.  Scaling and Generalization in Neural Networks: A Case Study , 1988, NIPS.

[2]  Sovan Lek,et al.  Neuronal Networks: Algorithms and Architectures for Ecologists and Evolutionary Ecologists , 2000 .

[3]  Jean-François Mas,et al.  Modelling deforestation using GIS and artificial neural networks , 2004, Environ. Model. Softw..

[4]  Paul M. Mather,et al.  The use of backpropagating artificial neural networks in land cover classification , 2003 .

[5]  Alan Agresti,et al.  Categorical Data Analysis , 1991, International Encyclopedia of Statistical Science.

[6]  G. David Garson,et al.  Neural Networks: An Introductory Guide for Social Scientists , 1999 .

[7]  Steffen Fritz,et al.  Investigating the Feasibility of Geo-Tagged Photographs as Sources of Land Cover Input Data , 2016, ISPRS Int. J. Geo Inf..

[8]  Giles M. Foody,et al.  Evaluation of SVM, RVM and SMLR for Accurate Image Classification With Limited Ground Data , 2012, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[9]  Rudy Setiono,et al.  Feedforward Neural Network Construction Using Cross Validation , 2001, Neural Computation.

[10]  Michael Y. Hu,et al.  Artificial neural networks in bankruptcy prediction: General framework and cross-validation analysis , 1999, Eur. J. Oper. Res..

[11]  Lucy Bastin,et al.  The Sensitivity of Mapping Methods to Reference Data Quality: Training Supervised Image Classifications with Imperfect Reference Data , 2016, ISPRS Int. J. Geo Inf..

[12]  Paul M. Mather,et al.  Classification methods for remotely sensed data, 2nd ed , 2016 .

[13]  Lorenzo Bruzzone,et al.  An experimental comparison of neural and statistical non-parametric algorithms for supervised classification of remote-sensing images , 1996, Pattern Recognit. Lett..

[14]  Jim Piper,et al.  Variability and bias in experimentally measured classifier error rates , 1992, Pattern Recognit. Lett..

[15]  Lutz Prechelt,et al.  Automatic early stopping using cross validation: quantifying the criteria , 1998, Neural Networks.

[16]  Giles M. Foody,et al.  The significance of border training patterns in classification by a feedforward neural network using back propagation learning , 1999 .

[17]  Giles M. Foody,et al.  Improving specific class mapping from remotely sensed data by cost-sensitive learning , 2017 .

[18]  Richard Lippmann,et al.  Using Genetic Algorithms to Improve Pattern Classification Performance , 1990, NIPS.

[19]  Genong Yu,et al.  Artificial Neural Networks and Remote Sensing , 2009 .

[20]  Juan J. Flores,et al.  The application of artificial neural networks to the analysis of remotely sensed data , 2008 .

[21]  Giles M. Foody,et al.  Crop classification by support vector machine with intelligently selected training data for an operational application , 2008 .

[22]  Alice E. Smith,et al.  Bias and variance of validation methods for function approximation neural networks under conditions of sparse data , 1998, IEEE Trans. Syst. Man Cybern. Part C.

[23]  Giles M. Foody,et al.  The effect of training set size and composition on artificial neural network classification , 1995 .

[24]  Jie Wang,et al.  Comparison of Classification Algorithms and Training Sample Sizes in Urban Land Classification with Landsat Thematic Mapper Imagery , 2014, Remote. Sens..

[25]  Paul M. Mather,et al.  Support vector machines for classification in remote sensing , 2005 .

[26]  Robert A. Schowengerdt,et al.  A review and analysis of backpropagation neural networks for classification of remotely-sensed multi-spectral imagery , 1995 .

[27]  Xiaohua Tong,et al.  Optimized Sample Selection in SVM Classification by Combining with DMSP-OLS, Landsat NDVI and GlobeLand30 Products for Extracting Urban Built-Up Areas , 2017, Remote. Sens..

[28]  Bernard A. Engel,et al.  Optimization of training data required for neuro-classification , 1994 .

[29]  H. Hemond,et al.  Statistical generation of training sets for measuring NO3(-), NH4(+) and major ions in natural waters using an ion selective electrode array. , 2016, Environmental science. Processes & impacts.

[30]  Giles M. Foody,et al.  The use of small training sets containing mixed pixels for accurate hard image classification: Training on mixed spectral responses for classification by a SVM , 2006 .

[31]  D. Peddle,et al.  Multi-Source Image Classification II: An Empirical Comparison of Evidential Reasoning and Neural Network Approaches , 1994 .

[32]  G. Foody Thematic map comparison: Evaluating the statistical significance of differences in classification accuracy , 2004 .

[33]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[34]  Holger R. Maier,et al.  Improved validation framework and R-package for artificial neural network models , 2017, Environ. Model. Softw..

[35]  Taskin Kavzoglu,et al.  Increasing the accuracy of neural network classification using refined training data , 2009, Environ. Model. Softw..

[36]  Giles M. Foody,et al.  Good practices for estimating area and assessing accuracy of land change , 2014 .

[37]  Shanjun Mao,et al.  Spectral–spatial classification of hyperspectral images using deep convolutional neural networks , 2015 .

[38]  Okan K. Ersoy,et al.  Classification accuracy improvement of neural network classifiers by using unlabeled data , 1998, IEEE Trans. Geosci. Remote. Sens..

[39]  Wei Zhang,et al.  Multiple Classifier System for Remote Sensing Image Classification: A Review , 2012, Sensors.

[40]  R. Setiono,et al.  Effective neural network pruning using cross-validation , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005..

[41]  Giles M. Foody,et al.  Hard and soft classifications by a neural network with a non-exhaustively defined set of classes , 2002 .

[42]  Yun Chen,et al.  Integration of Bayesian regulation back-propagation neural network and particle swarm optimization for enhancing sub-pixel mapping of flood inundation in river basins , 2016 .

[43]  David A. Landgrebe,et al.  Decision boundary feature extraction for neural networks , 1992, [Proceedings] 1992 IEEE International Conference on Systems, Man, and Cybernetics.

[44]  Jean-François Mas,et al.  Mapping land use/cover in a tropical coastal area using satellite sensor data, GIS and artificial neural networks , 2004 .

[45]  Qihao Weng,et al.  A survey of image classification methods and techniques for improving classification performance , 2007 .

[46]  Giles M. Foody,et al.  Toward intelligent training of supervised image classifications: directing training data acquisition for SVM classification , 2004 .

[47]  Stephen V. Stehman,et al.  Basic probability sampling designs for thematic map accuracy assessment , 1999 .