Automatic Discovery of Classification and Estimation Algorithms for Earth- Observation Satellite Imagery

Under NASA’s new Earth Observing System (EOS), satellite imagery is expected to arrive back on Earth at rates of gigabytes/day. Techniques for the extraction of useful information from such massive data streams must be efficient and scalable in order to survive in petabyte archive situations, and they must overcome the opacity inherent in the data by classifying or estimating pixels according to user-specified categories such as crop-type or forest health. We are in the process of applying GP to several related satellite remote sensing (RS) classification and estimation problems in such a way as to surmount the usual obstacles to large-scale exploitation of imagery. The fitness functions used for training are based on how well the discovered programs perform on a set of cases from Landsat Thematic Mapper (TM) imagery. Programs are rated on how well they perform on out-of-training-set samples of cases from the same imagery. We have carried out a number of preliminary experiments on a relatively simple binary classification task. Each case is a set of 7 spectral intensity readings for a pixel and an associated ground truth class: 1 for surface water, 0 for none. The GP system very rapidly discovers simple relations that correctly predict 98%+ for training and testing data sets. The key problem with the results we have observed so far is that the simple solutions rapidly drive out diversity in the population. Several approaches will be taken in further study in order to try to maintain diversity in the population.

[1]  Daniel L. Civco,et al.  Artificial Neural Networks for Land-Cover Classification and Mapping , 1993, Int. J. Geogr. Inf. Sci..

[2]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[3]  W. A. Tackett,et al.  The donut problem: scalability, generalization and breeding policies in genetic programming , 1994 .

[4]  Jon Sticklen,et al.  Knowledge-based segmentation of Landsat images , 1991, IEEE Trans. Geosci. Remote. Sens..

[5]  Jason M. Daida,et al.  Extracting curvilinear features from synthetic aperture radar images of Arctic ice: algorithm discovery using the genetic programming paradigm , 1995, 1995 International Geoscience and Remote Sensing Symposium, IGARSS '95. Quantitative Remote Sensing for Science and Applications.

[6]  Stephen Wharton,et al.  A Spectral-Knowledge-Based Approach for Urban Land-Cover Discrmination , 1987, IEEE Transactions on Geoscience and Remote Sensing.

[7]  Kantilal L. Majumder,et al.  An expert system for land cover classification , 1995, IEEE Trans. Geosci. Remote. Sens..

[8]  James D. Westervelt,et al.  Using Neural Networks to Correlate Satellite Imagery and Ground-Truth Data , 1994 .

[9]  Horst Bischof,et al.  Multispectral classification of Landsat-images using neural networks , 1992, IEEE Trans. Geosci. Remote. Sens..

[10]  P. D. Heermann,et al.  Classification of multispectral remote sensing data using a back-propagation neural network , 1992, IEEE Trans. Geosci. Remote. Sens..

[11]  J. K. Kinnear,et al.  Alternatives in automatic function definition: a comparison of performance , 1994 .

[12]  Walter Alden Tackett,et al.  Genetic Programming for Feature Discovery and Image Discrimination , 1993, ICGA.

[13]  S. Arono,et al.  Geographic Information Systems: A Management Perspective , 1989 .

[14]  Stan Matwin,et al.  Automating reuse of software for expert system analysis of remote sensing data , 1994, IEEE Trans. Geosci. Remote. Sens..

[15]  John R. Koza,et al.  Evolution of a Computer Program for Classifying Protein Segments as Transmembrane Domains Using Genetic Programming , 1994, ISMB.

[16]  Una-May O'Reilly,et al.  Genetic Programming II: Automatic Discovery of Reusable Programs. , 1994, Artificial Life.

[17]  Eric V. Siegel Competitively evolving decision trees against fixed training cases for natural language processing , 1994 .

[18]  J. K. Kinnear,et al.  Advances in Genetic Programming , 1994 .