Neural Networks for Land Cover Applications

Following upon their resurgence as a research topic in the second half of the eighties [1][2], neural networks burst out in the remote sensing community by the end of the same decade when a bunch of investigators started looking at them as an effective alternative to more traditional methods for extracting information from data collected by airborne and spaceborne platforms [3] [4]. Indeed, for researchers used to handle multidimensional data, the possibility of building classification algorithms avoiding the assumptions required by the Bayesian methods on the data probability distributions seemed rather attractive. For the case of parameter retrieval, neural networks owned the advantage of determining the input-output relationship directly from the training data with no need to seek for an explicit modelling of the physical mechanisms, often nonlinear and with poorly understood factors involved [5]. Moreover, it was shown that multilayer feedforward networks formed a class of universal approximators, capable of approximating any real-valued continuous function provided a sufficient number of units in the hidden layer was considered [6]. There were enough reasons to explore the potentialities of such neurocomputational models, also known as associative memory based models, in a wide range of remote sensing applications. This was actually what happened throughout all nineties and, even if with less emphasis, is continuing up to date. Image classification, from SAR imagery to latest hyperspectral data, has probably been one of the most investigated fields in this context. However, the use of neural network for the retrieval of bio-geophysical parameters from remotely sensed data has also played a significant role. In particular, the synergy between these algorithms and RT (Radiative Transfer) electromagnetic models represented a new and effective way to replace the empirical approaches, often based on limited seasonally or regionally data, hence with little generalization capabilities. The use of the RT models is twofold. They can help in the sensitivity analysis of the electromagnetic quantities to the bio-geophysical parameters. They can be exploited for the synthetic generation of training data in case of a lack of experimental data and ground-truth. The combined use of electromagnetic models and neural networks is a topic treated in many published studies taking into account the most diverse applicative scenarios: from the inversion of radiance spectra to infer atmospheric ozone concentration profiles [7], to the retrieval of snow parameters from passive microwave measurements [8] or to the estimation of sea water optically active parameters from hyperspectral data [9], just to mention one case for the three main different domains: atmosphere, earth surface and ocean. As far as the network topology is concerned, MLP probably are on the top of the authors preference ranking, but the choice is widely spread over the numerous alternatives which include RBF (Radial Basis Function), recurrent architectures, as well as the Kohonen SOM [10] if we focus on the unsupervised image classification field. Such an extensive research activity confirmed that neural networks, besides representing a new and easy way of machine learning, possess particularly interesting properties, such as the capability of capturing subtle dependencies among the data, an inherent fault tolerance due to their parallel and distributed structure, a capability of positively merging pieces of information stemming from different sources. Only advantages? Obviously not. The conventional backpropagation learning algorithm can be stuck in a local minimum. Moreover, the choice of network architecture (i.e. number of hidden layers and nodes in each layer), weight initialization and number of iterations required for training may significantly affect the learning performance. Addressing these issues is one of the dominant themes of the current research. Some authors [11] suggest that SVM (Support Vector Machines) may reach in terms of accuracy performances similar to those obtainable by means of neural networks without suffering from the problem of local minima and with limited effort required for architecture design. Other authors [12] remain focused on neural models, aiming at automating the selection of the parameters characterizing the algorithm or searching for improvements in the learning procedures. A different ongoing line of research is more sensitive to the opportunities that should be provided by the data of the last generation EO (Earth Observation) missions. Hence it is more dedicated to exploit the aforementioned neural networks potentialities for more complex applications [13][14]. A couple of interesting examples belonging to this area of research will be illustrated in section 11.3 of this chapter, while section 11.2 recalls some theoretical aspects about neural models.

[1]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[2]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[3]  A. Fung Microwave Scattering and Emission Models and their Applications , 1994 .

[4]  Jeffrey L. Elman,et al.  Distributed Representations, Simple Recurrent Networks, and Grammatical Structure , 1991, Mach. Learn..

[5]  Chee Kheong Siew,et al.  Extreme learning machine: Theory and applications , 2006, Neurocomputing.

[6]  Philipp Slusallek,et al.  Introduction to real-time ray tracing , 2005, SIGGRAPH Courses.

[7]  Jenq-Neng Hwang,et al.  Handbook of Neural Network Signal Processing , 2000, IEEE Transactions on Neural Networks.

[8]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[9]  Richard P. Lippmann,et al.  An introduction to computing with neural nets , 1987 .

[10]  Ashbindu Singh,et al.  Review Article Digital change detection techniques using remotely-sensed data , 1989 .

[11]  Nikola Kasabov,et al.  Foundations Of Neural Networks, Fuzzy Systems, And Knowledge Engineering [Books in Brief] , 1996, IEEE Transactions on Neural Networks.

[12]  P. Swain,et al.  Neural Network Approaches Versus Statistical Methods In Classification Of Multisource Remote Sensing Data , 1990 .

[13]  Martin Fodslette Møller,et al.  A scaled conjugate gradient algorithm for fast supervised learning , 1993, Neural Networks.

[14]  Horst Bischof,et al.  Multispectral classification of Landsat-images using neural networks , 1992, IEEE Trans. Geosci. Remote. Sens..

[15]  William J. Emery,et al.  An Innovative Neural-Net Method to Detect Temporal Changes in High-Resolution Optical Satellite Imagery , 2007, IEEE Transactions on Geoscience and Remote Sensing.

[16]  Lorenzo Bruzzone,et al.  Robust multiple estimator systems for the analysis of biophysical parameters from remotely sensed data , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[17]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[18]  Christopher M. Bishop,et al.  Neural networks for pattern recognition , 1995 .

[19]  Marco Diani,et al.  Retrieval of sea water optically active parameters from hyperspectral data by means of generalized radial basis function neural networks , 2001, IEEE Trans. Geosci. Remote. Sens..

[20]  Maurice Borgeaud,et al.  Interpreting ERS SAR signatures of agricultural crops in Flevoland, 1993-1996 , 2000, IEEE Trans. Geosci. Remote. Sens..

[21]  Lorenzo Bruzzone,et al.  An adaptive semiparametric and context-based approach to unsupervised change detection in multitemporal remote-sensing images , 2002, IEEE Trans. Image Process..

[22]  Michael A. Arbib,et al.  The handbook of brain theory and neural networks , 1995, A Bradford book.

[23]  Vojislav Kecman,et al.  Learning and Soft Computing: Support Vector Machines, Neural Networks, and Fuzzy Logic Models , 2001 .

[24]  Fabio Del Frate,et al.  Neural networks for oil spill detection using ERS-SAR data , 2000, IEEE Trans. Geosci. Remote. Sens..

[25]  Fabio Del Frate,et al.  Crop classification using multiconfiguration C-band SAR data , 2003, IEEE Trans. Geosci. Remote. Sens..

[26]  D. Solimini,et al.  Monitoring urban areas by using ERS-SAR data and neural networks algorithms , 1999, IEEE 1999 International Geoscience and Remote Sensing Symposium. IGARSS'99 (Cat. No.99CH36293).

[27]  William J. Emery,et al.  Comparing Statistical and Neural Network Methods Applied to Very High Resolution Satellite Images Showing Changes in Man-Made Structures at Rocky Flats , 2008, IEEE Transactions on Geoscience and Remote Sensing.

[28]  Fabio Del Frate,et al.  Use of Neural Networks for Automatic Classification From High-Resolution Images , 2007, IEEE Transactions on Geoscience and Remote Sensing.

[29]  Pramod K. Varshney,et al.  An image change detection algorithm based on Markov random field models , 2002, IEEE Trans. Geosci. Remote. Sens..

[30]  Wei-Min Qi,et al.  Dynamic properties of Elman and modified Elman neural network , 2002, Proceedings. International Conference on Machine Learning and Cybernetics.

[31]  Robert J. Marks,et al.  Inversion Of Snow Parameters From Passive Microwave Remote Sensing Measurements By A Neural Network Trained With A Multiple Scattering Model , 1991, [Proceedings] IGARSS'91 Remote Sensing: Global Monitoring for Earth Management.

[32]  S. Casadio,et al.  Application of neural algorithms for a real-time estimation of ozone profiles from GOME measurements , 2002, IEEE Trans. Geosci. Remote. Sens..

[33]  D. H. Hoekman,et al.  The potential of SAR in crop classification using multi-configuration data , 2002 .