Multisource data classification with dependence trees

In order to apply a statistical approach to the classification of multisource remote-sensing data, one of the main problems to face lies in the estimation of probability distribution functions. This problem arises out of the difficulty of defining a common statistical model for such heterogeneous data. A possible solution is to adopt nonparametric approaches, which rely on the availability of training samples without any assumption about the related statistical distributions. The purpose of this paper is to investigate the suitability of the concept of dependence trees for the integration of multisource information through estimation of probability distributions. First, this concept, introduced by Chow and Liu (1968), is used to provide an approximation of a probability distribution defined in an N-dimensional space by a product of N-1 probability distributions defined in two-dimensional (2-D) spaces; this approximation corresponds, in terms of graph theoretical interpretation, to a tree of dependence. For each land cover class, a dependence tree is generated by minimizing an appropriate closeness measure. Then, a nonparametric estimation of the second-order probability distributions is carried out through the Parzen window approach, based on the implementation of 2-D Gaussian kernels. In this way, it is possible to reduce the complexity of the estimation, while capturing a significant part of the interdependence among variables. A comparison with other multisource data fusion methods, namely, the multilayer perceptron (MLP) method, the k-nearest neighbor (k-NN) method, and a Bayesian hierarchical classifier (BHC), is made. Experimental results obtained on multisensor [airborne thematic mapper (ATM) and synthetic aperture radar (SAR)] and multisource (experimental synthetic aperture radar (E-SAR) and a textural feature) data sets show that the proposed fusion method based on dependence trees is able to provide a classification accuracy similar to those of the other methods considered, but with the advantage of a reduced computational load.

[1]  Mihai Datcu,et al.  Spatial information retrieval from remote-sensing images. I. Information theoretical perspective , 1998, IEEE Trans. Geosci. Remote. Sens..

[2]  Peter E. Hart,et al.  The condensed nearest neighbor rule (Corresp.) , 1968, IEEE Trans. Inf. Theory.

[3]  Keinosuke Fukunaga,et al.  Introduction to statistical pattern recognition (2nd ed.) , 1990 .

[4]  C. G. Hilborn,et al.  The Condensed Nearest Neighbor Rule , 1967 .

[5]  P. Swain,et al.  Neural Network Approaches Versus Statistical Methods In Classification Of Multisource Remote Sensing Data , 1990 .

[6]  David T. Brown,et al.  A Note on Approximations to Discrete Probability Distributions , 1959, Inf. Control..

[7]  Athanasios Papoulis,et al.  Probability, Random Variables and Stochastic Processes , 1965 .

[8]  Lorenzo Bruzzone,et al.  An experimental comparison of neural and statistical non-parametric algorithms for supervised classification of remote-sensing images , 1996, Pattern Recognit. Lett..

[9]  T. Cacoullos Estimation of a multivariate density , 1966 .

[10]  E. Parzen 1. Random Variables and Stochastic Processes , 1999 .

[11]  M. Schroder,et al.  Bayesian modeling of remote sensing image content , 1999, IEEE 1999 International Geoscience and Remote Sensing Symposium. IGARSS'99 (Cat. No.99CH36293).

[12]  Philip M. Lewis,et al.  Approximating Probability Distributions to Reduce Storage Requirements , 1959, Information and Control.

[13]  A. Izenman Recent Developments in Nonparametric Density Estimation , 1991 .

[14]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[15]  Sebastiano B. Serpico,et al.  Classification of multisensor remote-sensing images by structured neural networks , 1995, IEEE Trans. Geosci. Remote. Sens..

[16]  Mihai Datcu,et al.  BAYESIAN METHODS : APPLICATIONS IN INFORMATION AGGREGATION AND IMAGE DATA MINING , 1999 .

[17]  Mihai Datcu,et al.  Interactive learning and probabilistic retrieval in remote sensing image archives , 2000, IEEE Trans. Geosci. Remote. Sens..

[18]  Keinosuke Fukunaga,et al.  Introduction to Statistical Pattern Recognition , 1972 .

[19]  R. A. Leibler,et al.  On Information and Sufficiency , 1951 .

[20]  E. Parzen On Estimation of a Probability Density Function and Mode , 1962 .

[21]  K. SEIDELSwiss Bayesian Labeling of Remote Sensing Image Content , 1998 .

[22]  Solomon Kullback,et al.  Approximating discrete probability distributions , 1969, IEEE Trans. Inf. Theory.

[23]  G. F. Hughes,et al.  On the mean accuracy of statistical pattern recognizers , 1968, IEEE Trans. Inf. Theory.

[24]  Jon Louis Bentley,et al.  Multidimensional binary search trees used for associative searching , 1975, CACM.

[25]  James Stephen Marron,et al.  Asymptotically Optimal Bandwidth Selection for Kernel Density Estimators from Randomly Right-Censored Samples. , 1987 .

[26]  J. Kruskal On the shortest spanning subtree of a graph and the traveling salesman problem , 1956 .

[27]  G. Sebestyen,et al.  An Algorithm for Non-Parametric Pattern Recognition , 1966, IEEE Trans. Electron. Comput..

[28]  C. N. Liu,et al.  Approximating discrete probability distributions with dependence trees , 1968, IEEE Trans. Inf. Theory.