D-ISODATA: A Distributed Algorithm for Unsupervised Classification of Remotely Sensed Data on Network of Workstations

With the advent of high-speed networks and the availability of powerful high-performance workstations, network of workstations has emerged as the most cost-effective platform for computation-intensive applications. One of the major applications for the network of workstations is in the field of remote sensing, where because of the high dimensionality of data, most of the existing data exploitation procedures are computation-intensive. To test the utility of the network of workstations in the field of remote sensing we have adopted a modified version of the well-known ISODATA classification procedure which may be considered as the benchmark for all unsupervised classification algorithms. The ISODATA algorithm is an iterative method that uses Euclidean distance as the similarity measure to cluster data elements into different classes. We have designed and developed a distributed version of ISODATA algorithm (D-ISODATA) on the network of workstations under a message-passing interface environment and have obtained promising speedup. To reduce the processing load and thereby increase the throughput, the ISODATA procedure is commonly applied to only the first few principal component images derived from the original set of the multispectral images. The drawback with the principal component approach is that it is based entirely on the statistical significance of the spectra, rather than the uniqueness of the individual spectra. As, small objects and ground features would likely manifest themselves in the last principal component images, that is, eigen images, discarding them prior to classification would lead to the loss of valuable information. The significant enhancement in processing speed on the network of workstations makes it possible for us to apply our distributed algorithm D-ISODATA to the entire set of multispectral images directly, thereby preserving all the spectral signatures in the data, regardless of their statistical significance.

[1]  Hazem Tawfik,et al.  Pattern recognition techniques in handoff and service area determination , 1994, Proceedings of IEEE Vehicular Technology Conference (VTC).

[2]  R. Chawla,et al.  The Stealth distributed scheduler , 1991, [1991] Proceedings. 11th International Conference on Distributed Computing Systems.

[3]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[4]  M. B. Merickel,et al.  Recovering valid clusters with ISODATA supervised by the CAIC , 1988, Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[5]  Nalini K. Ratha,et al.  Clustering using a coarse-grained parallel genetic algorithm: a preliminary study , 1995, Proceedings of Conference on Computer Architectures for Machine Perception.

[6]  T. M. Lillesand,et al.  Remote Sensing and Image Interpretation , 1980 .

[7]  G. Riccardi,et al.  Adaptation of the ISODATA clustering algorithm for vector supercomputer execution , 1988, Proceedings Supercomputing Vol.II: Science and Applications.

[8]  Witold Pedrycz,et al.  Fuzzy clustering in software reusability , 1997 .

[9]  Paul M. Mather,et al.  Computer Processing of Remotely-Sensed Images: An Introduction , 1988 .

[10]  G H Ball,et al.  A clustering technique for summarizing multivariate data. , 1967, Behavioral science.

[11]  Amjad Umar Distributed And Parallel Computing , 1998, IEEE Concurrency.

[12]  Hesham El-Rewini,et al.  Distributed and Parallel Computing , 1998 .

[13]  S. S. Shen,et al.  Effects of multispectral compression on machine exploitation , 1993, Proceedings of 27th Asilomar Conference on Signals, Systems and Computers.

[14]  Michael B. Merickel,et al.  Supervising ISODATA with an information theoretic stopping rule , 1990, Pattern Recognit..

[15]  N. B. Venkateswarlu,et al.  Fast isodata clustering algorithms , 1992, Pattern Recognit..

[16]  SkjellumAnthony,et al.  A high-performance, portable implementation of the MPI message passing interface standard , 1996 .

[17]  Abdel-Ouahab Boudraa,et al.  Left ventricle automated detection method in gated isotopic ventriculography using fuzzy clustering , 1993, IEEE Trans. Medical Imaging.

[18]  Andrew G. Tescher,et al.  Practical transform coding of multispectral imagery , 1995, IEEE Signal Process. Mag..

[19]  M. Berger,et al.  Cluster analysis of MOMS-02/D2 data using spectral bands in combination with texture images , 1995, 1995 International Geoscience and Remote Sensing Symposium, IGARSS '95. Quantitative Remote Sensing for Science and Applications.