Efficient parallel algorithm for pixel classification in remote sensing imagery

An important approach for image classification is the clustering of pixels in the spectral domain. Fast detection of different land cover regions or clusters of arbitrarily varying shapes and sizes in satellite images presents a challenging task. In this article, an efficient scalable parallel clustering technique of multi-spectral remote sensing imagery using a recently developed point symmetry-based distance norm is proposed. The proposed distributed computing time efficient point symmetry based K-Means technique is able to correctly identify presence of overlapping clusters of any arbitrary shape and size, whether they are intra-symmetrical or inter-symmetrical in nature. A Kd-tree based approximate nearest neighbor searching technique is used as a speedup strategy for computing the point symmetry based distance. Superiority of this new parallel implementation with the novel two-phase speedup strategy over existing parallel K-Means clustering algorithm, is demonstrated both quantitatively and in computing time, on two SPOT and Indian Remote Sensing satellite images, as even K-Means algorithm fails to detect the symmetry in clusters. Different land cover regions, classified by the algorithms for both images, are also compared with the available ground truth information. The statistical analysis is also performed to establish its significance to classify both satellite images and numeric remote sensing data sets, described in terms of feature vectors.

[1]  Edward Y. Chang,et al.  Parallel Spectral Clustering , 2008, ECML/PKDD.

[2]  Ujjwal Maulik,et al.  Parallel Point Symmetry Based Clustering for Gene Microarray Data , 2009, 2009 Seventh International Conference on Advances in Pattern Recognition.

[3]  Sunil Arya,et al.  ANN: library for approximate nearest neighbor searching , 1998 .

[4]  Ka Yee Yeung,et al.  Validating clustering for gene expression data , 2001, Bioinform..

[5]  Arnold L. Rosenberg,et al.  Bounded-Collision Memory-Mapping Schemes for Data Structures with Applications to Parallel Memories , 2007, IEEE Transactions on Parallel and Distributed Systems.

[6]  D.M. Mount,et al.  An Efficient k-Means Clustering Algorithm: Analysis and Implementation , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Jiawei Han,et al.  SRDA: An Efficient Algorithm for Large-Scale Discriminant Analysis , 2008, IEEE Transactions on Knowledge and Data Engineering.

[8]  Martin J. Wooster,et al.  Texture based feature extraction: Application to burn scar detection in Earth observation satellite sensor imagery , 2002 .

[9]  Sanguthevar Rajasekaran Efficient parallel hierarchical clustering algorithms , 2005, IEEE Transactions on Parallel and Distributed Systems.

[10]  Rafael C. González,et al.  Local Determination of a Moving Contrast Edge , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Peter B Barker,et al.  Benign and malignant breast lesions: diagnosis with multiparametric MR imaging. , 2003, Radiology.

[12]  Soon Myoung Chung,et al.  Parallel bisecting k-means with prediction clustering algorithm , 2006, The Journal of Supercomputing.

[13]  S. Bandyopadhyay,et al.  Nonparametric genetic clustering: comparison of validity indices , 2001, IEEE Trans. Syst. Man Cybern. Syst..

[14]  Larry Biehl,et al.  The Effect of Postemergence Herbicides on The Spectral Reflectance of Corn , 2008, Weed Technology.

[15]  Sanghamitra Bandyopadhyay,et al.  Satellite image classification using genetically guided fuzzy clustering with spatial information , 2005 .

[16]  Douglas A. Wolfe,et al.  Nonparametric Statistical Methods , 1973 .

[17]  Ujjwal Maulik,et al.  Fuzzy partitioning using a real-coded variable-length genetic algorithm for pixel classification , 2003, IEEE Trans. Geosci. Remote. Sens..

[18]  Ujjwal Maulik,et al.  Performance Evaluation of Some Clustering Algorithms and Validity Indices , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  Peter S. Pacheco Parallel programming with MPI , 1996 .

[20]  Sanghamitra Bandyopadhyay,et al.  Pixel classification using variable string genetic algorithms with chromosome differentiation , 2001, IEEE Trans. Geosci. Remote. Sens..

[21]  Yunhao Liu,et al.  Effectively Utilizing Global Cluster Memory for Large Data-Intensive Parallel Programs , 2006, IEEE Trans. Parallel Distributed Syst..

[22]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[23]  C. Borror Nonparametric Statistical Methods, 2nd, Ed. , 2001 .

[24]  Srinivas Aluru,et al.  Space and time efficient parallel algorithms and software for EST clustering , 2003, IEEE Trans. Parallel Distributed Syst..

[25]  Fawwaz T. Ulaby,et al.  An automated unsupervised/supervised classification methodology , 1998, IGARSS '98. Sensing and Managing the Environment. 1998 IEEE International Geoscience and Remote Sensing. Symposium Proceedings. (Cat. No.98CH36174).

[26]  Imtiaz Ahmad,et al.  D-ISODATA: A Distributed Algorithm for Unsupervised Classification of Remotely Sensed Data on Network of Workstations , 1999, J. Parallel Distributed Comput..

[27]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[28]  D. Wolfe,et al.  Nonparametric Statistical Methods. , 1974 .

[29]  Gerardo Beni,et al.  A Validity Measure for Fuzzy Clustering , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[30]  Sanghamitra Bandyopadhyay,et al.  A Point Symmetry-Based Clustering Technique for Automatic Evolution of Clusters , 2008, IEEE Transactions on Knowledge and Data Engineering.

[31]  Jon Louis Bentley,et al.  K-d trees for semidynamic point sets , 1990, SCG '90.

[32]  Sanghamitra Bandyopadhyay,et al.  GAPS: A clustering method using a new point symmetry-based distance measure , 2007, Pattern Recognit..