Distributed Parallel Endmember Extraction of Hyperspectral Data Based on Spark

Due to the increasing dimensionality and volume of remotely sensed hyperspectral data, the development of acceleration techniques for massive hyperspectral image analysis approaches is a very important challenge. Cloud computing offers many possibilities of distributed processing of hyperspectral datasets. This paper proposes a novel distributed parallel endmember extraction method based on iterative error analysis that utilizes cloud computing principles to efficiently process massive hyperspectral data. The proposed method takes advantage of technologies including MapReduce programming model, Hadoop Distributed File System HDFS, and Apache Spark to realize distributed parallel implementation for hyperspectral endmember extraction, which significantly accelerates the computation of hyperspectral processing and provides high throughput access to large hyperspectral data. The experimental results, which are obtained by extracting endmembers of hyperspectral datasets on a cloud computing platform built on a cluster, demonstrate the effectiveness and computational efficiency of the proposed method.

[1]  Antonio J. Plaza,et al.  Recent Developments in High Performance Computing for Remote Sensing: A Review , 2011, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[2]  Jon Atli Benediktsson,et al.  Hyperspectral Unmixing on GPUs and Multi-Core Processors: A Comparison , 2013, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[3]  Antonio J. Plaza,et al.  Parallel and Distributed Dimensionality Reduction of Hyperspectral Data on Cloud Computing Architectures , 2016, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[4]  Zhihui Wei,et al.  Sparse Non-negative Matrix Factorization on GPUs for Hyperspectral Unmixing , 2014, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[5]  Katarina Stanoevska-Slabeva,et al.  Grid and Cloud Computing, A Business Perspective on Technology and Applications , 2009, Grid and Cloud Computing.

[6]  Chao Yang,et al.  Cloud Computing Enabled Web Processing Service for Earth Observation Data Processing , 2012, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[7]  Antonio J. Plaza,et al.  Real-Time Implementation of the Vertex Component Analysis Algorithm on GPUs , 2013, IEEE Geoscience and Remote Sensing Letters.

[8]  Antonio J. Plaza,et al.  Real-Time Endmember Extraction on Multicore Processors , 2011, IEEE Geoscience and Remote Sensing Letters.

[9]  José M. Bioucas-Dias,et al.  Minimum Volume Simplex Analysis: A Fast Algorithm to Unmix Hyperspectral Data , 2008, IGARSS 2008 - 2008 IEEE International Geoscience and Remote Sensing Symposium.

[10]  Antonio J. Plaza,et al.  FPGA Implementation of the N-FINDR Algorithm for Remotely Sensed Hyperspectral Image Analysis , 2012, IEEE Transactions on Geoscience and Remote Sensing.

[11]  José M. Bioucas-Dias,et al.  Vertex component analysis: a fast algorithm to unmix hyperspectral data , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[12]  Qian Du,et al.  High Performance Computing for Hyperspectral Remote Sensing , 2011, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[13]  José M. Bioucas-Dias,et al.  Does independent component analysis play a role in unmixing hyperspectral data? , 2005, IEEE Trans. Geosci. Remote. Sens..

[14]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[15]  Chein-I Chang,et al.  Fully constrained least squares linear spectral mixture analysis method for material quantification in hyperspectral imagery , 2001, IEEE Trans. Geosci. Remote. Sens..

[16]  José M. Bioucas-Dias,et al.  Hyperspectral unmixing algorithm via dependent component analysis , 2007, 2007 IEEE International Geoscience and Remote Sensing Symposium.

[17]  Antonio J. Plaza,et al.  Hyperspectral Unmixing Overview: Geometrical, Statistical, and Sparse Regression-Based Approaches , 2012, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[18]  Michael J. Franklin,et al.  Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing , 2012, NSDI.

[19]  Antonio J. Plaza,et al.  Parallel Hyperspectral Unmixing on GPUs , 2014, IEEE Geoscience and Remote Sensing Letters.

[20]  Chein-I Chang,et al.  A New Growing Method for Simplex-Based Endmember Extraction Algorithm , 2006, IEEE Transactions on Geoscience and Remote Sensing.