A Generic Framework for the Development of Geospatial Processing Pipelines on Clusters

The amount of remote sensing (RS) data available for applications is constantly growing due to the rise of very high resolution sensors and short-repeat-cycle satellites. Consequently, tackling the computational complexity in Earth observation information extraction is rising as a major challenge. Resorting to high-performance computing (HPC) is becoming a common practice, since this provides environments and programming facilities that are able to speed up processes. In particular, clusters are flexible cost-effective systems that are able to perform data-intensive tasks ideally fulfilling any computational requirement. However, their use typically implies a significant coding effort to build proper implementations of specific processing pipelines. This letter presents a generic framework for the development of RS images processing applications targeting cluster computing. It is based on common open-source libraries and leverages the parallelization of a wide variety of image processing pipelines in a transparent way. Performances on typical RS tasks implemented using the proposed framework demonstrate a great potential for the effective and timely processing of large amount of data.

[1]  Advances in Very-High-Resolution Remote Sensing , .

[2]  D. Lobell,et al.  Cropland distributions from temporal unmixing of MODIS data , 2004 .

[3]  Emmanuel Christophe,et al.  The Orfeo Toolbox remote sensing image processing software , 2009, 2009 IEEE International Geoscience and Remote Sensing Symposium.

[4]  Rajeev Thakur,et al.  On implementing MPI-IO portably and with high performance , 1999, IOPADS '99.

[5]  Randy H. Katz,et al.  Above the Clouds: A Berkeley View of Cloud Computing , 2009 .

[6]  Michael J Ackerman,et al.  Engineering and algorithm design for an image processing Api: a technical report on ITK--the Insight Toolkit. , 2002, Studies in health technology and informatics.

[7]  Antonio J. Plaza,et al.  Recent Developments in High Performance Computing for Remote Sensing: A Review , 2011, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[8]  Samuel Williams,et al.  The Landscape of Parallel Computing Research: A View from Berkeley , 2006 .

[9]  Chein-I Chang,et al.  High Performance Computing in Remote Sensing , 2007, HiPC 2007.

[10]  A-Xing Zhu,et al.  How to Apply the Geospatial Data Abstraction Library (GDAL) Properly to Parallel Geospatial Raster I/O? , 2014, Trans. GIS.

[11]  Julien Michel,et al.  Remote Sensing Processing: From Multicore to GPU , 2011, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[12]  Albert Y. Zomaya,et al.  A Parallel File System with Application-Aware Data Layout Policies for Massive Remote Sensing Image Processing in Digital Earth , 2015, IEEE Transactions on Parallel and Distributed Systems.

[13]  Jon Atli Benediktsson,et al.  Advances in Very-High-Resolution Remote Sensing [Scanning the Issue] , 2013, Proc. IEEE.

[14]  Wen Zeng,et al.  pRPL 2.0: Improving the Parallel Raster Processing Library , 2014 .

[15]  Chenghu Zhou,et al.  A strategy for raster-based geocomputation under different parallel computing platforms , 2014, Int. J. Geogr. Inf. Sci..