pipsCloud: High performance cloud computing for remote sensing big data management and processing

With the increasing requirement of accurate and up-to-date resource & environmental information for regional and global monitoring, large-region covered multi-temporal, multi-spectral massive remote sensing (RS) datasets are exploited for processing. The remote sensing data processing generally follows a complex multi-stage processing chain, which consists of several independent processing steps subject to types of RS applications. In general the RS data processing for regional environmental and disaster monitoring are recognized as typical both compute-intensive and data-intensive applications. To solve the aforementioned issues efficiently, we propose pipsCloud which combine recent Cloud computing and HPC techniques to enable large-scale RS data processing system as on-demand real-time services. Benefiting from the ubiquity, elasticity and high-level of transparency of Cloud computing model, the massive RS data managing and data processing for dynamic environmental monitoring are all encapsulate as Cloud with Web interfaces. Where, a Hilbert-R+ based data indexing mechanism is employed for optimal query and access of RS imageries, RS data products as well as interim data. In the core platform beneath the Cloud services, we provide a parallel file system for massive high-dimensional RS data and offers interfaces for intensive irregular RS data accessing so as to provide improved data locality and optimized I/O performance. Moreover, we adopt an adaptive RS data analysis workflow manage system for on-demand workflow construction and collaborative execution of distributed complex chain of RS data processing, such as forest fire detection, mineral resources and coastline monitoring. Through the experimental analysis we have show the efficiency of the pipsCloud platform.

[1]  Antonio J. Plaza,et al.  Recent Developments in High Performance Computing for Remote Sensing: A Review , 2011, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[2]  Sassan Saatchi,et al.  The Global Rain Forest Mapping Project JERS-1 radar mosaic of tropical Africa: development and product characterization aspects , 2000, IEEE Trans. Geosci. Remote. Sens..

[3]  Jon Atli Benediktsson,et al.  Recent Advances in Techniques for Hyperspectral Image Processing , 2009 .

[4]  Ciprian Dobre,et al.  Parallel Programming Paradigms and Frameworks in Big Data Era , 2013, International Journal of Parallel Programming.

[5]  Georg Hager,et al.  Hybrid MPI/OpenMP Parallel Programming on Clusters of Multi-Core SMP Nodes , 2009, 2009 17th Euromicro International Conference on Parallel, Distributed and Network-based Processing.

[6]  Wenji Zhao,et al.  Research on Critical Techniques of Disaster-Oriented Remote Sensing Quick Mapping , 2010, 2010 International Conference on Multimedia Technology.

[7]  Andreas Kassler,et al.  Deploying OpenStack: Virtual Infrastructure or Dedicated Hardware , 2014, 2014 IEEE 38th International Computer Software and Applications Conference Workshops.

[8]  Yeh-Ching Chung,et al.  InfiniBand virtualization on KVM , 2012, 4th IEEE International Conference on Cloud Computing Technology and Science Proceedings.

[9]  A. Kivity,et al.  kvm : the Linux Virtual Machine Monitor , 2007 .

[10]  Kawser Wazed Nafi,et al.  An experimental study of load balancing of OpenNebula open-source cloud computing platform , 2014, 2014 International Conference on Informatics, Electronics & Vision (ICIEV).

[11]  Derya Maktav,et al.  Foreword to the Special Issue on “Human Settlements: A Global Remote Sensing Challenge” , 2011, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[12]  Josh Simons,et al.  Performance Evaluation of HPC Benchmarks on VMware's ESXi Server , 2011, Euro-Par Workshops.

[13]  Masanobu Shimada,et al.  An overview of the JERS-1 SAR Global Boreal Forest Mapping (GBFM) project , 2004, IGARSS 2004. 2004 IEEE International Geoscience and Remote Sensing Symposium.

[14]  Mohamed H. Almeer Cloud Hadoop Map Reduce For Remote Sensing Image Analysis , 2012 .

[15]  Ying Li,et al.  Large oil spill surveillance with the use of MODIS and AVHRR images , 2011, 2011 International Conference on Remote Sensing, Environment and Transportation Engineering.

[16]  S. Saatchi,et al.  The Global Rain Forest Mapping project - A review , 2000 .

[17]  Yu Fang,et al.  Applying GPU and POSIX thread technologies in massive remote sensing image data processing , 2011, 2011 19th International Conference on Geoinformatics.

[18]  Zhengwei He,et al.  Zhouqu County 8.8 extra-large-scale debris flow characters of remote sensing image analysis , 2011, 2011 International Conference on Electronics, Communications and Control (ICECC).

[19]  ChangVictor,et al.  Cloud computing adoption framework , 2016 .

[20]  Shi Zhao-lian,et al.  Primary Study of Massive Imaging Auto-processing System “Pixel Factory” , 2006 .

[21]  Feng-Cheng Lin,et al.  Service Component Architecture for Geographic Information System in Cloud Computing Infrastructure , 2013, 2013 IEEE 27th International Conference on Advanced Information Networking and Applications (AINA).

[22]  Liping Di,et al.  Building an on-demand web service system for Global Agricultural Drought Monitoring and Forecasting , 2012, 2012 IEEE International Geoscience and Remote Sensing Symposium.

[23]  Jing Hua,et al.  A Reference Architecture for Scientific Workflow Management Systems and the VIEW SOA Solution , 2009, IEEE Transactions on Services Computing.

[24]  Muthu Ramachandran,et al.  Cloud Computing Adoption Framework – a security framework for business clouds , 2015 .

[25]  Katherine A. Yelick,et al.  Hybrid PGAS runtime support for multicore nodes , 2010, PGAS '10.

[26]  Albert Y. Zomaya,et al.  Task-Tree Based Large-Scale Mosaicking for Massive Remote Sensed Imageries with Dynamic DAG Scheduling , 2014, IEEE Transactions on Parallel and Distributed Systems.

[27]  Katherine A. Yelick,et al.  Communication optimizations for fine-grained UPC applications , 2005, 14th International Conference on Parallel Architectures and Compilation Techniques (PACT'05).

[28]  Kenjiro Taura,et al.  An Empirical Performance Study of Chapel Programming Language , 2012, 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum.

[29]  Manish Parashar,et al.  Enabling on-demand science via cloud computing , 2014, IEEE Cloud Computing.

[30]  Guo Lei,et al.  A parallel fusion method of remote sensing image based on IHS transformation , 2011, 2011 4th International Congress on Image and Signal Processing.

[31]  David Grove,et al.  X10 as a Parallel Language for Scientific Computation: Practice and Experience , 2011, 2011 IEEE International Parallel & Distributed Processing Symposium.

[32]  Antonio J. Plaza,et al.  Real-Time Endmember Extraction on Multicore Processors , 2011, IEEE Geoscience and Remote Sensing Letters.

[33]  P. Priyanga,et al.  Enabling Smart Cloud Services Through Remote Sensing : An Internet of Everything Enabler , 2015 .

[34]  Qian Du,et al.  High Performance Computing for Hyperspectral Remote Sensing , 2011, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[35]  Liu Renyi,et al.  Research of the landuse vector data storage and spatial index based on the main memory database. , 2015 .

[36]  Zhaowen Lin,et al.  Bare Metal Provisioning to OpenStack Using xCAT , 2013, J. Comput..

[37]  Yinghui Zhao,et al.  Remote sensing based soil moisture estimation on high performance PC server , 2010, 2010 The 2nd Conference on Environmental Science and Information Application Technology.

[38]  Chengqi Cheng,et al.  An index and retrieval method of spatial data based on GeoSOT global discrete grid system , 2013, 2013 IEEE International Geoscience and Remote Sensing Symposium - IGARSS.

[39]  Edward A. Lee,et al.  Scientific workflow management and the Kepler system , 2006, Concurr. Comput. Pract. Exp..

[40]  Lizhe Wang,et al.  Generic Parallel Programming for Massive Remote Sensing Data Processing , 2012, 2012 IEEE International Conference on Cluster Computing.

[41]  Antonio J. Plaza,et al.  Special issue on architectures and techniques for real-time processing of remotely sensed images , 2009, Journal of Real-Time Image Processing.

[42]  Yanying Wang,et al.  An Optimized Image Mosaic Algorithm with Parallel IO and Dynamic Grouped Parallel Strategy Based on Minimal Spanning Tree , 2010, 2010 Ninth International Conference on Grid and Cloud Computing.

[43]  Richard Wolski,et al.  The Eucalyptus Open-Source Cloud-Computing System , 2009, 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid.

[44]  Florian Pappenberger,et al.  High-Resolution 3-D Flood Information From Radar Imagery for Flood Hazard Management , 2007, IEEE Transactions on Geoscience and Remote Sensing.

[45]  Taeyoung Kim,et al.  Parallel Processing with MPI for Inter-band Registration in Remote Sensing , 2011, 2011 IEEE 17th International Conference on Parallel and Distributed Systems.

[46]  Yan Ma,et al.  An Asynchronous Parallelized and Scalable Image Resampling Algorithm with Parallel I/O , 2009, ICCS.

[47]  M. N. Vora,et al.  Hadoop-HBase for large-scale data , 2011, Proceedings of 2011 International Conference on Computer Science and Network Technology.

[48]  Chein-I Chang,et al.  High Performance Computing in Remote Sensing , 2007, HiPC 2007.

[49]  Daniel S. Katz,et al.  Pegasus: A framework for mapping complex scientific workflows onto distributed systems , 2005, Sci. Program..

[50]  Anne H. Schistad Solberg,et al.  Remote Sensing of Ocean Oil-Spill Pollution , 2012, Proceedings of the IEEE.

[51]  Daniel Mandl Matsu: An Elastic Cloud Connected to a SensorWeb for Disaster Response , 2011 .

[52]  Pascal Bouvry,et al.  HPC Performance and Energy-Efficiency of Xen, KVM and VMware Hypervisors , 2013, 2013 25th International Symposium on Computer Architecture and High Performance Computing.

[53]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[54]  Rajkumar Buyya,et al.  Minimizing Execution Costs when Using Globally Distributed Cloud Services , 2010, 2010 24th IEEE International Conference on Advanced Information Networking and Applications.