Abstract. Within recent years, several new approaches and solutions for Big Data processing have been developed. The Geospatial world is still facing the lack of well-established distributed processing solutions tailored to the amount and heterogeneity of geodata, especially when fast data processing is a must. The goal of such systems is to improve processing time by distributing data transparently across processing (and/or storage) nodes. These types of methodology are based on the concept of divide and conquer. Nevertheless, in the context of geospatial processing, most of the distributed computing frameworks have important limitations regarding both data distribution and data partitioning methods. Moreover, flexibility and expendability for handling various data types (often in binary formats) are also strongly required. This paper presents a concept for tiling, stitching and processing of big geospatial data. The system is based on the IQLib concept ( https://github.com/posseidon/IQLib/ ) developed in the frame of the IQmulus EU FP7 research and development project ( http://www.iqmulus.eu ). The data distribution framework has no limitations on programming language environment and can execute scripts (and workflows) written in different development frameworks (e.g. Python, R or C#). It is capable of processing raster, vector and point cloud data. The above-mentioned prototype is presented through a case study dealing with country-wide processing of raster imagery. Further investigations on algorithmic and implementation details are in focus for the near future.
[1]
R. Kitchin,et al.
Big Data, new epistemologies and paradigm shifts
,
2014,
Big Data Soc..
[2]
Barrie Sosinsky,et al.
Cloud Computing Bible
,
2010
.
[3]
T. H. Tse,et al.
A Tale of Clouds: Paradigm Comparisons and Some Thoughts on Research Issues
,
2008,
2008 IEEE Asia-Pacific Services Computing Conference.
[4]
Yanchun Zhang,et al.
A Data as a Product Model for Future Consumption of Big Stream Data in Clouds
,
2015,
2015 IEEE International Conference on Services Computing.
[5]
Bin Jiang,et al.
Crowdsourcing, Citizen Science or Volunteered Geographic Information? The Current State of Crowdsourced Geographic Information
,
2016,
ISPRS Int. J. Geo Inf..
[6]
John A. Olson.
Data as a Service: Are We in the Clouds?
,
2009
.
[7]
Sushil Jajodia,et al.
Secure Cloud Computing
,
2014,
Springer New York.
[8]
Hassan A. Karimi,et al.
GEOSS clearinghouse: Integrating geospatial resources to support the global earth observation system of systems
,
2014
.
[9]
Vipin Kumar,et al.
Trends in big data analytics
,
2014,
J. Parallel Distributed Comput..
[10]
Jae-Gil Lee,et al.
Geospatial Big Data: Challenges and Opportunities
,
2015,
Big Data Res..
[11]
Ahmed Eldawy,et al.
The era of Big Spatial Data
,
2016,
ICDE.