pyjeo: A Python Package for the Analysis of Geospatial Data

A new Python package, pyjeo, that deals with the analysis of geospatial data has been created by the Joint Research Centre (JRC). Adopting the principles of open science, the JRC strives for transparency and reproducibility of results. In this view, it has been decided to release pyjeo as free and open software. This paper describes the design of pyjeo and how its underlying C/C++ library was ported to Python. Strengths and limitations of the design choices are discussed. In particular, the data model that allows the generation of on-the-fly data cubes is of importance. Two uses cases illustrate how pyjeo can contribute to open science. The first is an example of large-scale processing, where pyjeo was used to create a global composite of Sentinel-2 data. The second shows how pyjeo can be imported within an interactive platform for image analysis and visualization. Using an innovative mechanism that interprets Python code within a C++ library on-the-fly, users can benefit from all functions in the pyjeo package. Images are processed in deferred mode, which is ideal for prototyping new algorithms on geospatial data, and assess the suitability of the results created on the fly at any scale and location.

[1]  Julien Michel,et al.  Orfeo ToolBox: open source processing of remote sensing images , 2017, Open Geospatial Data, Software and Standards.

[2]  Douglas Thain,et al.  Distributed computing in practice: the Condor experience , 2005, Concurr. Pract. Exp..

[3]  David M. Beazley,et al.  Automated scientific software scripting with SWIG , 2003, Future Gener. Comput. Syst..

[4]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[5]  Russ Rew,et al.  NetCDF: an interface for scientific data access , 1990, IEEE Computer Graphics and Applications.

[6]  P. Soille Optimal removal of spurious pits in grid digital elevation models , 2004 .

[7]  簡聰富,et al.  物件導向軟體之架構(Object-Oriented Software Construction)探討 , 1989 .

[8]  Pierre Soille,et al.  A versatile data-intensive computing platform for information retrieval from big geospatial data , 2018, Future Gener. Comput. Syst..

[9]  Lewis Adam,et al.  The six faces of the data cube , 2017 .

[10]  Michael Dixon,et al.  Google Earth Engine: Planetary-scale geospatial analysis for everyone , 2017 .

[11]  Steve H. L. Liang,et al.  Open geospatial data, software and standards , 2016, Open Geospatial Data, Software and Standards.

[12]  Pierre Soille,et al.  Constrained connectivity for hierarchical image partitioning and simplification , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  P. Kempeneers,et al.  Optimizing Sentinel-2 image selection in a Big Data context , 2017 .

[14]  Martino Pesaresi,et al.  A New Method for Earth Observation Data Analytics Based on Symbolic Machine Learning , 2016, Remote. Sens..

[15]  Pierre Soille,et al.  Morphological carving , 2004, Pattern Recognit. Lett..

[16]  Edzer Pebesma,et al.  On-Demand Processing of Data Cubes from Satellite Image Collections with the gdalcubes Library , 2019, Data.

[17]  Patrick Hostert,et al.  A Pixel-Based Landsat Compositing Algorithm for Large Area Land Cover Mapping , 2013, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[18]  Daniel Caldeweyher,et al.  Open source GIS: a GRASS GIS approach. The international series in engineering and computer science , 2011, Int. J. Geogr. Inf. Sci..

[19]  Pierre Soille,et al.  An Efficient Algorithm for Drainage Network Extraction on DEMs , 1994, J. Vis. Commun. Image Represent..

[20]  Pierre Soille,et al.  Advances in Interactive Processing and Visualisation with Jupyter on the JRC Big Data Platform (JEODPP) , 2018 .

[21]  Steve Easterbrook,et al.  Open code for open science , 2014 .

[22]  Luc Vincent,et al.  Watersheds in Digital Spaces: An Efficient Algorithm Based on Immersion Simulations , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  E. Rupnik,et al.  MicMac – a free, open-source solution for photogrammetry , 2017, Open Geospatial Data, Software and Standards.