TerraBrasilis: A Spatial Data Analytics Infrastructure for Large-Scale Thematic Mapping

The physical phenomena derived from an analysis of remotely sensed imagery provide a clearer understanding of the spectral variations of a large number of land use and cover (LUC) classes. The creation of LUC maps have corroborated this view by enabling the scientific community to estimate the parameter heterogeneity of the Earth’s surface. Along with descriptions of features and statistics for aggregating spatio-temporal information, the government programs have disseminated thematic maps to further the implementation of effective public policies and foster sustainable development. In Brazil, PRODES and DETER have shown that they are committed to monitoring the mapping areas of large-scale deforestation systematically and by means of data quality assurance. However, these programs are so complex that they require the designing, implementation and deployment of a spatial data infrastructure based on extensive data analytics features so that users who lack a necessary understanding of standard spatial interfaces can still carry out research on them. With this in mind, the Brazilian National Institute for Space Research (INPE) has designed TerraBrasilis, a spatial data analytics infrastructure that provides interfaces that are not only found within traditional geographic information systems but also in data analytics environments with complex algorithms. To ensure it achieved its best performance, we leveraged a micro-service architecture with virtualized computer resources to enable high availability, lower size, simplicity to produce an increment, reliable to change and fault tolerance in unstable computer network scenarios. In addition, we tuned and optimized our databases both to adjust to the input format of complex algorithms and speed up the loading of the web application so that it was faster than other systems.

[1]  Gloria Bordogna,et al.  A Spatial Data Infrastructure Integrating Multisource Heterogeneous Geospatial Data and Time Series: A Study Case in Agriculture , 2016, ISPRS Int. J. Geo Inf..

[2]  Dan Hammer,et al.  Satellite-Based Forest Clearing Detection in the Brazilian Amazon: , 2014 .

[3]  Brian D. Killough,et al.  Overview of the Open Data Cube Initiative , 2018, IGARSS 2018 - 2018 IEEE International Geoscience and Remote Sensing Symposium.

[4]  Ahmed Eldawy,et al.  SpatialHadoop: A MapReduce framework for spatial data , 2015, 2015 IEEE 31st International Conference on Data Engineering.

[5]  Ian Williamson,et al.  Developing a common spatial data infrastructure between State and Local Government--an Australian case study , 2002, Int. J. Geogr. Inf. Sci..

[6]  Chrisa Tsinaraki,et al.  Big Data - a step change for SDI? , 2016, Int. J. Spatial Data Infrastructures Res..

[7]  Denisa Rodila,et al.  Building an Earth Observations Data Cube: lessons learned from the Swiss Data Cube (SDC) on generating Analysis Ready Data (ARD) , 2017 .

[8]  Ioannis Kanellopoulos,et al.  The European geoportal - one step towards the establishment of a European Spatial Data Infrastructure , 2005, Comput. Environ. Urban Syst..

[9]  Ian Williamson,et al.  Spatial data infrastructure management: lessons from corporate GIS development , 1999 .

[10]  Hadley Wickham,et al.  A Layered Grammar of Graphics , 2010 .

[11]  Damien Arvor,et al.  Big earth observation time series analysis for monitoring Brazilian agriculture , 2018, ISPRS Journal of Photogrammetry and Remote Sensing.

[12]  Paul G. Brown,et al.  Overview of sciDB: large scale array storage, processing and analysis , 2010, SIGMOD Conference.

[13]  Peter Baumann,et al.  The multidimensional database system RasDaMan , 1998, SIGMOD '98.

[14]  Manzhu Yu,et al.  Big Earth data analytics: a survey , 2019, Big Earth Data.

[15]  Michael Stonebraker,et al.  SciDB: A Database Management System for Applications with Complex Analytics , 2013, Computing in Science & Engineering.

[16]  Laércio Massaru Namikawa,et al.  WATER BODY EXTRACTION FROM RAPIDEYE IMAGES: AN AUTOMATED METHODOLOGY BASED ON HUE COMPONENT OF COLOR TRANSFORMATION FROM RGB TO HSV MODEL , 2016, Revista Brasileira de Cartografia.

[17]  Paul A. Longley,et al.  The emergence of geoportals and their role in spatial data infrastructures , 2005, Comput. Environ. Urban Syst..

[18]  Max Craglia,et al.  SDI - From Spatial Data Infrastructure to Service Driven Infrastructure , 2005 .

[19]  Marcos Adami,et al.  DETER-B: The New Amazon Near Real-Time Deforestation Detection System , 2015, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[20]  Cesar Guerreiro Diniz,et al.  High spatial resolution land use and land cover mapping of the Brazilian Legal Amazon in 2008 using Landsat-5/TM and MODIS data , 2016 .

[21]  Gilberto Câmara,et al.  Web Services for Big Earth Observation data , 2017, GEOINFO.

[22]  Chaowei Yang,et al.  Utilizing Cloud Computing to address big geospatial data challenges , 2017, Comput. Environ. Urban Syst..

[23]  P. C. Smits,et al.  Resource Discovery in a European Spatial Data Infrastructure , 2007 .

[24]  Bertram C. Bruce,et al.  Reconceptualizing the role of the user of spatial data infrastructure , 2008 .

[25]  Qunying Huang,et al.  Cloud computing for geosciences: deployment of GEOSS clearinghouse on Amazon's EC2 , 2010, HPDGIS '10.

[26]  Divyakant Agrawal,et al.  $\mathcal{MD}$-HBase: design and implementation of an elastic data infrastructure for cloud-scale location services , 2012, Distributed and Parallel Databases.

[27]  Gilberto Câmara,et al.  A spatiotemporal calculus for reasoning about land-use trajectories , 2018, Int. J. Geogr. Inf. Sci..

[28]  Gilberto Câmara,et al.  Big data streaming for remote sensing time series analytics using MapReduce , 2017, GEOINFO.

[29]  Julia Wagemann,et al.  Geospatial web services pave new ways for server-based on-demand access and processing of Big Earth Data , 2018, Int. J. Digit. Earth.

[30]  Yosio Edemir Shimabukuro,et al.  Comparing annual MODIS and PRODES forest cover change data for advancing monitoring of Brazilian forest cover , 2008 .

[31]  Mark Bailey,et al.  The Grammar of Graphics , 2007, Technometrics.

[32]  Jason Lawrence,et al.  HIPI : A Hadoop Image Processing Interface for Image-based MapReduce Tasks , 2011 .

[33]  Ian Masser GIS Worlds: Creating Spatial Data Infrastructures , 2005 .

[34]  Joel H. Saltz,et al.  Hadoop-GIS: A High Performance Spatial Data Warehousing System over MapReduce , 2013, Proc. VLDB Endow..

[35]  Michael Stonebraker,et al.  EarthDB: scalable analysis of MODIS data using SciDB , 2012, BigSpatial '12.

[36]  Peng Yue,et al.  An SDI Approach for Big Data Analytics: The Case on Sensor Web Event Detection and Geoprocessing Workflow , 2015, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[37]  Chaowei Phil Yang,et al.  Redefining the possibility of digital Earth and geosciences with spatial cloud computing , 2013, Int. J. Digit. Earth.

[38]  Michael Dixon,et al.  Google Earth Engine: Planetary-scale geospatial analysis for everyone , 2017 .

[39]  Peter Baumann,et al.  Datacubes: A Technology Survey , 2018, IGARSS 2018 - 2018 IEEE International Geoscience and Remote Sensing Symposium.

[40]  Gilberto Câmara,et al.  Big earth observation data analytics: matching requirements to system architectures , 2016, BigSpatial '16.

[41]  Tapani Sarjakoski,et al.  GeoCubes Finland - A Unified Approach for Managing Multi-resolution Raster Geodata in a National Geospatial Research Infrastructure , 2018 .

[42]  Douglas D. Nebert Spatial Data Infrastructure Concepts and Components [presentation] , 2009 .

[43]  Shen-Shyang Ho,et al.  A SciDB-based Framework for Efficient Satellite Data Storage and Query based on Dynamic Atmospheric Event Trajectory , 2015, BigSpatial@SIGSPATIAL.

[44]  Ian Williamson,et al.  Spatial Data Infrastructure Concepts , 1999 .

[45]  Roger S. Pressman,et al.  Software Engineering: A Practitioner's Approach , 1982 .

[46]  Edison Pignaton de Freitas,et al.  A Service-Oriented Middleware for Integrated Management of Crowdsourced and Sensor Data Streams in Disaster Management † , 2018, Sensors.

[47]  G. Câmara,et al.  Amazonian Deforestation Models , 2005, Science.