Geospatial semantics, ontology and knowledge graphs for big Earth data

Big Data has attracted a lot of attention from governments, industries and academia, and it has been applied to a large number of fields around the world. Big Earth data refers to big data associated with the Earth sciences that is characterized as being massive, multi-source, heterogeneous, multi-temporal, multi-scalar, highly dimensional, highly complex, nonstationary, and unstructured (Guo, 2017b; Guo et al., 2017a). Most big Earth data is related to a geographical location and is usually referred to as geospatial data (Lee & Kang, 2015). Geospatial data is not only an important component of and the main way of organizing and visualizing big Earth data, it is also the foundation for integrating multisource, heterogeneous big Earth data. With the development of Earth observation, deep exploration, computer simulations and other technologies, the capacity to acquire geospatial data has grown very fast, and thus the volume of geospatial data is also increasing exponentially. For example, according to the Union of Concerned Scientists Satellite Database (UCS (Union of Concerned Scientists), 2019), there are 2062 operational satellites currently in orbit around the Earth. These satellites produce huge amounts of remote sensing images with various spatial, temporal and spectral resolutions. Since the 1950s, in order to promote the full sharing and value-added utility of geospatial data, developed countries as well as some developing countries have launched a number of geospatial data-sharing initiatives and programs that have established many spatial data infrastructures (SDI), data centers, and data-sharing platforms around the world (Bai & Di, 2010; Goodchild, Fu, & Rich, 2007; Harvey & Tulloch, 2006; Hu, Janowicz, Prasad, & Gao, 2015) and great achievements have been made in geospatial data sharing. For instance, the Data Sharing Service System of CAS Earth (http://data.casearth.cn/) launched by the Chinese Academy of Sciences has integrated 5.02PB data resources since 2018, and the National Integrated Earth Observation Data Sharing Platform of China (http://www.chinageoss.org) has collected 2,140,000 scenes acquired by 9 land-observation satellites, 1,440,000 scenes from meteorological satellites, and 10,000 scenes from ocean-observation satellites. However, for most geospatial data-sharing projects, top-down organizational mechanisms are usually adopted, which results in networks with large amounts of data remaining in the hands of individual scientists and not being effectively shared. In terms of technology, the existing geospatial data-sharing platforms mainly use simple keyword-matching to search metadata, which leads to incomplete and inaccurate search results due to the lack of semantic reasoning. Therefore, in terms of the mechanisms used, geospatial data-sharing is moving towards a combination of top-down and bottom-up mechanisms and, in technical terms, towards precise data searches and BIG EARTH DATA 2019, VOL. 3, NO. 3, 187–190 https://doi.org/10.1080/20964471.2019.1652003

[1]  M. Goodchild,et al.  Sharing Geographic Information: An Assessment of the Geospatial One-Stop , 2007 .

[2]  Michiel Blind,et al.  Towards an Open Modelling Interface (OpenMI) the HarmonIT project , 2005 .

[3]  Min Feng,et al.  The development and application of e-Geoscience in China , 2016, Inf. Syst. Frontiers.

[4]  Francis Harvey,et al.  Local‐government data sharing: Evaluating the foundations of spatial data infrastructures , 2006, Int. J. Geogr. Inf. Sci..

[5]  Stefano Nativi,et al.  Environmental model access and interoperability: The GEO Model Web initiative , 2013, Environ. Model. Softw..

[6]  A-Xing Zhu,et al.  A similarity-based automatic data recommendation approach for geographic models , 2017, Int. J. Geogr. Inf. Sci..

[7]  Huadong Guo,et al.  Big data drives the development of Earth science , 2017 .

[8]  Krzysztof Janowicz,et al.  Metadata Topic Harmonization and Semantic Search for Linked‐Data‐Driven Geoportals: A Case Study Using ArcGIS Online , 2015, Trans. GIS.

[9]  P. Wallace The Union of Concerned Scientists , 2008 .

[10]  Jae-Gil Lee,et al.  Geospatial Big Data: Challenges and Opportunities , 2015, Big Data Res..

[11]  Tony Hey,et al.  The Fourth Paradigm: Data-Intensive Scientific Discovery , 2009 .

[12]  Songshan Yue,et al.  A data description model for reusing, sharing and integrating geo-analysis models , 2015, Environmental Earth Sciences.

[13]  Hao Jiang,et al.  Big Earth Data: a new challenge and opportunity for Digital Earth’s development , 2017, Int. J. Digit. Earth.

[14]  Chenghu Zhou,et al.  Spatial prediction based on Third Law of Geography , 2018, Ann. GIS.

[15]  Yuqi Bai,et al.  Review of Geospatial Data Systems' Support of Global Change Studies , 2013 .

[16]  Min Feng,et al.  Prototyping an online wetland ecosystem services model using open model sharing standards , 2011, Environ. Model. Softw..