Spatial Data Reduction Through Element-of-Interest (EOI) Extraction

Any large, multifaceted data collection that is challenging to handle with traditional management practices can be branded ‘Big Data.’ Any big data containing geo-referenced attributes can be considered big geospatial data. The increased proliferation of big geospatial data is currently reforming the geospatial industry into a data-driven enterprise. Challenges in the big spatial data domain can be summarized as the ‘Big Vs’ – variety, volume, velocity, veracity and value. Big spatial data sources can be considered in two broad classes, active and passive, as each is impacted to varying degrees. Some of these challenges may be alleviated by reducing unprocessed, or minimally processed, (raw) data to features, which we refer to as the extraction of Elements of Interest (EOI). In fact, many applications require EOI extraction from raw data to enable their basic employment. This chapter presents current state-of-the-art methods to create EOI from some types of georeferenced big data. We classify the data types into two realms: active and passive. Active data are those collected specifically for the purpose to which they are applied. Passive data are those collected for purposes other than those for which they are utilized, included those ‘collected’ for no particular purpose at all. The chapter then presents use cases from both the active and passive spatial realms, including the active applications of terrain feature extraction from digital elevation models and vegetation mapping from remotely-sensed imagery and passive applications like building identification from VGI and point-of-interest data mining from social networks for land use classification. Finally, the chapter concludes with future research needs.

[1]  M. Kanevski,et al.  Analysis and modelling of spatial environmental data , 2004 .

[2]  Julien Michel,et al.  Stable Mean-Shift Algorithm and Its Application to the Segmentation of Arbitrarily Large Remote Sensing Images , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[3]  R. Kitchin,et al.  Big data and human geography , 2013 .

[4]  Thomas Blaschke,et al.  Geographic Object-Based Image Analysis – Towards a new paradigm , 2014, ISPRS journal of photogrammetry and remote sensing : official publication of the International Society for Photogrammetry and Remote Sensing.

[5]  Shigeo Takahashi Algorithms for Extracting Surface Topology from Digital Elevation Models , 2006 .

[6]  Shashi Shekhar,et al.  Spatial big-data challenges intersecting mobility and cloud computing , 2012, MobiDE '12.

[7]  Timothy A. Warner,et al.  Estimation of optimal image object size for the segmentation of forest stands with multispectral IKONOS imagery , 2008 .

[8]  Martin Raubal,et al.  Graph Convolutional Neural Networks for Human Activity Purpose Imputation , 2018, NIPS 2018.

[9]  Hankui K. Zhang,et al.  Finer resolution observation and monitoring of global land cover: first mapping results with Landsat TM and ETM+ data , 2013 .

[10]  Gerald Weber,et al.  Efficient Processing of Stream Data over Persistent Data , 2013 .

[11]  Mark Graham,et al.  Geography and the future of big data, big data and the future of geography , 2013 .

[12]  I. Evans,et al.  Transformation (normalization) of slope gradient and surface curvatures, automated for statistical analyses from DEMs , 2015 .

[13]  Krzysztof Janowicz,et al.  GeoAI: spatially explicit artificial intelligence techniques for geographic knowledge discovery and beyond , 2019, Int. J. Geogr. Inf. Sci..

[14]  Clemens Eisank,et al.  Automated object-based classification of topography from SRTM data , 2012, Geomorphology.

[15]  Caspar A. Mücher,et al.  SegOptim - A new R package for optimizing object-based image analyses of high-spatial resolution remotely-sensed data , 2019, Int. J. Appl. Earth Obs. Geoinformation.

[16]  Josef Strobl,et al.  What’s wrong with pixels? Some recent developments interfacing remote sensing and GIS , 2001 .

[17]  Hongbo Su,et al.  Soil salinity mapping in Everglades National Park using remote sensing techniques and vegetation salt tolerance , 2017, Physics and Chemistry of the Earth, Parts A/B/C.

[18]  Erik Cambria,et al.  Big Social Data Analysis , 2013 .

[19]  Ali Daud,et al.  Towards Deep Learning Prospects: Insights for Social Media Analytics , 2019, IEEE Access.

[20]  J. A. Quintanilha,et al.  Regional and local topography subdivision and landform mapping using SRTM-derived data: a case study in southeastern Brazil , 2015, Environmental Earth Sciences.

[21]  Dieter Pfoser,et al.  A vocabulary recommendation method for spatiotemporal data discovery based on Bayesian network and ontologies , 2019 .

[22]  Min Zhang,et al.  Scale parameter selection by spatial statistics for GeOBIA: Using mean-shift based multi-scale segmentation as an example , 2015 .

[23]  Timothy A. Warner,et al.  Implementation of machine-learning classification in remote sensing: an applied review , 2018 .

[24]  Daniel A. Keim,et al.  Visual Analytics: Definition, Process, and Challenges , 2008, Information Visualization.

[25]  Christopher M. Danforth,et al.  The Geography of Happiness: Connecting Twitter Sentiment and Expression, Demographics, and Objective Characteristics of Place , 2013, PloS one.

[26]  Christophe Nicolle,et al.  Understandable Big Data: A survey , 2015, Comput. Sci. Rev..

[27]  Arno Schäpe,et al.  Multiresolution Segmentation : an optimization approach for high quality multi-scale image segmentation , 2000 .

[28]  Jiangye Yuan,et al.  Exploiting deep learning and volunteered geographic information for mapping buildings in Kano, Nigeria , 2018, Scientific Data.

[29]  Peng Gong,et al.  Mapping vegetation heights in China using slope correction ICESat data, SRTM, MODIS-derived and climate data , 2017 .

[30]  Gotthard Meinel,et al.  Automatic identification of building types based on topographic databases – a comparison of different data sources , 2015 .

[31]  C. L. Philip Chen,et al.  Data-intensive applications, challenges, techniques and technologies: A survey on Big Data , 2014, Inf. Sci..

[32]  Bo Wang,et al.  Discrepancy measures for selecting optimal combination of parameter values in object-based image analysis , 2012 .

[33]  Carol Miller,et al.  A spatial evaluation of global wildfire-water risks to human and natural systems. , 2018, The Science of the total environment.

[34]  Michael F. Goodchild,et al.  The quality of big (geo)data , 2013 .

[35]  Taghi M. Khoshgoftaar,et al.  Deep learning applications and challenges in big data analytics , 2015, Journal of Big Data.

[36]  William C. Bennett,et al.  Author Correction: Imaging and clinical data archive for head and neck squamous cell carcinoma patients treated with radiotherapy , 2018, Scientific Data.

[37]  Luca Scrucca,et al.  GA: A Package for Genetic Algorithms in R , 2013 .

[38]  Guoqing Li,et al.  Big spatial vector data management: a review , 2018 .

[39]  Mikhail Kanevski,et al.  Machine Learning Algorithms for GeoSpatial Data. Applications and Software Tools , 2008 .

[40]  David Riaño,et al.  Spectral mapping methods applied to LiDAR data: Application to fuel type mapping , 2019, Int. J. Appl. Earth Obs. Geoinformation.

[41]  Shashi Shekhar,et al.  Identifying patterns in spatial information: A survey of methods , 2011, WIREs Data Mining Knowl. Discov..

[42]  Yichun Xie,et al.  Remote sensing imagery in vegetation mapping: a review , 2008 .

[43]  Willem Bouten,et al.  Multi-scale and object-oriented image analysis of high-res LiDAR data for geomorphological mapping in alpine mountains , 2009 .

[44]  D. Boyd,et al.  CRITICAL QUESTIONS FOR BIG DATA , 2012 .

[45]  Zhenlong Li,et al.  Big Data and cloud computing: innovation opportunities and challenges , 2017, Int. J. Digit. Earth.

[46]  S. Arundel,et al.  Pairing semantics and object-based image analysis for national terrain mapping - a first-case scenario of cirques , 2016 .

[47]  Göran Ståhl,et al.  Forest biomass estimation over three distinct forest types using TanDEM-X InSAR data and simulated GEDI lidar data , 2019, Remote Sensing of Environment.

[48]  M. Shimada,et al.  Mapping the spatial-temporal variability of tropical forests by ALOS-2 L-band SAR big data analysis , 2019, Remote Sensing of Environment.

[49]  David M. Mark,et al.  Cognition-Based Extraction and Modelling of Topographic Eminences , 2010, Cartogr. Int. J. Geogr. Inf. Geovisualization.

[50]  Michael J. Starek,et al.  Unmanned aircraft system-derived crop height and normalized difference vegetation index metrics for sorghum yield and aphid stress assessment , 2017 .

[51]  Bin Jiang,et al.  Geospatial Big Data Handling Theory and Methods: A Review and Research Challenges , 2015, ArXiv.

[52]  Josef Strobl,et al.  Object-based classification of landforms based on their local geometry and geomorphometric context , 2011, Int. J. Geogr. Inf. Sci..

[53]  Jameela Al-Jaroodi,et al.  Applications of big data to smart cities , 2015, Journal of Internet Services and Applications.

[54]  F. Cánovas-García,et al.  A local approach to optimize the scale parameter in multiresolution segmentation for multispectral imagery , 2015 .

[55]  Bo Xu,et al.  GEOBIA vegetation mapping in Great Smoky Mountains National Park with spectral and non-spectral ancillary information. , 2010 .

[56]  Richard L. Church,et al.  UC Office of the President Recent Work Title An efficient measure of compactness for two-dimensional shapes and its application in regionalization problems Permalink , 2013 .

[57]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[58]  Thomas Blaschke,et al.  Object based image analysis for remote sensing , 2010 .

[59]  Hongming Zhou,et al.  Representational Learning with ELMs for Big Data , 2013 .

[60]  Mikhail Kanevski,et al.  Extreme Learning Machines for spatial environmental data , 2015, Comput. Geosci..

[61]  Dipankar Das,et al.  Enhanced SenticNet with Affective Labels for Concept-Based Opinion Mining , 2013, IEEE Intelligent Systems.

[62]  M. Goodchild,et al.  Data-driven geography , 2014, GeoJournal.

[63]  R. Valbuena,et al.  Mapping forest structural heterogeneity of tropical montane forest remnants from airborne laser scanning and Landsat time series , 2020 .

[64]  G. Miliaresis,et al.  Quantitative representation of mountain objects extracted from the global digital elevation model (GTOPO30) , 2002 .

[65]  Jan Dirk Wegner,et al.  Country-wide high-resolution vegetation height mapping with Sentinel-2 , 2019, Remote Sensing of Environment.

[66]  Francisco C. Pereira,et al.  Mining point-of-interest data from social networks for urban land use classification and disaggregation , 2015, Comput. Environ. Urban Syst..

[67]  Geoffrey J. Hay,et al.  Image objects and geographic objects , 2008 .

[68]  John R. Jensen,et al.  Introductory Digital Image Processing: A Remote Sensing Perspective , 1986 .

[69]  Jae-Gil Lee,et al.  Geospatial Big Data: Challenges and Opportunities , 2015, Big Data Res..

[70]  Chaogui Kang,et al.  Social Sensing: A New Approach to Understanding Our Socioeconomic Environments , 2015 .

[71]  N. Reid,et al.  Carbon storage value of native vegetation on a subhumid–semi-arid floodplain , 2013, Crop and Pasture Science.

[72]  Dirk Tiede,et al.  ESP: a tool to estimate scale parameter for multiresolution image segmentation of remotely sensed data , 2010, Int. J. Geogr. Inf. Sci..

[73]  Friedrich Quiel,et al.  Geomorphometric feature analysis using morphometric parameterization and artificial neural networks , 2008 .

[74]  Geoffrey J. Hay,et al.  Geographic object-based image analysis (GEOBIA): emerging trends and future opportunities , 2018 .

[75]  E. L. Usery,et al.  Automated classification of generic terrain features in digital elevation models , 1993 .