GeoPAT: A toolbox for pattern-based information retrieval from large geospatial databases

Abstract Geospatial Pattern Analysis Toolbox (GeoPAT) is a collection of GRASS GIS modules for carrying out pattern-based geospatial analysis of images and other spatial datasets. The need for pattern-based analysis arises when images/rasters contain rich spatial information either because of their very high resolution or their very large spatial extent. Elementary units of pattern-based analysis are scenes – patches of surface consisting of a complex arrangement of individual pixels (patterns). GeoPAT modules implement popular GIS algorithms, such as query, overlay, and segmentation, to operate on the grid of scenes. To achieve these capabilities GeoPAT includes a library of scene signatures – compact numerical descriptors of patterns, and a library of distance functions – providing numerical means of assessing dissimilarity between scenes. Ancillary GeoPAT modules use these functions to construct a grid of scenes or to assign signatures to individual scenes having regular or irregular geometries. Thus GeoPAT combines knowledge retrieval from patterns with mapping tasks within a single integrated GIS environment. GeoPAT is designed to identify and analyze complex, highly generalized classes in spatial datasets. Examples include distinguishing between different styles of urban settlements using VHR images, delineating different landscape types in land cover maps, and mapping physiographic units from DEM. The concept of pattern-based spatial analysis is explained and the roles of all modules and functions are described. A case study example pertaining to delineation of landscape types in a subregion of NLCD is given. Performance evaluation is included to highlight GeoPAT's applicability to very large datasets. The GeoPAT toolbox is available for download from http://sil.uc.edu/ .

[1]  Ranga Raju Vatsavai Object based image classification: state of the art and computational challenges , 2013, BigSpatial '13.

[2]  Chi-Ren Shyu,et al.  GeoIRIS: Geospatial Information Retrieval and Indexing System—Content Mining, Semantics Modeling, and Complex Queries , 2007, IEEE Transactions on Geoscience and Remote Sensing.

[3]  A. Stein,et al.  Texture-based landform segmentation of LiDAR imagery , 2005 .

[4]  Ranga Raju Vatsavai Gaussian multiple instance learning approach for mapping the slums of the world using very high resolution imagery , 2013, KDD.

[5]  James Ze Wang,et al.  Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.

[6]  Mihai Datcu,et al.  Knowledge-driven Information-Mining in Remote Sensing Image Archives , 2002 .

[7]  Kevin McGarigal,et al.  Parsimony in landscape metrics: Strength, universality, and consistency , 2008 .

[8]  G. Medioni,et al.  Content-based image retrieval: an overview , 2004 .

[9]  J. Roosaare,et al.  Landscape Metrics and Indices: An Overview of Their Use in Landscape Research , 2009 .

[10]  Jianhua Lin,et al.  Divergence measures based on the Shannon entropy , 1991, IEEE Trans. Inf. Theory.

[11]  Qihao Weng,et al.  A survey of image classification methods and techniques for improving classification performance , 2007 .

[12]  Thomas E. Dilts,et al.  The Landscape Similarity Toolbox: new tools for optimizing the location of control sites in experimental studies , 2010 .

[13]  Claudio Gutierrez,et al.  Survey of graph database models , 2008, CSUR.

[14]  John Krumm,et al.  Object recognition with color cooccurrence histograms , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[15]  Mihai Datcu,et al.  Information mining in remote sensing image archives: system evaluation , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[16]  Arnold W. M. Smeulders,et al.  Content-based image retrieval by viewpoint-invariant color indexing , 1999, Image Vis. Comput..

[17]  T. Stepinski,et al.  Geomorphons — a pattern recognition approach to classification and mapping of landforms , 2013 .

[18]  Tomasz F. Stepinski,et al.  LandEx—A GeoWeb Tool for Query and Retrieval of Spatial Patterns in Land Cover Datasets , 2014, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[19]  Leila Maria Garcia Fonseca,et al.  GeoDMA - Geographic Data Mining Analyst , 2013, Comput. Geosci..

[20]  T. Stepinski,et al.  Regionalization of multi-categorical landscapes using machine vision methods , 2013 .

[21]  Mihai Datcu,et al.  Information Mining in Remote Sensing Image Archives , 1999 .

[22]  Thomas Blaschke,et al.  Object-Based Image Analysis , 2008 .

[23]  Selim Aksoy,et al.  Learning bayesian classifiers for scene classification with a visual grammar , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[24]  Tomasz F. Stepinski,et al.  Example-Based Retrieval of Alike Land-Cover Scenes From NLCD2006 Database , 2013, IEEE Geoscience and Remote Sensing Letters.

[25]  T. Stepinski,et al.  Landscape similarity, retrieval, and machine mapping of physiographic units , 2014 .

[26]  Lasse Møller-Jensen,et al.  Large-area urban growth observations—a hierarchical kernel approach based on image texture , 2005 .

[27]  Gilberto Câmara,et al.  Spring: integrating remote sensing and gis by object-oriented data modelling , 1996, Comput. Graph..

[28]  Geoffrey J. Hay,et al.  Free and open source geographic information tools for landscape ecology. Ecol Inf 4:183-195 , 2009 .

[29]  Jeffrey A. Cardille,et al.  From the redwood forest to the Gulf Stream waters: human signature nearly ubiquitous in representative US landscapes , 2010 .

[30]  E. Wentz,et al.  Pattern Analysis Based on Type, Orientation, Size, and Shape , 2008 .

[31]  Selim Aksoy,et al.  VisiMine: interactive mining in image databases , 2002, IEEE International Geoscience and Remote Sensing Symposium.

[32]  Anil M. Cheriyadat,et al.  Image Based Characterization of Formal and Informal Neighborhoods in an Urban Landscape , 2012, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[33]  Stefan Lang,et al.  Object-based image analysis for remote sensing applications: modeling reality – dealing with complexity , 2008 .

[34]  Jeffrey A. Cardille,et al.  Representative Landscapes in the Forested Area of Canada , 2011, Environmental Management.

[35]  Marco Pastori,et al.  Information mining in remote sensing image archives: system concepts , 2003, IEEE Trans. Geosci. Remote. Sens..

[36]  R. G. Pontius Statistical Methods to Partition Effects of Quantity and Location During Comparison of Categorical Maps at Multiple Resolutions , 2002 .

[37]  S. Barr,et al.  INFERRING URBAN LAND USE FROM SATELLITE SENSOR IMAGES USING KERNEL-BASED SPATIAL RECLASSIFICATION , 1996 .

[38]  F. Csillag,et al.  Mutual information spectra for comparing categorical maps , 2006 .

[39]  John A. Richards,et al.  Remote Sensing Digital Image Analysis , 1986 .

[40]  Chi-Ren Shyu,et al.  Visual-Semantic Modeling in Content-Based Geospatial Information Retrieval Using Associative Mining Techniques , 2010, IEEE Geoscience and Remote Sensing Letters.

[41]  Jiang Li Information mining in remote sensing imagery , 2003 .

[42]  K. Riitters,et al.  A multi-scale analysis of landscape statistics , 1997, Landscape Ecology.

[43]  Ram M. Narayanan,et al.  Integrated spectral and spatial information mining in remote sensing imagery , 2004, IEEE Transactions on Geoscience and Remote Sensing.

[44]  Roger Bivand,et al.  Using the R statistical data analysis language on GRASS 5 , 2000 .

[45]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[46]  R. Haines-Young,et al.  Quantifying landscape structure: a review of landscape indices and their application to forested landscapes , 1996 .

[47]  Nicu Sebe,et al.  Content-based multimedia information retrieval: State of the art and challenges , 2006, TOMCCAP.

[48]  T. Nelson,et al.  Regionalization of Landscape Pattern Indices Using Multivariate Cluster Analysis , 2010, Environmental management.

[49]  Leonidas J. Guibas,et al.  The Earth Mover's Distance as a Metric for Image Retrieval , 2000, International Journal of Computer Vision.

[50]  Steven W. Zucker,et al.  Region growing: Childhood and adolescence* , 1976 .

[51]  Shruti Kohli,et al.  Data Analysis with R , 2014, 2014 IEEE/ACM 7th International Conference on Utility and Cloud Computing.

[52]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[53]  Geoffrey J. Hay,et al.  Free and open source geographic information tools for landscape ecology , 2009, Ecol. Informatics.

[54]  Sung-Hyuk Cha Comprehensive Survey on Distance/Similarity Measures between Probability Density Functions , 2007 .

[56]  Shih-Fu Chang,et al.  Image Retrieval: Current Techniques, Promising Directions, and Open Issues , 1999, J. Vis. Commun. Image Represent..

[57]  Thomas Blaschke,et al.  Object based image analysis for remote sensing , 2010 .

[58]  Tomasz F. Stepinski,et al.  Pattern-Based Assessment of Land Cover Change on Continental Scale With Application to NLCD 2001–2006 , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[59]  Peng Gao,et al.  Regionalization of forest pattern metrics for the continental United States using contiguity constrained clustering and partitioning , 2012, Ecol. Informatics.

[60]  Joseph Paul Cohen,et al.  Comparing semantically-blind and semantically-aware landscape similarity measures with application to query-by-content and regionalization , 2014, Ecol. Informatics.