Comparing semantically-blind and semantically-aware landscape similarity measures with application to query-by-content and regionalization

Abstract Local landscapes can be assessed for a degree of mutual similarity between their patterns using a methodology pioneered in the domain of Content-Based Image Retrieval (CBIR). They are represented by multivariate histograms of selected pattern features and their similarity is calculated as the similarity between those histograms. Potential similarity functions include semantically-blind measures — designed to compare only the patterns without taking into account intrinsic similarities between the features, and semantically-aware measures — that also take into account intrinsic (semantic) alikeness between landscape features. In this paper we compare the performance of these two types of landscape similarity measures. Two measures are evaluated, the semantically-blind Jensen–Shanon (JS) and the semantically-aware Earth Mover's Distance (EMD). The performance of these two measures is evaluated empirically by running tests pertaining to their potential practical applications: a query-by-content and regionalization into landscape types. Variants of the JS and EMD corresponding to 1D and 2D histograms are applied to the two test sites taken from the NLCD 2006. Each test site is divided into 400 3 × 3 km areal units serving as local landscapes. For each test site extensive query-by-content and regionalization calculations are run using different variants of JS and EMD. The results are compared using statistics and visual assessment. The EMD has been found to perform better than the JS, but the difference in performance is not pronounced, especially if a 1D histogram representation of landscapes is used. For the 2D histogram representation of landscapes the EMD consistently yields better results than the JS but suffers from high computational cost. The EMD also enables the comparison of landscapes having different legends.

[1]  Leonidas J. Guibas,et al.  A metric for distributions with applications to image databases , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[2]  James D. Wickham,et al.  Mapping and analyzing landscape patterns , 1994, Landscape Ecology.

[3]  Bruce T. Milne,et al.  Indices of landscape pattern , 1988, Landscape Ecology.

[4]  Roger White,et al.  Pattern based map comparisons , 2006, J. Geogr. Syst..

[5]  Sung-Hyuk Cha Comprehensive Survey on Distance/Similarity Measures between Probability Density Functions , 2007 .

[6]  Jens Vygen,et al.  The Book Review Column1 , 2020, SIGACT News.

[7]  Haibin Ling,et al.  Diffusion Distance for Histogram Comparison , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[8]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[9]  Tomasz F. Stepinski,et al.  Connected components labeling for giga-cell multi-categorical rasters , 2013, Comput. Geosci..

[10]  N. Mantel The detection of disease clustering and a generalized regression approach. , 1967, Cancer research.

[11]  Sing Bing Kang,et al.  Emerging Topics in Computer Vision , 2004 .

[12]  Jianhua Lin,et al.  Divergence measures based on the Shannon entropy , 1991, IEEE Trans. Inf. Theory.

[13]  Arnold W. M. Smeulders,et al.  Content-based image retrieval by viewpoint-invariant color indexing , 1999, Image Vis. Comput..

[14]  Tomasz F. Stepinski,et al.  LandEx—A GeoWeb Tool for Query and Retrieval of Spatial Patterns in Land Cover Datasets , 2014, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[15]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[16]  Walter G. Whitford,et al.  Monitoring and Assessment , 2020, Ecology of Desert Systems.

[17]  Joachim M. Buhmann,et al.  Empirical Evaluation of Dissimilarity Measures for Color and Texture , 2001, Comput. Vis. Image Underst..

[18]  Tomasz F. Stepinski,et al.  Example-Based Retrieval of Alike Land-Cover Scenes From NLCD2006 Database , 2013, IEEE Geoscience and Remote Sensing Letters.

[19]  Claudio Gutierrez,et al.  Survey of graph database models , 2008, CSUR.

[20]  Anjali Goswami,et al.  Modularity for Mathematica , 2010 .

[21]  Leland Wilkinson,et al.  The History of the Cluster Heat Map , 2009 .

[22]  K. Riitters,et al.  A multi-scale analysis of landscape statistics , 1997, Landscape Ecology.

[23]  Peng Gao,et al.  Regionalization of forest pattern metrics for the continental United States using contiguity constrained clustering and partitioning , 2012, Ecol. Informatics.

[24]  Simon Urbanek,et al.  Unsupervised clustering of multidimensional distributions using earth mover distance , 2011, KDD.

[25]  T. Allen,et al.  Spatial and compositional pattern of alpine treeline, Glacier National Park, Montana , 1996 .

[26]  James Ze Wang,et al.  Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.

[27]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[28]  F. Herzog,et al.  Supplementing Land-Use Statistics with Landscape Metrics: Some Methodological Considerations , 2001, Environmental monitoring and assessment.

[29]  G. Medioni,et al.  Content-based image retrieval: an overview , 2004 .

[30]  Arnold W. M. Smeulders,et al.  Content-Based Image Retrieval , 2004 .

[31]  R. Haines-Young,et al.  Quantifying landscape structure: a review of landscape indices and their application to forested landscapes , 1996 .

[32]  Nicu Sebe,et al.  Content-based multimedia information retrieval: State of the art and challenges , 2006, TOMCCAP.

[33]  T. Nelson,et al.  Regionalization of Landscape Pattern Indices Using Multivariate Cluster Analysis , 2010, Environmental management.

[34]  Ola Ahlqvist,et al.  Extending post-classification change detection using semantic similarity metrics to overcome class heterogeneity: A study of 1992 and 2001 U.S. National Land Cover Database changes , 2008 .

[35]  T. Stepinski,et al.  Regionalization of multi-categorical landscapes using machine vision methods , 2013 .

[36]  Azriel Rosenfeld,et al.  Sequential Operations in Digital Picture Processing , 1966, JACM.

[37]  Nicholas J. Clifford,et al.  Progress in Physical Geography: retrospect and prospect 2009—10 , 2009 .

[38]  Xueqi Cheng,et al.  Spectral methods for the detection of network community structure: a comparative analysis , 2010, ArXiv.

[39]  Leonidas J. Guibas,et al.  The Earth Mover's Distance as a Metric for Image Retrieval , 2000, International Journal of Computer Vision.

[40]  F. Schönhofer Monitoring and Assessment , 1991 .

[41]  David W. Jacobs,et al.  Approximate earth mover’s distance in linear time , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.