论文信息 - Multi-Scale Representation Learning for Spatial Feature Distributions using Grid Cells

Multi-Scale Representation Learning for Spatial Feature Distributions using Grid Cells

Unsupervised text encoding models have recently fueled substantial progress in Natural Language Processing (NLP). The key idea is to use neural networks to convert words in texts to vector space representations (embeddings) based on word positions in a sentence and their contexts. We see a strikingly similar situation in spatial analysis, which focuses on incorporating both absolute positions and spatial contexts of geographic objects such as Points of Interest (POIs) into models. A general space encoding method is valuable for a multitude of tasks such asPOI search, land use classification, point-based spatial interpolation and locationaware image classification. However, no such general model exists to date beyond simply applying discretizing or feed forward nets to coordinates, and little effort has been put into jointly modeling distributions with vastly different characteristics, which commonly emerges from GIS data. Meanwhile, Nobel Prize-winning Neuroscience research shows that grid cells in mammals provide a multi-scale periodic representation that functions as a metric for encoding space and are critical for recognizing places and for path-integration. Inspired by this research, wepropose a representation learning model called Space2vec to encode the absolutepositions and spatial relationships of places. We conduct experiments on realworld geographic data and predict types of POIs at given positions based on their1) locations and 2) nearby POIs. Results show that because of its multi-scale representations Space2vec outperforms well-established ML approaches such as RBF kernels, multi-layer feed forward nets, and tile embedding approaches.

[1] G. Baudat,et al. Kernel-based methods and function approximation , 2001, IJCNN'01. International Joint Conference on Neural Networks. Proceedings (Cat. No.01CH37222).

[2] Nataliia Kussul,et al. Deep Learning Classification of Land Cover and Crop Types Using Remote Sensing Data , 2017, IEEE Geoscience and Remote Sensing Letters.

[3] Simon D. Jones,et al. Investigating species-environment relationships at multiple scales: Differentiating between intrinsic scale and the modifiable areal unit problem , 2012 .

[4] A S Fotheringham,et al. The Modifiable Areal Unit Problem in Multivariate Statistical Analysis , 1991 .

[5] Herman J. Bierens,et al. The Nadaraya-Watson Kernel regression function estimator , 1988 .

[6] Yang Song,et al. Geo-Aware Networks for Fine-Grained Recognition , 2019, 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW).

[7] Krzysztof Janowicz,et al. POI Pulse: A Multi-granular, Semantic Signature–Based Information Observatory for the Interactive Visualization of Big Geosocial Data , 2015, Cartogr. Int. J. Geogr. Inf. Geovisualization.

[8] Fei-Fei Li,et al. Improving Image Classification with Location Context , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[9] Ni Lao,et al. GeoSVM: an efficient and effective tool to predict species' potential distributions , 2008, Journal of Plant Ecology.

[10] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[11] H. T. Blair,et al. Scale-Invariant Memory Representations Emerge from Moiré Interference between Grid Fields That Produce Theta Oscillations: A Computational Model , 2007, The Journal of Neuroscience.

[12] Krzysztof Janowicz,et al. From ITDL to Place2Vec: Reasoning About Place Type Similarity and Relatedness by Learning Embeddings From Augmented Spatial Contexts , 2017, SIGSPATIAL/GIS.

[13] P. Dixon. Ripley's K Function , 2006 .

[14] Song-Chun Zhu,et al. Learning Grid Cells as Vector Representation of Self-Position Coupled with Matrix Representation of Self-Motion , 2018, ICLR.

[15] Seung Woo Lee,et al. Birdsnap: Large-Scale Fine-Grained Visual Categorization of Birds , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[16] Razvan Pascanu,et al. Vector-based navigation using grid-like representations in artificial agents , 2018, Nature.

[17] Pietro Liò,et al. Graph Attention Networks , 2017, ICLR.

[18] Richard Field,et al. Refining area of occupancy to address the modifiable areal unit problem in ecology and conservation , 2018, Conservation biology : the journal of the Society for Conservation Biology.

[19] Leonidas J. Guibas,et al. PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[20] Cyrus Shahabi,et al. Diffusion Convolutional Recurrent Neural Network: Data-Driven Traffic Forecasting , 2017, ICLR.

[21] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.

[22] Krzysztof Janowicz,et al. Relaxing Unanswerable Geographic Questions Using A Spatially Explicit Knowledge Graph Embedding Model , 2019, AGILE Conference.

[23] Stan Openshaw,et al. Modifiable Areal Unit Problem , 2008, Encyclopedia of GIS.

[24] Krzysztof Janowicz,et al. Contextual Graph Attention for Answering Logical Queries over Incomplete Knowledge Graphs , 2019, K-CAP.

[25] Krzysztof Janowicz,et al. xNet+SC: Classifying Places Based on Images by Incorporating Spatial Contexts , 2018, GIScience.

[26] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[27] Luke S. Zettlemoyer,et al. Deep Contextualized Word Representations , 2018, NAACL.

[28] V. Maz'ya,et al. On approximate approximations using Gaussian kernels , 1996 .

[29] Alison Abbott,et al. Nobel prize for decoding brain’s sense of place , 2014, Nature.

[30] Sergey Ioffe,et al. Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[31] Pietro Perona,et al. Presence-Only Geographical Priors for Fine-Grained Image Classification , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[32] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[33] Xue-Xin Wei,et al. Emergence of grid-like representations by training recurrent neural networks to perform spatial localization , 2018, ICLR.

[34] Alexander J. Smola,et al. Deep Sets , 2017, 1703.06114.

[35] Mark Gahegan,et al. Frankenplace: Interactive Thematic Mapping for Ad Hoc Exploratory Search , 2015, WWW.