GeoAttn: Localization of Social Media Messages via Attentional Memory Network

Recent studies have demonstrated notable success in leveraging geo-tagged social media data for applications such as event detection, location recommendation, and mobile healthcare. However, in most real-life social media streams, only a small percentage of messages carry explicit geo-location metadata, which keeps the full potential of social media from being realized. We study the problem of inferring geo-locations from social media messages. While a number of text-based geo-locating techniques have been proposed, they either fall short of automatically identifying indicative keywords in noisy social media posts or do not integrate rich prior knowledge of geographical regions. We propose an attentive memory network called GeoAttn for localization of social media messages. To capture indicative keywords for location inference, GeoAttn includes an attentive message encoder, which selectively focuses on location-indicative terms to derive a discriminative message representation. The message embedding is then fed into a key-value memory network, which selectively attends to relevant Points-of-Interest (POIs) for location prediction. The message encoder and the key-value memory network are jointly trained in an end-to-end manner. The attention mechanisms in GeoAttn not only suppress noisy information for higher prediction accuracy, but also provide interpretable attention scores that rationalize the predictions. Our experiments on a million-scale geo-tagged tweet dataset show that GeoAttn outperforms previous state-of-the-art location prediction methods by 15.5% in mean error distance, and is capable of locating over half of the tweets within 5 km.
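The two attention stages described above can be illustrated with a minimal sketch. It is not the GeoAttn implementation: the dot-product scoring, the fixed query vector, the toy embeddings, and the choice to output an attention-weighted mean of POI coordinates are all assumptions made for illustration, and the function names (`encode_message`, `read_memory`) are hypothetical. In the actual model these components are learned jointly.

```python
import math

def softmax(scores):
    # Numerically stable softmax over a list of floats.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def weighted_sum(weights, vectors):
    dim = len(vectors[0])
    return [sum(w * vec[i] for w, vec in zip(weights, vectors)) for i in range(dim)]

def encode_message(word_vecs, query):
    # Stage 1 (assumed form): attention pooling over word embeddings.
    # Words that score highly against the query dominate the message vector.
    weights = softmax([dot(query, w) for w in word_vecs])
    return weighted_sum(weights, word_vecs), weights

def read_memory(message_vec, poi_keys, poi_coords):
    # Stage 2 (assumed form): key-value read. Keys are POI embeddings,
    # values are POI coordinates; the output is the attention-weighted mean.
    weights = softmax([dot(message_vec, k) for k in poi_keys])
    return weighted_sum(weights, poi_coords), weights

# Toy data (hypothetical): three words, two POIs, 2-dimensional embeddings.
word_vecs = [[0.1, 0.0], [2.0, 0.1], [0.0, 0.2]]
query = [1.0, 0.0]
poi_keys = [[1.0, 0.0], [0.0, 1.0]]
poi_coords = [[40.70, -74.00], [34.05, -118.24]]  # (lat, lon) pairs

message_vec, word_weights = encode_message(word_vecs, query)
predicted, poi_weights = read_memory(message_vec, poi_keys, poi_coords)
```

Here `word_weights` and `poi_weights` play the role of the interpretable attention scores: the first shows which words drove the message representation, the second which POIs drove the predicted location. Averaging latitude and longitude directly is a simplification that is only reasonable when the attended POIs are close together.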
