Unsupervised embedding of trajectories captures the latent structure of mobility

Human mobility drives major societal phenomena including epidemics, economies, and innovation. Historically, mobility was constrained by geographic distance, however, in the globalizing world, language, culture, and history are increasingly important. Here, we show a mathematical equivalence between word2vec model and the gravity model of mobility and demonstrate that, by using three human trajectory datasets, word2vec encodes nuanced relationships between locations into a systematic and meaningful vector-space, providing a functional distance between locations, as well as a representation for studying the many dimensions of mobility. Focusing on the case of scientific mobility, we show that embeddings implicitly learn cultural, linguistic, and hierarchical relationships at multiple levels of granularity. Connecting neural embeddings to the gravity model opens up new avenues for the study of mobility.

[1]  Yong Li,et al.  Habit2vec: Trajectory Semantic Embedding for Living Pattern Recognition in Population , 2020, IEEE Transactions on Mobile Computing.

[2]  P. Braunerhjelm,et al.  Labour market mobility, knowledge diffusion and innovation , 2020, European Economic Review.

[3]  Nuno R. Faria,et al.  The effect of human mobility and control measures on the COVID-19 epidemic in China , 2020, Science.

[4]  Nees Jan van Eck,et al.  Collecting large-scale publication data at the level of individual researchers: a practical proposal for author name disambiguation , 2020, Scientometrics.

[5]  Li Linzhuo,et al.  Social centralization and semantic collapse: Hyperbolic embeddings of networks and text , 2020 .

[6]  Lingfei Wu,et al.  Social Centralization and Semantic Collapse: Hyperbolic Embeddings of Networks and Text , 2020, Poetics.

[7]  Olga Kononova,et al.  Unsupervised word embeddings capture latent knowledge from materials science literature , 2019, Nature.

[8]  C. Wagner,et al.  Returning Scientists and the Emergence of China’s Science System , 2019, Science and Public Policy.

[9]  Frank Schweitzer,et al.  The mobility network of scientists: analyzing temporal correlations in scientific careers , 2019, Applied Network Science.

[10]  Alessandro Crivellari,et al.  From Motion Activity to Geo-Embeddings: Generating and Exploring Vector Representations of Locations, Traces and Visitors through Large-Scale Mobility Data , 2019, ISPRS Int. J. Geo Inf..

[11]  Leland McInnes,et al.  UMAP: Uniform Manifold Approximation and Projection , 2018, J. Open Source Softw..

[12]  Alexander M. Petersen,et al.  Multiscale impact of researcher mobility , 2018, Journal of The Royal Society Interface.

[13]  Luca Pappalardo,et al.  Gravity and scaling laws of city to city migration , 2018, PloS one.

[14]  Sultan Orazbayev,et al.  The globalisation of scientific mobility, 1970–2014 , 2018, Applied Geography.

[15]  Hui Xiong,et al.  Representing Urban Functions through Zone Embedding with Human Mobility Patterns , 2018, IJCAI.

[16]  Vincent Larivière,et al.  Scientific mobility indicators in practice: International mobility profiles at the country level , 2018, El Profesional de la Información.

[17]  Jisun An,et al.  SemAxis: A Lightweight Framework to Characterize Domain-Specific Word Semantics Beyond Sentiment , 2018, ACL.

[18]  Ulrich Kaiser,et al.  Experience matters: The role of academic scientist mobility for industrial innovation , 2018, Strategic Management Journal.

[19]  Dimitrios J. Economou,et al.  Prestige drives epistemic inequality in the diffusion of scientific ideas , 2018, EPJ Data Science.

[20]  Cassidy R. Sugimoto,et al.  A Global Comparison of Scientific Mobility and Collaboration According to National Scientific Capacities , 2018, Front. Res. Metr. Anal..

[21]  Cassidy R. Sugimoto,et al.  Travel bans and scientific mobility: utility of asymmetry and affinity indexes to inform science policy , 2018, Scientometrics.

[22]  Matt Taddy,et al.  The Geometry of Culture: Analyzing the Meanings of Class through Word Embeddings , 2018, American Sociological Review.

[23]  Vincent Larivière,et al.  The many faces of mobility: Using bibliometric data to measure the movement of scientists , 2018, J. Informetrics.

[24]  Juyong Park,et al.  Why do people move? Enhancing human mobility prediction using local functions based on public records and SNS data , 2018, PloS one.

[25]  Daniel Jurafsky,et al.  Word embeddings quantify 100 years of gender and ethnic stereotypes , 2017, Proceedings of the National Academy of Sciences.

[26]  C. Wagner,et al.  Open countries have strong science , 2017, Nature.

[27]  Vincent Larivière,et al.  Scientists have most impact when they're free to move , 2017, Nature.

[28]  Yong-Yeol Ahn,et al.  Element-centric clustering comparison unifies overlaps and hierarchy , 2017, Scientific Reports.

[29]  S. Hainsworth,et al.  A CRITICAL ASSESSMENT , 2014 .

[30]  Javier Ruiz-Castillo,et al.  Geographic mobility and research productivity in a selection of top world economics departments , 2017, Scientometrics.

[31]  Giuliano Armano,et al.  The Beneficial Role of Mobility for the Emergence of Innovation , 2017, Scientific Reports.

[32]  Bo An,et al.  POI2Vec: Geographical Latent Representation for Predicting Future Visitors , 2017, AAAI.

[33]  Woo-Sung Jung,et al.  Application of gravity model on the Korean urban bus network , 2016 .

[34]  Supun Chathuranga Nakandala,et al.  Gendered Conversation in a Social Game-Streaming Platform , 2016, ICWSM.

[35]  L. Nimrichter,et al.  The benefits of scientific mobility and international collaboration. , 2016, FEMS microbiology letters.

[36]  Philipp Koehn,et al.  Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , 2016 .

[37]  Xin Liu,et al.  Exploring the Context of Locations for Personalized Location Recommendations , 2016, IJCAI.

[38]  Tommi S. Jaakkola,et al.  Word Embeddings as Metric Recovery in Semantic Spaces , 2016, TACL.

[39]  Jure Leskovec,et al.  node2vec: Scalable Feature Learning for Networks , 2016, KDD.

[40]  Yurij L. Katchanov,et al.  Synchronous international scientific mobility in the space of affiliations: evidence from Russia , 2016, SpringerPlus.

[41]  Benjamin J. Wilson,et al.  Measuring Word Significance using Distributed Representations of Words , 2015, ArXiv.

[42]  Omer Levy,et al.  Improving Distributional Similarity with Lessons Learned from Word Embeddings , 2015, TACL.

[43]  Yanguang Chen The distance-decay function of geographical gravity model: Power law or exponential law? , 2015, 1503.02915.

[44]  Daniel B. Larremore,et al.  Systematic inequality and hierarchy in faculty hiring networks , 2015, Science Advances.

[45]  Omer Levy,et al.  Neural Word Embedding as Implicit Matrix Factorization , 2014, NIPS.

[46]  Chris Dyer Notes on Noise Contrastive Estimation and Negative Sampling , 2014, ArXiv.

[47]  Quoc V. Le,et al.  Distributed Representations of Sentences and Documents , 2014, ICML.

[48]  Vincent D. Blondel,et al.  Career on the Move: Geography, Stratification, and Scientific Impact , 2014, Scientific Reports.

[49]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[50]  K. Jonkers,et al.  Research upon return: The effect of international mobility on scientific ties, production and impact , 2013 .

[51]  Alan Wilson,et al.  Entropy in Urban and Regional Modelling (Routledge Revivals) , 2013 .

[52]  Paula E. Stephan,et al.  Foreign-born scientists: mobility patterns for 16 countries , 2012, Nature Biotechnology.

[53]  Neil M. Ferguson,et al.  Evaluating the Adequacy of Gravity Models as a Description of Human Mobility for Epidemic Modelling , 2012, PLoS Comput. Biol..

[54]  Thed N. van Leeuwen,et al.  The Leiden ranking 2011/2012: Data collection, indicators, and interpretation , 2012, J. Assoc. Inf. Sci. Technol..

[55]  Marta C. González,et al.  A universal model for mobility and migration patterns , 2011, Nature.

[56]  Javier Otamendi,et al.  International temporary mobility of researchers: a cross-discipline study , 2011, Scientometrics.

[57]  B. Sampat,et al.  The Diffusion of Scientific Knowledge Across Time and Space: Evidence from Professional Transitions for the Superstars of Medicine , 2011 .

[58]  William Kerr,et al.  Economic Impacts of Immigration: A Survey , 2011 .

[59]  Marc Barthelemy,et al.  Spatial Networks , 2010, Encyclopedia of Social Network Analysis and Mining.

[60]  Michael T. Gastner,et al.  The complex network of global cargo ship movements , 2010, Journal of The Royal Society Interface.

[61]  Ulf Sandström,et al.  Combining curriculum vitae and bibliometric analysis: mobility, gender and research performance , 2009 .

[62]  Tim Turpin,et al.  CV analysis as a complementary methodological approach: investigating the mobility of Australian scientists , 2009 .

[63]  H. Stanley,et al.  Gravity model in the Korean highway , 2007, 0710.1274.

[64]  Gilad Lerman,et al.  Defining functional distance using manifold embeddings of gene ontology annotations , 2007, Proceedings of the National Academy of Sciences.

[65]  R. Boschma Proximity and Innovation: A Critical Assessment , 2005 .

[66]  John P A Ioannidis,et al.  Global estimates of high‐level brain drain and deficit , 2004, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[67]  Mikhail Belkin,et al.  Laplacian Eigenmaps for Dimensionality Reduction and Data Representation , 2003, Neural Computation.

[68]  Jennifer Widom,et al.  Scaling personalized web search , 2003, WWW '03.

[69]  J. Gaillard,et al.  The International Circulation of Scientists and Technologists , 1998 .

[70]  Reginald G. Golledge,et al.  Migration, Functional Distance, and the Urban Hierarchy , 1970 .

[71]  G. Zipf The P 1 P 2 D Hypothesis: On the Intercity Movement of Persons , 1946 .

[72]  C. W. Greene,et al.  THE FEDERATION OF AMERICAN SOCIETIES FOR EXPERIMENTAL BIOLOGY. , 1917, Science.

[73]  Mobility and Its Impact: Data and Evidence (The Global Competition for Talent: Mobility of the Highly Skilled) , 2019 .

[74]  Paula E. Stephan,et al.  The mover’s advantage: The superior performance of migrant scientists , 2014 .

[75]  Nees Jan van Eck,et al.  Large scale author name disambiguation using rule-based scoring and clustering , 2014 .

[76]  Yoshua Bengio,et al.  Hierarchical Probabilistic Neural Network Language Model , 2005, AISTATS.