A Deep Learning Approach for Identifying User Communities Based on Geographical Preferences and Its Applications to Urban and Environmental Planning

Understanding human mobility plays a vital role in urban and environmental planning as cities continue to grow. Ubiquitous geo-location, localization technology, and availability of big-data-ready computing infrastructure have enabled the development of more sophisticated models to characterize human mobility in urban areas. In this work, our main goal is to extract spatio-temporal features that characterize user mobility and, based on the similarity of these features, identify user communities. To this end, we propose a novel approach that leverages image processing techniques to represent user geographical preferences as images and then apply deep convolutional autoencoders to extract latent spatio-temporal mobility features from these images. These features are then fed to a clustering algorithm that identifies the underlying community structures. We use a diverse urban mobility dataset to validate the proposed framework. Our results show that the proposed framework is able to significantly increase the similarity between intra-community nodes (by up to 107%) as well as dissimilarity between inter-community nodes (up to 54%) when compared against no pre-processing of the datasets, i.e without pre-processing the datasets through any feature fusion method. Moreover, it was also able to reach up to 100% improvement when compared against community identification using Principal Component Analysis (PCA). Our results also show that the proposed approach yields significant increase in contact time amongst users belonging to the same community, by up to 80% when compared to the average contact time when not considering community structures, and by up to 150% when compared to the baseline. To the best of our knowledge, our proposal is the first to consider deep convolutional autoencoding to perform automatic extraction of non-linear spatio-temporal mobility features characterizing individual users from raw mobility datasets with the goal of identifying user communities.

[1]  Mo Li,et al.  Urban Traffic Prediction from Mobility Data Using Deep Learning , 2018, IEEE Network.

[2]  Francisco Charte,et al.  A practical tutorial on autoencoders for nonlinear feature fusion: Taxonomy, models, software and guidelines , 2018, Inf. Fusion.

[3]  Antonio Lima,et al.  Interdependence and predictability of human mobility and social interactions , 2012, Pervasive Mob. Comput..

[4]  Sebastian Johann Wetzel,et al.  Unsupervised learning of phase transitions: from principal component analysis to variational autoencoders , 2017, Physical review. E.

[5]  Ludovic Denoyer,et al.  Spatio-Temporal Neural Networks for Space-Time Series Forecasting and Relations Discovery , 2017, 2017 IEEE International Conference on Data Mining (ICDM).

[6]  Erik Cambria,et al.  Recent Trends in Deep Learning Based Natural Language Processing , 2017, IEEE Comput. Intell. Mag..

[7]  Qiang Liu,et al.  A Survey of Clustering With Deep Learning: From the Perspective of Network Architecture , 2018, IEEE Access.

[8]  Charu C. Aggarwal,et al.  Neural Networks and Deep Learning , 2018, Springer International Publishing.

[9]  Guangchun Luo,et al.  Location prediction on trajectory data: A review , 2018, Big Data Min. Anal..

[10]  Injong Rhee,et al.  STEP: A spatio-temporal mobility model for humans walks , 2010, The 7th IEEE International Conference on Mobile Ad-hoc and Sensor Systems (IEEE MASS 2010).

[11]  Kevin Heaslip,et al.  Inferring transportation modes from GPS trajectories using a convolutional neural network , 2018, ArXiv.

[12]  Alexandra Coman,et al.  Identifying Connectors and Communities: Understanding Their Impacts on the Performance of a DTN Publish/Subscribe System , 2009, 2009 International Conference on Computational Science and Engineering.

[13]  Yan Liu,et al.  Recurrent Neural Networks for Multivariate Time Series with Missing Values , 2016, Scientific Reports.

[14]  Matthias Grossglauser,et al.  CRAWDAD dataset epfl/mobility (v.2009-02-24) , 2009 .

[15]  Xiaochun Cao,et al.  Modularity Based Community Detection with Deep Learning , 2016, IJCAI.

[16]  Mark J. Embrechts,et al.  On the Use of the Adjusted Rand Index as a Metric for Evaluating Supervised Classification , 2009, ICANN.

[17]  Daniel Cremers,et al.  Clustering with Deep Learning: Taxonomy and New Methods , 2018, ArXiv.

[18]  Laura Ferrari,et al.  Urban Sensing Using Mobile Phone Network Data: A Survey of Research , 2014, ACM Comput. Surv..

[19]  Huachun Tan,et al.  Variational Deep Embedding: An Unsupervised and Generative Approach to Clustering , 2016, IJCAI.

[20]  Xing Xie,et al.  Mining interesting locations and travel sequences from GPS trajectories , 2009, WWW '09.

[21]  Xing Xie,et al.  A greener transportation mode: flexible routes discovery from GPS trajectory data , 2011, GIS.

[22]  Alex Pentland,et al.  Reality mining: sensing complex social systems , 2006, Personal and Ubiquitous Computing.

[23]  Yu Zheng,et al.  Trajectory Data Mining , 2015, ACM Trans. Intell. Syst. Technol..

[24]  Chih-Chieh Hung,et al.  Exploring Sequential Probability Tree for Movement-Based Community Discovery , 2014, IEEE Transactions on Knowledge and Data Engineering.

[25]  Licia Capra,et al.  Urban Computing: Concepts, Methodologies, and Applications , 2014, TIST.

[26]  Jiannong Cao,et al.  When Transportation Meets Communication: V2P over VANETs , 2010, 2010 IEEE 30th International Conference on Distributed Computing Systems.

[27]  Yingjie Tian,et al.  A Comprehensive Survey of Clustering Algorithms , 2015, Annals of Data Science.

[28]  Kurt Hornik,et al.  Neural networks and principal component analysis: Learning from examples without local minima , 1989, Neural Networks.

[29]  Jure Leskovec,et al.  node2vec: Scalable Feature Learning for Networks , 2016, KDD.

[30]  Boaz Lerner,et al.  Human Mobility-Pattern Discovery and Next-Place Prediction from GPS Data , 2016, MPRSS.

[31]  Elise Miller-Hooks,et al.  Large-Scale Vehicle Sharing Systems: Analysis of Vélib' , 2013 .

[32]  Bernhard Rinner,et al.  Security and Privacy Protection in Visual Sensor Networks , 2014, ACM Comput. Surv..

[33]  Tsuyoshi Murata,et al.  Learning Community Structure with Variational Autoencoder , 2018, 2018 IEEE International Conference on Data Mining (ICDM).

[34]  Mirco Musolesi,et al.  Using Autoencoders to Automatically Extract Mobility Features for Predicting Depressive States , 2018, Proc. ACM Interact. Mob. Wearable Ubiquitous Technol..

[35]  Katia Obraczka,et al.  Scale-Free Properties of Human Mobility and Applications to Intelligent Transportation Systems , 2018, IEEE Transactions on Intelligent Transportation Systems.

[36]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[37]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[38]  Jüri Lember,et al.  Bridging Viterbi and posterior decoding: a generalized risk approach to hidden path inference based on hidden Markov models , 2014, J. Mach. Learn. Res..

[39]  Enhong Chen,et al.  Learning Deep Representations for Graph Clustering , 2014, AAAI.

[40]  Richard Dosselmann,et al.  A comprehensive assessment of the structural similarity index , 2011, Signal Image Video Process..

[41]  Qiang Gao,et al.  Trajectory-User Linking via Variational AutoEncoder , 2018, IJCAI.

[42]  Tristan Henderson,et al.  CRAWDAD dataset dartmouth/campus (v.2004-11-09) , 2004 .

[43]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[44]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[45]  Kevin M. Curtin,et al.  A New Model for a Carpool Matching Service , 2015, PloS one.

[46]  A. Raftery,et al.  Detecting features in spatial point processes with clutter via model-based clustering , 1998 .

[47]  Pascal Vincent,et al.  Unsupervised Feature Learning and Deep Learning: A Review and New Perspectives , 2012, ArXiv.

[48]  Jürgen Schmidhuber,et al.  Multi-dimensional Recurrent Neural Networks , 2007, ICANN.

[49]  Xing Xie,et al.  GeoLife: A Collaborative Social Networking Service among User, Location and Trajectory , 2010, IEEE Data Eng. Bull..

[50]  Robin Doss,et al.  Routing Protocols in Opportunistic Networks – A Survey , 2018 .

[51]  Eran Toch,et al.  Analyzing large-scale human mobility data: a survey of machine learning methods and applications , 2019, Knowledge and Information Systems.

[52]  U. Berardi,et al.  Smart Cities: Definitions, Dimensions, Performance, and Initiatives , 2015 .

[53]  Shaojie Tang,et al.  Recent progress in routing protocols of mobile opportunistic networks: A clear taxonomy, analysis and evaluation , 2016, J. Netw. Comput. Appl..

[54]  Laurens van der Maaten,et al.  Accelerating t-SNE using tree-based algorithms , 2014, J. Mach. Learn. Res..

[55]  A. Stewart Fotheringham,et al.  Analysis of human mobility patterns from GPS trajectories and contextual information , 2016, Int. J. Geogr. Inf. Sci..

[56]  José Neves,et al.  Evolution of neural networks for classification and regression , 2007, Neurocomputing.

[57]  Jie Wu,et al.  LocalCom: A Community-based Epidemic Forwarding Scheme in Disruption-tolerant Networks , 2009, 2009 6th Annual IEEE Communications Society Conference on Sensor, Mesh and Ad Hoc Communications and Networks.

[58]  Nanning Zheng,et al.  Parallel vision for perception and understanding of complex scenes: methods, framework, and perspectives , 2017, Artificial Intelligence Review.

[59]  Sen Jia,et al.  Convolutional neural networks for hyperspectral image classification , 2017, Neurocomputing.

[60]  Yuki Endo,et al.  Deep Feature Extraction from Trajectories for Transportation Mode Estimation , 2016, PAKDD.

[61]  Hao Wang,et al.  Detecting Transportation Modes Using Deep Neural Network , 2017, IEICE Trans. Inf. Syst..

[62]  Richard B. Langley,et al.  The UTM Grid System , 2022 .

[63]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[64]  M. Barthelemy,et al.  How congestion shapes cities: from mobility patterns to scaling , 2014, Scientific Reports.

[65]  Yong Wang,et al.  Learning Traffic as Images: A Deep Convolutional Neural Network for Large-Scale Transportation Network Speed Prediction , 2017, Sensors.

[66]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[67]  Albert-László Barabási,et al.  Understanding individual human mobility patterns , 2008, Nature.

[68]  Albert-László Barabási,et al.  Limits of Predictability in Human Mobility , 2010, Science.

[69]  Geoff Boeing The Structure and Dynamics of Cities: Urban Data Analysis and Theoretical Modeling, by Marc Barthelemy , 2017 .

[70]  Yurong Liu,et al.  A survey of deep neural network architectures and their applications , 2017, Neurocomputing.

[71]  Xavier Bresson,et al.  Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering , 2016, NIPS.

[72]  T. Jaeger,et al.  Categorical Data Analysis: Away from ANOVAs (transformation or not) and towards Logit Mixed Models. , 2008, Journal of memory and language.

[73]  Mirco Musolesi,et al.  Trajectories of depression: unobtrusive monitoring of depressive states by means of smartphone mobility traces analysis , 2015, UbiComp.