Discovering New Socio-demographic Regional Patterns in Cities

During the past few years, the analysis of data generated from Location-Based Social Networks (LBSNs) have aided in the identification of urban patterns, understanding activity behaviours in urban areas, as well as producing novel recommender systems that facilitate users' choices. However, the recent advancement in machine learning techniques promises new deeper insights with the possibility of finding new spatio-temporal patterns in cities. In this paper, we show that one of the recent advancements in machine learning, Deep Belief Networks (DBNs), can discover a new type of pattern, which we refer to in the paper as the Socio-demographic Regional Pattern. This pattern illustrates the ability of predicting the district of a city given a set of weekly activities captured from LBSNs. Specifically, we have found instances of this embedded pattern for the boroughs in New York City by training a DBN model that can classify with nearly 70% accuracy the location of weekly region-footprints. We further validated the existence and complexity of this type of pattern by applying a probabilistic topic model, namely Latent Dirichlet Allocation (LDA). We believe that this research can yield to a deeper understanding about social commonalities and the geographical evolution of different regions and areas, between cities across the globe.

[1]  Yoshua Bengio,et al.  An empirical evaluation of deep architectures on problems with many factors of variation , 2007, ICML '07.

[2]  B. Ripley,et al.  Pattern Recognition , 1968, Nature.

[3]  A. Pentland,et al.  Eigenbehaviors: identifying structure in routine , 2009, Behavioral Ecology and Sociobiology.

[4]  Peter Norvig,et al.  The Unreasonable Effectiveness of Data , 2009, IEEE Intelligent Systems.

[5]  Franco Zambonelli,et al.  Supporting location-aware services for mobile users with the whereabouts diary , 2008, MOBILWARE.

[6]  Jason Weston,et al.  A unified architecture for natural language processing: deep neural networks with multitask learning , 2008, ICML '08.

[7]  David Haussler,et al.  Unsupervised learning of distributions on binary vectors using two layer networks , 1991, NIPS 1991.

[8]  Yong Yu,et al.  Inferring gas consumption and pollution emission of vehicles throughout a city , 2014, KDD.

[9]  Yoshua Bengio,et al.  Greedy Layer-Wise Training of Deep Networks , 2006, NIPS.

[10]  Sheng Tang,et al.  A density-based method for adaptive LDA model selection , 2009, Neurocomputing.

[11]  Daqing Zhang,et al.  Modeling User Activity Preference by Leveraging User Spatial Temporal Characteristics in LBSNs , 2015, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[12]  Satish V. Ukkusuri,et al.  Urban activity pattern classification using topic models from online geo-location data , 2014 .

[13]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[14]  Fu Jie Huang,et al.  A Tutorial on Energy-Based Learning , 2006 .

[15]  Declan O'Sullivan,et al.  Machine learning as a service for enabling Internet of Things and People , 2016, Personal and Ubiquitous Computing.

[16]  Hui Xiong,et al.  Discovering Urban Functional Zones Using Latent Activity Trajectories , 2015, IEEE Transactions on Knowledge and Data Engineering.

[17]  Yoshua. Bengio,et al.  Learning Deep Architectures for AI , 2007, Found. Trends Mach. Learn..

[18]  Hossein Mobahi,et al.  Deep learning from temporal coherence in video , 2009, ICML '09.

[19]  Daniel Gatica-Perez,et al.  Discovering routines from large-scale human locations using probabilistic topic models , 2011, TIST.

[20]  Cecilia Mascolo,et al.  Exploiting Semantic Annotations for Clustering Geographic Areas and Users in Location-based Social Networks , 2011, The Social Mobile Web.

[21]  Laura Ferrari,et al.  Discovering daily routines from Google Latitude with topic models , 2011, 2011 IEEE International Conference on Pervasive Computing and Communications Workshops (PERCOM Workshops).

[22]  George A. Vouros,et al.  Determining Automatically the Size of Learned Ontologies , 2008, ECAI.

[23]  Satish V. Ukkusuri,et al.  A novel transit rider satisfaction metric: Rider sentiments measured from online social media data , 2013 .

[24]  Geoffrey E. Hinton Training Products of Experts by Minimizing Contrastive Divergence , 2002, Neural Computation.

[25]  Declan O'Sullivan,et al.  Towards Bridging the Gap between Machine Learning Researchers and Practitioners , 2015, 2015 IEEE International Conference on Smart City/SocialCom/SustainCom (SmartCity).

[26]  Huan Liu,et al.  gSCorr: modeling geo-social correlations for new check-ins on location-based social networks , 2012, CIKM.

[27]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[28]  Kyumin Lee,et al.  Exploring Millions of Footprints in Location Sharing Services , 2011, ICWSM.

[29]  Declan O'Sullivan,et al.  Spatio-Temporal Clustering Approach for Detecting Functional Regions in Cities , 2016, 2016 IEEE 28th International Conference on Tools with Artificial Intelligence (ICTAI).

[30]  Geoffrey E. Hinton,et al.  Restricted Boltzmann machines for collaborative filtering , 2007, ICML '07.

[31]  Felix Kling,et al.  When a city tells a story: urban topic analysis , 2012, SIGSPATIAL/GIS.

[32]  Xing Xie,et al.  Sensing the pulse of urban refueling behavior , 2013, UbiComp.

[33]  Hui Xiong,et al.  Exploiting geographic dependencies for real estate appraisal: a mutual perspective of ranking and clustering , 2014, KDD.

[34]  J J Hopfield,et al.  Neural networks and physical systems with emergent collective computational abilities. , 1982, Proceedings of the National Academy of Sciences of the United States of America.

[35]  Franco Zambonelli,et al.  Extracting urban patterns from location-based social networks , 2011, LBSN '11.

[36]  M. Narasimha Murty,et al.  On Finding the Natural Number of Topics with Latent Dirichlet Allocation: Some Observations , 2010, PAKDD.

[37]  Ole Winther,et al.  Deep Belief Nets for Topic Modeling , 2015, ArXiv.

[38]  Geoffrey E. Hinton A Practical Guide to Training Restricted Boltzmann Machines , 2012, Neural Networks: Tricks of the Trade.

[39]  Stephan Sigg,et al.  An Alignment Approach for Context Prediction Tasks in UbiComp Environments , 2010, IEEE Pervasive Computing.