Learning functional compositions of urban spaces with crowd-augmented travel survey data

Regions in urban environments often afford a mixture of different utilities. Their identification allows urban planners to leverage important insights on the emerging functional dynamics of cities. With the increasing availability of human mobility data and other forms of online digital breadcrumbs, we can now characterize urban regions with multi-source features. In this work, we form a comprehensive view of urban regions by fusing features depicting their temporal, spatial, and demographic aspects. Aggregating 47K explicitly stated trip purposes into their respective destination regions, we obtain multi-dimensional ground-truths on the functionalities of urban spaces. Given fused features and training labels, we can perform supervised learning, via multi-output regression, to estimate the functional composition of urban spaces. With 14 functional dimensions, our approach using crowd-augmented travel survey predictors delivers a mean absolute error of 3.9, approximately half of the error resulting from a mean-based straw man approach (mean absolute error of 7.9). Clustering estimated regional functionalities, we find highly coherent cluster assignments (adjusted Rand Index of 0.81) compared to clustering directly on regional functionality labels. Finally, we provide an illustrative case-study where clustering of estimated region functionalities can be used to intuitively differentiate prototypical spatial neighbourhoods of a large metropolitan.

[1]  Alexei Pozdnoukhov,et al.  Temporal decomposition and semantic enrichment of mobility flows , 2013, LBSN '13.

[2]  Hui Xiong,et al.  Discovering Urban Functional Zones Using Latent Activity Trajectories , 2015, IEEE Transactions on Knowledge and Data Engineering.

[3]  Zhaohui Wu,et al.  This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS 1 Land-Use Classification Using Taxi GPS Traces , 2022 .

[4]  Li Gong,et al.  Revealing travel patterns and city structure with taxi trip data , 2016 .

[5]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[6]  Satish V. Ukkusuri,et al.  Inferring Urban Land Use Using Large-Scale Social Media Check-in Data , 2014 .

[7]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[8]  Shan Jiang,et al.  Clustering daily patterns of human activities in the city , 2012, Data Mining and Knowledge Discovery.

[9]  Francisco C. Pereira,et al.  Activity Recognition for a Smartphone Based Travel Survey Based on Cross-User History Data , 2014, 2014 22nd International Conference on Pattern Recognition.

[10]  Gang Pan,et al.  Mining the semantics of origin-destination flows using taxi traces , 2012, UbiComp '12.

[11]  S. Glantz,et al.  Primer of Applied Regression & Analysis of Variance , 1990 .

[12]  John Krumm,et al.  Placer: semantic place labels from diary data , 2013, UbiComp.

[13]  Max L. Wilson,et al.  A data driven approach to mapping urban neighbourhoods , 2014, SIGSPATIAL/GIS.

[14]  William M. Rand,et al.  Objective Criteria for the Evaluation of Clustering Methods , 1971 .

[15]  Chenghu Zhou,et al.  A new insight into land use classification based on aggregated mobile phone data , 2013, Int. J. Geogr. Inf. Sci..

[16]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[17]  Pierre Geurts,et al.  Extremely randomized trees , 2006, Machine Learning.

[18]  N. Smirnov Table for Estimating the Goodness of Fit of Empirical Distributions , 1948 .

[19]  Kazutoshi Sumiya,et al.  Urban area characterization based on crowd behavioral lifelogs over Twitter , 2012, Personal and Ubiquitous Computing.

[20]  Dietmar Bauer,et al.  Inferring land use from mobile phone activity , 2012, UrbComp '12.

[21]  Concha Bielza,et al.  A survey on multi‐output regression , 2015, WIREs Data Mining Knowl. Discov..

[22]  Víctor Soto,et al.  Characterizing Urban Landscapes Using Geolocated Tweets , 2012, 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing.

[23]  Felix Kling,et al.  When a city tells a story: urban topic analysis , 2012, SIGSPATIAL/GIS.

[24]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[25]  Kazutoshi Sumiya,et al.  Urban Area Characterization Based on Semantics of Crowd Activities in Twitter , 2011, GeoS.