Twitter-based Urban Area Characterization by Non-negative Matrix Factorization

Due to the remarkable growth of various social networks boosted by the pervasive mobile devices, massive crowds can become social sensors which can share microbolgs on a variety of social situations and natural phenomena in urban space in real-time. In order to take advantages of the novel realm of crowd-sourced lifelogs to characterize urban areas, we attempt to explore characteristics of complex and dynamic urban areas by monitoring crowd behavior via location-based social networks. In particular, we define social conditions consisting of crowd's experiential features extracted from the analysis of Twitter-based crowd's lifelogs. Then, we explore latent characteristic faces of urban areas in term of 5-dimensional social conditions by applying Non-negative Matrix Factorization (NMF). In the experiments with massive geo-tagged tweets, we classify urban areas into representative groups based on their latent patterns which enable to comprehensively understand images of the urban areas focusing on crowd's daily lives.

[1]  Kevin Lynch,et al.  The Image of the City , 1960 .

[2]  Tat-Seng Chua,et al.  Mining Travel Patterns from Geotagged Photos , 2012, TIST.

[3]  Gavin McArdle,et al.  City-scale traffic simulation from digital footprints , 2012, UrbComp '12.

[4]  Kazutoshi Sumiya,et al.  "Geo-social media analytics: exploring and exploiting geo-social experience from crowd-sourced lifelogs" by R. Lee, S. Wakamiya, and K. Sumiya with Ching-man Au Yeung as coordinator , 2014, LINK.

[5]  Shou-De Lin,et al.  Exploiting large-scale check-in data to recommend time-sensitive routes , 2012, UrbComp '12.

[6]  Kazutoshi Sumiya,et al.  Urban area characterization based on crowd behavioral lifelogs over Twitter , 2012, Personal and Ubiquitous Computing.

[7]  Xing Xie,et al.  Discovering regions of different functions in a city using human mobility and POIs , 2012, KDD.

[8]  Ciro Cattuto,et al.  Dynamical classes of collective attention in twitter , 2011, WWW.

[9]  Sune Lehmann,et al.  Understanding the Demographics of Twitter Users , 2011, ICWSM.

[10]  Nello Cristianini,et al.  Nowcasting Events from the Social Web with Statistical Learning , 2012, TIST.

[11]  Michael C. Hout,et al.  Multidimensional Scaling , 2003, Encyclopedic Dictionary of Archaeology.

[12]  Andrzej Cichocki,et al.  Nonnegative Matrix and Tensor Factorization T , 2007 .

[13]  Takashi Inui,et al.  Extracting Semantic Orientations of Words using Spin Model , 2005, ACL.

[14]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[15]  Brendan T. O'Connor,et al.  From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series , 2010, ICWSM.

[16]  Kazutoshi Sumiya,et al.  Discovery of unusual regional social activities using geo-tagged microblogs , 2011, World Wide Web.

[17]  A. Raftery,et al.  Nearest-Neighbor Clutter Removal for Estimating Features in Spatial Point Processes , 1998 .

[18]  Kazutoshi Sumiya,et al.  Exploring Reflection of Urban Society through Cyber-Physical Crowd Behavior on Location-Based Social Network , 2012, DASFAA Workshops.