Who are My Familiar Strangers?: Revealing Hidden Friend Relations and Common Interests from Smart Card Data

The newly emerging location-based social networks (LBSN) such as Tinder and Momo extends social interaction from friends to strangers, providing novel experiences of making new friends. Familiar strangers refer to the strangers who meet frequently in daily life and may share common interests; thus they may be good candidates for friend recommendation. In this paper, we study the problem of discovering familiar strangers, specifically, public transportation trip companions, and their common interests. We collect 5.7 million transaction records of smart cards from about 3.02 million people in the city of Beijing, China. We first analyze this dataset and reveal the temporal and spatial characteristics of passenger encounter behaviors. Then we propose a stability metric to measure hidden friend relations. This metric facilitates us to employ community detection techniques to capture the communities of trip companions. Further, we infer common interests of each community using a topic model, i.e., LDA4HFC (Latent Dirichlet Allocation for Hidden Friend Communities) model. Such topics for communities help to understand how hidden friend clusters are formed. We evaluate our method using large-scale and real-world datasets, consisting of two-week smart card records and 901,855 Point of Interests (POIs) in Beijing. The results show that our method outperforms three baseline methods with higher recommendation accuracy. Moreover, our case study demonstrates that the discovered topics interpret the communities very well.

[1]  Gregor Heinrich Parameter estimation for text analysis , 2009 .

[2]  Mao Ye,et al.  User association analysis of locales on location based social networks , 2011, LBSN '11.

[3]  Yizhou Sun,et al.  LCARS: a location-content-aware recommender system , 2013, KDD.

[4]  Moustafa Youssef,et al.  CheckInside: a fine-grained indoor location-based social network , 2014, UbiComp.

[5]  Kazutoshi Sumiya,et al.  Discovery of unusual regional social activities using geo-tagged microblogs , 2011, World Wide Web.

[6]  Marc Lemercier,et al.  Familiar Strangers detection in online social networks , 2013, 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013).

[7]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[8]  Huan Liu,et al.  A Social Identity Approach to Identify Familiar Strangers in a Social Network , 2009, ICWSM.

[9]  Cecilia Mascolo,et al.  Exploiting place features in link prediction on location-based social networks , 2011, KDD.

[10]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[11]  Eric Paulos,et al.  The familiar stranger: anxiety, comfort, and play in public places , 2004, CHI.

[12]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[14]  Mohamed F. Mokbel,et al.  Recommendations in location-based social networks: a survey , 2015, GeoInformatica.

[15]  Xianfeng Huang,et al.  Understanding metropolitan patterns of daily encounters , 2013, Proceedings of the National Academy of Sciences.

[16]  Jiawei Han,et al.  Geo-Friends Recommendation in GPS-based Cyber-physical Social Network , 2011, 2011 International Conference on Advances in Social Networks Analysis and Mining.

[17]  Zhenyu Chen,et al.  Predicting Passengers in Public Transportation Using Smart Card Data , 2015, ADC.

[18]  Le Minh Kieu,et al.  Passenger Segmentation Using Smart Card Data , 2015, IEEE Transactions on Intelligent Transportation Systems.

[19]  Zhiyong Cui,et al.  Perspectives on Stability and Mobility of Passenger's Travel Behavior through Smart Card Data , 2015, ArXiv.

[20]  Shazia Wasim Sadiq,et al.  Discovering interpretable geo-social communities for user behavior prediction , 2016, 2016 IEEE 32nd International Conference on Data Engineering (ICDE).

[21]  Yutaka Matsuo,et al.  Earthquake shakes Twitter users: real-time event detection by social sensors , 2010, WWW '10.

[22]  Jianhua Lin,et al.  Divergence measures based on the Shannon entropy , 1991, IEEE Trans. Inf. Theory.