Social Network Aggregation Using Face-Recognition

With the rapid growth of the social web an increasing num- ber of people started to replicate their off-line preferences and lives in an on-line environment. Consequently, the social web provides an enormous source for social network data, which can be used in both commercial and research applications. However, people often take part in multiple social network sites and, generally, they share only a selected amount of data to the audience of a specific platform. Consequently, the interlink- age of social graphs from different sources getting increasingly impor- tant for applications such as social network analysis, personalization, or recommender systems. This paper proposes a novel method to enhance available user re-identification systems for social network data aggrega- tion based on face-recognition algorithms. Furthermore, the method is combined with traditional text-based approaches in order to attempt a counter-balancing of the weaknesses of both methods. Using two sam- ples of real-world social networks (with 1610 and 1690 identities each) we show that even though a pure face-recognition based method gets out- performed by the traditional text-based method (area under the ROC curve 0.986 vs. 0.938) the combined method significantly outperforms both of these (0.998, p = 0.0001) suggesting that the face-based method indeed carries complimentary information to raw text attributes.

[1]  B. Olivier Cyberspace and Identity , 2011 .

[2]  V. Kshirsagar,et al.  Face recognition using Eigenfaces , 2011, 2011 3rd International Conference on Computer Research and Development.

[3]  Matthew Rowe,et al.  Interlinking Distributed Social Graphs , 2009, LDOW.

[4]  Federica Cena,et al.  User identification for cross-system personalisation , 2009, Inf. Sci..

[5]  William E. Winkler,et al.  The State of Record Linkage and Current Research Problems , 1999 .

[6]  Danushka Bollegala,et al.  Extracting Key Phrases to Disambiguate Personal Names on the Web , 2006, CICLing.

[7]  Lawrence Sirovich,et al.  Application of the Karhunen-Loeve Procedure for the Characterization of Human Faces , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[9]  Andrew McCallum,et al.  Disambiguating Web appearances of people in a social network , 2005, WWW '05.

[10]  Alessandro Acquisti,et al.  Information revelation and privacy in online social networks , 2005, WPES '05.

[11]  E. DeLong,et al.  Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. , 1988, Biometrics.

[12]  Kôiti Hasida,et al.  POLYPHONET: An advanced social network extraction system from the Web , 2007, J. Web Semant..

[13]  Ilaria Torre,et al.  User data distributed on the social web: how to identify users on different social systems and collecting data about them , 2010, HetRec '10.

[14]  Peter Y. K. Cheung,et al.  Adaptive automatic facial feature segmentation , 1996, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition.

[15]  Pedro M. Domingos,et al.  Entity Resolution with Markov Logic , 2006, Sixth International Conference on Data Mining (ICDM'06).

[16]  Ahmed K. Elmagarmid,et al.  Duplicate Record Detection: A Survey , 2007, IEEE Transactions on Knowledge and Data Engineering.

[17]  Bradley Malin,et al.  Unsupervised Name Disambiguation via Social Network Similarity , 2005 .

[18]  Harry Wechsler,et al.  Reliable face recognition methods - system design, implementation and evaluation , 2006 .

[19]  Fabian Abel,et al.  User profile elicitation and conversion in a mashup environment , 2009 .

[20]  P. Ivax,et al.  A THEORY FOR RECORD LINKAGE , 2004 .

[21]  Rama Chellappa,et al.  Face Processing: Advanced Modeling and Methods , 2006, J. Electronic Imaging.

[22]  Premkumar T. Devanbu,et al.  The missing links: bugs and bug-fix commits , 2010, FSE '10.

[23]  Irma Veldman Matching Profiles from Social Network Sites , 2009 .

[24]  Peter Mika,et al.  Flink: Semantic Web technology for the extraction and analysis of social networks , 2005, J. Web Semant..