Temporal Activity Path Based Character Correction in Social Networks

Vast amount of multimedia data contains massive and multifarious social information which is used to construct large-scale social networks. In a complex social network, a character should be ideally denoted by one and only one vertex. However, it is pervasive that a character is denoted by two or more vertices with different names, thus it is usually considered as multiple, different characters. This problem causes incorrectness of results in network analysis and mining. The factual challenge is that character uniqueness is hard to correctly confirm due to lots of complicated factors, e.g. name changing and anonymization, leading to character duplication. Early, limited research has shown that previous methods depended overly upon supplementary attribute information from databases. In this paper, we propose a novel method to merge the character vertices which refer to as the same entity but are denoted with different names. With this method, we firstly build the relationship network among characters based on records of social activities participated, which are extracted from multimedia sources. Then define temporal activity paths (TAPs) for each character over time. After that, we measure similarity of the TAPs for any two characters. If the similarity is high enough, the two vertices should be considered to the same character. Based on TAPs, we can determine whether to merge the two character vertices. Our experiments shown that this solution can accurately confirm character uniqueness in large-scale social network.

[1]  Lin Wu,et al.  Unsupervised Metric Fusion Over Multiview Data by Graph Random Walk-Based Cross-View Diffusion , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[2]  Philip S. Yu,et al.  PathSim , 2011, Proc. VLDB Endow..

[3]  Lin Wu,et al.  What-and-Where to Match: Deep Spatially Multiplicative Integration Networks for Person Re-identification , 2017, Pattern Recognit..

[4]  Avita Katal,et al.  Big data: Issues, challenges, tools and Good practices , 2013, 2013 Sixth International Conference on Contemporary Computing (IC3).

[5]  Miltiadis D. Lytras,et al.  The impact of Social Multimedia Systems on cyberlearners , 2013, Comput. Hum. Behav..

[6]  Ming Gu,et al.  De-Anonymizing Dynamic Social Networks , 2011, 2011 IEEE Global Telecommunications Conference - GLOBECOM 2011.

[7]  Andrew McCallum,et al.  Disambiguating Web appearances of people in a social network , 2005, WWW '05.

[8]  Lin Wu,et al.  Effective Multi-Query Expansions: Robust Landmark Retrieval , 2015, ACM Multimedia.

[9]  Yang Wang,et al.  Structured Deep Hashing with Convolutional Neural Networks for Fast Person Re-identification , 2017, Comput. Vis. Image Underst..

[10]  M. Newman,et al.  Vertex similarity in networks. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[11]  Seref Sagiroglu,et al.  Big data: A review , 2013, 2013 International Conference on Collaboration Technologies and Systems (CTS).

[12]  Zenggang Xiong,et al.  Joint fingerprinting and encryption in hybrid domains for multimedia sharing in social networks , 2014, J. Vis. Lang. Comput..

[13]  Lin Wu,et al.  LBMCH: Learning Bridging Mapping for Cross-modal Hashing , 2015, SIGIR.

[14]  Xue Li,et al.  Deep Attention-Based Spatially Recursive Networks for Fine-Grained Visual Recognition , 2019, IEEE Transactions on Cybernetics.

[15]  Ling Shao,et al.  Cycle-Consistent Deep Generative Hashing for Cross-Modal Retrieval , 2018, IEEE Transactions on Image Processing.

[16]  Lin Wu,et al.  Robust Subspace Clustering for Multi-View Data by Exploiting Correlation Consensus , 2015, IEEE Transactions on Image Processing.

[17]  Hao Zhang,et al.  Visualization of Clustered Network Graphs Based on Constrained Optimization Partition Layout , 2014 .

[18]  Lin Wu,et al.  Robust Hashing for Multi-View Data: Jointly Learning Low-Rank Kernelized Similarity Consensus and Hash Functions , 2016, Image Vis. Comput..

[19]  Yi-Liang Zhao,et al.  Multimedia venue semantic modeling based on multimodal data , 2017, J. Vis. Commun. Image Represent..

[20]  Lin Wu,et al.  Exploiting Correlation Consensus: Towards Subspace Clustering for Multi-modal Data , 2014, ACM Multimedia.

[21]  Vitaly Shmatikov,et al.  Robust De-anonymization of Large Sparse Datasets , 2008, 2008 IEEE Symposium on Security and Privacy (sp 2008).

[22]  Jie Tang,et al.  ArnetMiner: extraction and mining of academic social networks , 2008, KDD.

[23]  Lin Wu,et al.  An efficient framework of Bregman divergence optimization for co-ranking images and tags in a heterogeneous network , 2015, Multimedia Tools and Applications.

[24]  K. Liu,et al.  Towards identity anonymization on graphs , 2008, SIGMOD Conference.

[25]  Lin Wu,et al.  Effective Multi-Query Expansions: Collaborative Deep Networks for Robust Landmark Retrieval , 2017, IEEE Transactions on Image Processing.

[26]  Lin Wu,et al.  Efficient image and tag co-ranking: a bregman divergence optimization method , 2013, ACM Multimedia.

[27]  Xindong Wu,et al.  Data mining with big data , 2014, IEEE Transactions on Knowledge and Data Engineering.

[28]  Yang Wang,et al.  Towards metric fusion on multi-view data: a cross-view based graph random walk approach , 2013, CIKM.

[29]  Changsheng Xu,et al.  Social Multimedia Ming: From Special to General , 2016, 2016 IEEE International Symposium on Multimedia (ISM).

[30]  Lin Wu,et al.  Clustering via geometric median shift over Riemannian manifolds , 2013, Inf. Sci..

[31]  Lin Wu,et al.  Iterative Views Agreement: An Iterative Low-Rank Based Structured Optimization Method to Multi-View Spectral Clustering , 2016, IJCAI.

[32]  Changsheng Xu,et al.  On Analyzing the 'Variety' of Big Social Multimedia , 2015, 2015 IEEE International Conference on Multimedia Big Data.

[33]  Lin Wu,et al.  Beyond Low-Rank Representations: Orthogonal Clustering Basis Reconstruction with Optimized Graph Structure for Multi-view Spectral Clustering , 2017, Neural Networks.

[34]  Charles X. Ling,et al.  Extracting Academic Activity Transaction in Chinese Documents , 2014 .

[35]  David J. Crandall,et al.  De-Anonymizing Users Across Heterogeneous Social Computing Platforms , 2013, ICWSM.

[36]  Flora Amato,et al.  SOS: A multimedia recommender System for Online Social networks , 2017, Future Gener. Comput. Syst..

[37]  Stéphane Frénot,et al.  C3PO: A Spontaneous and Ephemeral Social Networking Framework for a Collaborative Creation and Publishing of Multimedia Contents , 2014, MoWNet.

[38]  Yue Gao,et al.  Real-Time Multimedia Social Event Detection in Microblog , 2018, IEEE Transactions on Cybernetics.

[39]  Jian Pei,et al.  An Iterative Fusion Approach to Graph-Based Semi-Supervised Learning from Multiple Views , 2014, PAKDD.

[40]  Lin Wu,et al.  Deep adaptive feature embedding with local sample distributions for person re-identification , 2017, Pattern Recognit..

[41]  Lin Wu,et al.  Shifting Hypergraphs by Probabilistic Voting , 2014, PAKDD.

[42]  Lin Wu,et al.  Multiview Spectral Clustering via Structured Low-Rank Matrix Factorization , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[43]  Michael Hicks,et al.  Deanonymizing mobility traces: using social network as a side-channel , 2012, CCS.

[44]  Xuelong Li,et al.  Overlapping Community Detection for Multimedia Social Networks , 2017, IEEE Transactions on Multimedia.

[45]  Vitaly Shmatikov,et al.  De-anonymizing Social Networks , 2009, 2009 30th IEEE Symposium on Security and Privacy.