Fuzzy Random Walkers with Second Order Bounds: An Asymmetric Analysis

Edge-fuzzy graphs constitute an essential modeling paradigm across a broad spectrum of domains ranging from artificial intelligence to computational neuroscience and social network analysis. Under this model, fundamental graph properties such as edge length and graph diameter become stochastic and as such they are consequently expressed in probabilistic terms. Thus, algorithms for fuzzy graph analysis must rely on non-deterministic design principles. One such principle is Random Walker, which is based on a virtual entity and selects either edges or, like in this case, vertices of a fuzzy graph to visit. This allows the estimation of global graph properties through a long sequence of local decisions, making it a viable strategy candidate for graph processing software relying on native graph databases such as Neo4j. As a concrete example, Chebyshev Walktrap, a heuristic fuzzy community discovery algorithm relying on second order statistics and on the teleportation of the Random Walker, is proposed and its performance, expressed in terms of community coherence and number of vertex visits, is compared to the previously proposed algorithms of Markov Walktrap, Fuzzy Walktrap, and Fuzzy Newman–Girvan. In order to facilitate this comparison, a metric based on the asymmetric metrics of Tversky index and Kullback–Leibler divergence is used.

[1]  R. W. Hamming,et al.  On the distribution of numbers , 1970, Bell Syst. Tech. J..

[2]  Athanasios K. Tsakalidis,et al.  Evaluating Twitter Influence Ranking with System Theory , 2016, WEBIST.

[3]  Ioannis Hatzilygeroudis,et al.  Modeling ReTweet Diffusion Using Emotional Content , 2014, AIAI.

[4]  Christos Makris,et al.  On converting community detection algorithms for fuzzy graphs in Neo4j , 2016, ArXiv.

[5]  Brian W. Kernighan,et al.  An efficient heuristic procedure for partitioning graphs , 1970, Bell Syst. Tech. J..

[6]  Ioannis Hatzilygeroudis,et al.  Conversation Emotional Modeling in Social Networks , 2014, 2014 IEEE 26th International Conference on Tools with Artificial Intelligence.

[7]  Jean-Loup Guillaume,et al.  Fast unfolding of community hierarchies in large networks , 2008, ArXiv.

[8]  Christos Makris,et al.  T-PICE: Twitter Personality Based Influential Communities Extraction System , 2014, 2014 IEEE International Congress on Big Data.

[9]  Y. Saad,et al.  GMRES: a generalized minimal residual algorithm for solving nonsymmetric linear systems , 1986 .

[10]  Christos Makris,et al.  Eventually consistent cardinality estimation with applications in biodata mining , 2016, SAC.

[11]  Martin Rosvall,et al.  Maps of Information Flow Reveal Community Structure In Complex Networks , 2007 .

[12]  Lu-Xing Yang,et al.  A delayed computer virus propagation model and its dynamics , 2012 .

[13]  Amy Nicole Langville,et al.  Google's PageRank and beyond - the science of search engine rankings , 2006 .

[14]  Dickson K. W. Chiu,et al.  Identifying Personality-Based Communities in Social Networks , 2013, ER Workshops.

[15]  L. Brillouin Maxwell's Demon Cannot Operate: Information and Entropy. I , 1951 .

[16]  Huda Mutab Al Mutab Fuzzy Graphs , 2019, JOURNAL OF ADVANCES IN MATHEMATICS.

[17]  Camille Couprie,et al.  Power watersheds: A new image segmentation framework extending graph cuts, random walker and optimal spanning forest , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[18]  Kenneth DeJong,et al.  Learning with genetic algorithms: An overview , 1988, Machine Learning.

[19]  François Fouss,et al.  Random-Walk Computation of Similarities between Nodes of a Graph with Application to Collaborative Recommendation , 2007, IEEE Transactions on Knowledge and Data Engineering.

[20]  Jeffrey O. Kephart,et al.  Directed-graph epidemiological models of computer viruses , 1991, Proceedings. 1991 IEEE Computer Society Symposium on Research in Security and Privacy.

[21]  Leo Grady,et al.  A Seeded Image Segmentation Framework Unifying Graph Cuts And Random Walker Which Yields A New Algorithm , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[22]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[23]  Charalampos E. Tsourakakis Fast Counting of Triangles in Large Real Networks without Counting: Algorithms and Laws , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[24]  W. H. Zurek Algorithmic Information Content, Church — Turing Thesis, Physical Entropy, and Maxwell’s Demon , 1991 .

[25]  Christos Faloutsos,et al.  Graphs over time: densification laws, shrinking diameters and possible explanations , 2005, KDD '05.

[26]  Isidoros Perikos,et al.  Towards detecting emotional communities in Twitter , 2015, 2015 IEEE 9th International Conference on Research Challenges in Information Science (RCIS).

[27]  Doina Caragea,et al.  Graph Databases , 2019, Encyclopedia of Big Data Technologies.

[28]  Lang Tong,et al.  Blind identification and equalization based on second-order statistics: a time domain approach , 1994, IEEE Trans. Inf. Theory.

[29]  Andrea Lancichinetti,et al.  Community detection algorithms: a comparative analysis: invited presentation, extended abstract , 2009, VALUETOOLS.

[30]  Andreas Kanavos,et al.  Tensor-based document retrieval over Neo4j with an application to PubMed mining , 2016, 2016 7th International Conference on Information, Intelligence, Systems & Applications (IISA).

[31]  M E J Newman,et al.  Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[32]  P. Gács,et al.  Algorithms , 1992 .

[33]  Jitendra K. Tugnait,et al.  On channel estimation using superimposed training and first-order statistics , 2003, IEEE Communications Letters.

[34]  Jerry M. Mendel,et al.  Tutorial on higher-order statistics (spectra) in signal processing and system theory: theoretical results and some applications , 1991, Proc. IEEE.

[35]  Jure Leskovec,et al.  Empirical comparison of algorithms for network community detection , 2010, WWW '10.

[36]  Ronald B. Morgan,et al.  Implicitly Restarted GMRES and Arnoldi Methods for Nonsymmetric Systems of Equations , 2000, SIAM J. Matrix Anal. Appl..

[37]  Pierre Comon,et al.  Independent component analysis, A new concept? , 1994, Signal Process..

[38]  Vasileios Megalooikonomou,et al.  An adaptive higher order scheduling policy with an application to biosignal processing , 2016, 2016 IEEE Symposium Series on Computational Intelligence (SSCI).

[39]  Douglas M. Priest,et al.  Algorithms for arbitrary precision floating point arithmetic , 1991, [1991] Proceedings 10th IEEE Symposium on Computer Arithmetic.

[40]  Athanasios K. Tsakalidis,et al.  A Neo4j Implementation of Fuzzy Random Walkers , 2016, SETN.

[41]  Terrence J. Sejnowski,et al.  Enhanced detection of artifacts in EEG data using higher-order statistics and independent component analysis , 2007, NeuroImage.

[42]  Christos Faloutsos,et al.  Kronecker Graphs: An Approach to Modeling Networks , 2008, J. Mach. Learn. Res..

[43]  Christos Makris,et al.  Predicting Information Diffusion Patterns in Twitter , 2014, AIAI.

[44]  Mark Newman,et al.  Networks: An Introduction , 2010 .

[45]  Vasileios Megalooikonomou,et al.  Regularizing large biosignals with finite differences , 2016, 2016 7th International Conference on Information, Intelligence, Systems & Applications (IISA).

[46]  Kenneth A. De Jong,et al.  Using Genetic Algorithms to Solve NP-Complete Problems , 1989, ICGA.

[47]  John Scott Social Network Analysis , 1988 .

[48]  Eric Moulines,et al.  A blind source separation technique using second-order statistics , 1997, IEEE Trans. Signal Process..

[49]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[50]  Matthieu Latapy,et al.  Computing Communities in Large Networks Using Random Walks , 2004, J. Graph Algorithms Appl..

[51]  Christos Makris,et al.  Real Time Analytics for Measuring User Influence on Twitter , 2015, 2015 IEEE 27th International Conference on Tools with Artificial Intelligence (ICTAI).

[52]  Luciano Rossoni,et al.  Models and methods in social network analysis , 2006 .

[53]  C. M. Lim,et al.  Application of higher order statistics/spectra in biomedical signals--a review. , 2010, Medical engineering & physics.

[54]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[55]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[56]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[57]  K. D. Jong Learning with Genetic Algorithms: An Overview , 2005, Machine Learning.

[58]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[59]  Charu C. Aggarwal,et al.  Mining Text Data , 2012, Springer US.

[60]  Vasileios Megalooikonomou,et al.  Higher order graph centrality measures for Neo4j , 2015, 2015 6th International Conference on Information, Intelligence, Systems and Applications (IISA).