Auditing the Sensitivity of Graph-based Ranking with Visual Analytics

Graph mining plays a pivotal role across a number of disciplines, and a variety of algorithms have been developed to answer who/what-type questions. For example, what items should we recommend to a given user on an e-commerce platform? The answers to such questions are typically returned as a ranked list, and graph-based ranking methods are widely used in industrial information retrieval settings. However, these ranking algorithms are sensitive in a variety of ways, and even small changes in rank can lead to vast reductions in product sales and page hits. There is therefore a need for tools and methods that help model developers and analysts explore how sensitive graph ranking algorithms are to perturbations of the graph structure. In this paper, we present a visual analytics framework for explaining and exploring the sensitivity of any graph-based ranking algorithm through perturbation-based what-if analysis. We demonstrate our framework through three case studies inspecting the sensitivity of two classic graph-based ranking algorithms (PageRank and HITS) as applied to rankings in political news media and social networks.
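To make the idea of perturbation-based what-if analysis concrete, the following is a minimal sketch and not the framework presented in the paper. It assumes a toy directed graph, a plain power-iteration PageRank with a damping factor of 0.85, and a single edge deletion as the perturbation. The function names (`pagerank`, `ranking`, `rank_shift`) and the example graph are hypothetical and chosen only for illustration.

```python
def pagerank(edges, nodes, damping=0.85, iters=100):
    """Power-iteration PageRank on a directed edge list (illustrative only)."""
    n = len(nodes)
    out = {u: [v for (s, v) in edges if s == u] for u in nodes}
    score = {u: 1.0 / n for u in nodes}
    for _ in range(iters):
        # Score held by dangling nodes (no out-links) is spread uniformly.
        dangling = sum(score[u] for u in nodes if not out[u])
        nxt = {u: (1.0 - damping) / n + damping * dangling / n for u in nodes}
        for u in nodes:
            for v in out[u]:
                nxt[v] += damping * score[u] / len(out[u])
        score = nxt
    return score


def ranking(score):
    """Map each node to its 1-based rank (1 = highest score, ties by name)."""
    ordered = sorted(score, key=lambda u: (-score[u], u))
    return {u: i + 1 for i, u in enumerate(ordered)}


def rank_shift(edges, nodes, removed_edge):
    """Rank change per node after deleting one edge (positive = moved down)."""
    before = ranking(pagerank(edges, nodes))
    perturbed = [e for e in edges if e != removed_edge]
    after = ranking(pagerank(perturbed, nodes))
    return {u: after[u] - before[u] for u in nodes}


# Hypothetical toy graph: what if the link d -> c disappeared?
nodes = ["a", "b", "c", "d"]
edges = [("a", "b"), ("b", "c"), ("c", "a"), ("d", "c"), ("a", "c")]
shift = rank_shift(edges, nodes, removed_edge=("d", "c"))
for node in nodes:
    print(node, shift[node])
```

Repeating `rank_shift` over every candidate edge would give a simple picture of which perturbations move which nodes. A visual analytics tool could present such a table interactively, although the specific views and sensitivity measures used in the paper are not described in this abstract.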
