CiteRivers: Visual Analytics of Citation Patterns

The exploration and analysis of scientific literature collections is an important task for effective knowledge management. Past interest in such document sets has spurred the development of numerous visualization approaches for their interactive analysis. They either focus on the textual content of publications, or on document metadata including authors and citations. Previously presented approaches for citation analysis aim primarily at the visualization of the structure of citation networks and their exploration. We extend the state-of-the-art by presenting an approach for the interactive visual analysis of the contents of scientific documents, and combine it with a new and flexible technique to analyze their citations. This technique facilitates user-steered aggregation of citations which are linked to the content of the citing publications using a highly interactive visualization approach. Through enriching the approach with additional interactive views of other important aspects of the data, we support the exploration of the dataset over time and enable users to analyze citation patterns, spot trends, and track long-term developments. We demonstrate the strengths of our approach through a use case and discuss it based on expert user feedback.

[1]  John T. Stasko,et al.  Combining Computational Analyses and Interactive Visualization for Document Exploration and Sensemaking in Jigsaw , 2013, IEEE Transactions on Visualization and Computer Graphics.

[2]  M. Newman Coauthorship networks and patterns of scientific collaboration , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[4]  Jean-Daniel Fekete,et al.  This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS , 2022 .

[5]  Katy Börner,et al.  Atlas of Science - Visualizing What We Know , 2010 .

[6]  Matthias Hein,et al.  Spectral clustering based on the graph p-Laplacian , 2009, ICML '09.

[7]  Jean-Daniel Fekete,et al.  Excentric Labeling: Dynamic Neighborhood Labeling for Data Visualization , 2003 .

[8]  Thorsten Joachims,et al.  Identifying Temporal Patterns and Key Players in Document Collections , 1995 .

[9]  Jie Tang,et al.  ArnetMiner: extraction and mining of academic social networks , 2008, KDD.

[10]  Weimao Ke,et al.  Major Information Visualization Authors, Papers and Topics in the ACM Library , 2004 .

[11]  Jevin D. West Eigenfactor: ranking and mapping scientific knowledge , 2010 .

[12]  Susan T. Dumais,et al.  PivotPaths: Strolling through Faceted Information Spaces , 2012, IEEE Transactions on Visualization and Computer Graphics.

[13]  Xin Tong,et al.  TextFlow: Towards Better Understanding of Evolving Topics in Text , 2011, IEEE Transactions on Visualization and Computer Graphics.

[14]  Jaegul Choo,et al.  UTOPIAN: User-Driven Topic Modeling Based on Interactive Nonnegative Matrix Factorization , 2013, IEEE Transactions on Visualization and Computer Graphics.

[15]  Thomas Ertl,et al.  Visual Classifier Training for Text Document Retrieval , 2012, IEEE Transactions on Visualization and Computer Graphics.

[16]  Fernanda B. Viégas,et al.  Visualizing email content: portraying relationships from conversational histories , 2006, CHI.

[17]  Lucy T. Nowell,et al.  ThemeRiver: Visualizing Thematic Changes in Large Document Collections , 2002, IEEE Trans. Vis. Comput. Graph..

[18]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[19]  Paul Rayson,et al.  Comparing Corpora using Frequency Profiling , 2000, Proceedings of the workshop on Comparing corpora -.

[20]  James J. Thomas,et al.  Visualizing the non-visual: spatial analysis and interaction with information from text documents , 1995, Proceedings of Visualization 1995 Conference.

[21]  Carlos Guestrin,et al.  Beyond keyword search: discovering relevant scientific literature , 2011, KDD.

[22]  Sebastian Koch,et al.  Visual Analysis and Dissemination of Scientific Literature Collections with SurVis , 2016, IEEE Transactions on Visualization and Computer Graphics.

[23]  P. Hanrahan,et al.  Flow map layout , 2005, IEEE Symposium on Information Visualization, 2005. INFOVIS 2005..

[24]  Thomas Ertl,et al.  Iterative Integration of Visual Insights during Scalable Patent Search and Analysis , 2011, IEEE Transactions on Visualization and Computer Graphics.

[25]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[26]  Cynthia A. Brewer,et al.  ColorBrewer.org: An Online Tool for Selecting Colour Schemes for Maps , 2003 .

[27]  Martin Wattenberg,et al.  Parallel Tag Clouds to explore and analyze faceted text corpora , 2009, 2009 IEEE Symposium on Visual Analytics Science and Technology.

[28]  Martin Wattenberg,et al.  Stacked Graphs – Geometry & Aesthetics , 2008, IEEE Transactions on Visualization and Computer Graphics.

[29]  Chaomei Chen,et al.  CiteSpace II: Detecting and visualizing emerging trends and transient patterns in scientific literature , 2006, J. Assoc. Inf. Sci. Technol..

[30]  J. Cheeger A lower bound for the smallest eigenvalue of the Laplacian , 1969 .

[31]  Fangzhao Wu,et al.  OpinionFlow: Visual Analysis of Opinion Diffusion on Social Media , 2014, IEEE Transactions on Visualization and Computer Graphics.

[32]  Chaomei Chen,et al.  Visualizing the Intellectual Structure with Paper-Reference Matrices , 2009, IEEE Transactions on Visualization and Computer Graphics.

[33]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[34]  Martin Wattenberg Baby names, visualization, and social data analysis , 2005 .

[35]  Dafna Shahaf,et al.  Metro maps of science , 2012, KDD.

[36]  Jeffrey Heer,et al.  Replication of the Keyword Extraction part of the paper "'Without the Clutter of Unimportant Words': Descriptive Keyphrases for Text Visualization" , 2019, ArXiv.

[37]  William Ribarsky,et al.  HierarchicalTopics: Visually Exploring Large Text Collections Using Topic Hierarchies , 2013, IEEE Transactions on Visualization and Computer Graphics.

[38]  Christian Posse,et al.  IN-SPIRE InfoVis 2004 Contest Entry , 2004, IEEE Symposium on Information Visualization.

[39]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[40]  Chaomei Chen,et al.  Delineating the citation impact of scientific discoveries , 2007, JCDL '07.

[41]  Mary Czerwinski,et al.  Understanding research trends in conferences using paperLens , 2005, CHI Extended Abstracts.

[42]  Marcos André Gonçalves,et al.  A brief survey of automatic methods for author name disambiguation , 2012, SGMD.

[43]  J. E. Hirsch,et al.  An index to quantify an individual's scientific research output , 2005, Proc. Natl. Acad. Sci. USA.

[44]  Qiang Zhang,et al.  TIARA: a visual exploratory text analytic system , 2010, KDD '10.

[45]  Daniel Fried,et al.  Maps of Computer Science , 2013, 2014 IEEE Pacific Visualization Symposium.

[46]  Ben Shneiderman,et al.  Rapid understanding of scientific paper collections: Integrating statistics, text analytics, and visualization , 2012, J. Assoc. Inf. Sci. Technol..

[47]  M. Sheelagh T. Carpendale,et al.  Lark: Coordinating Co-located Collaboration with Information Visualization , 2009, IEEE Transactions on Visualization and Computer Graphics.

[48]  Jaegul Choo,et al.  CiteVis : Exploring Conference Paper Citation Data Visually , 2013 .

[49]  C. Lee Giles,et al.  ParsCit: an Open-source CRF Reference String Parsing Package , 2008, LREC.

[50]  Le Song,et al.  WilmaScope Graph Visualisation , 2004, IEEE Symposium on Information Visualization.

[51]  K. Brner Atlas of Science: Visualizing What We Know , 2010 .

[52]  Michael Ley,et al.  The DBLP Computer Science Bibliography: Evolution, Research Issues, Perspectives , 2002, SPIRE.

[53]  Martin Wattenberg,et al.  Participatory Visualization with Wordle , 2009, IEEE Transactions on Visualization and Computer Graphics.