Maps of Computer Science

We describe a practical approach for visual exploration of research papers. Specifically, we use the titles of papers from the DBLP database to create what we call maps of computer science (MoCS). Words and phrases extracted from the paper titles are the cities in the map, and countries are created based on word and phrase similarity, calculated using co-occurrence. Heatmap overlays on this base map visualize the profile of a particular conference or journal, of an individual researcher, or of a group such as a department. The visualization system also makes it possible to change the data used to generate the base map. For example, a specific journal or conference can be used to generate the base map, and heatmap overlays can then show the evolution of research topics in that field over the years. Profiles of individual researchers or research groups can likewise be visualized as heatmap overlays on a journal- or conference-specific base map. We outline a modular and extensible system for term extraction using natural language processing techniques, and show how information retrieval methods apply to calculating term similarity and creating a topic map. The system is available at mocs.cs.arizona.edu.
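As a rough illustration of similarity calculated from co-occurrence, the sketch below scores each pair of terms by how much the sets of titles containing them overlap. The abstract does not say which measure the system uses; the Jaccard coefficient, the plain substring matching, the function name `cooccurrence_similarity`, and the sample titles are all assumptions made for this example, not a description of the MoCS implementation.

```python
from itertools import combinations


def cooccurrence_similarity(titles, terms):
    """Jaccard similarity between terms, based on the titles each term occurs in.

    Hypothetical helper: the similarity measure and the matching rule are
    assumptions for illustration only.
    """
    # Map each term to the set of title indices that contain it.
    # Matching is a naive case-insensitive substring test.
    occurrences = {
        term: {i for i, title in enumerate(titles) if term in title.lower()}
        for term in terms
    }

    similarity = {}
    for a, b in combinations(terms, 2):
        union = occurrences[a] | occurrences[b]
        if union:  # skip pairs where neither term occurs in any title
            shared = occurrences[a] & occurrences[b]
            similarity[(a, b)] = len(shared) / len(union)
    return similarity


# Made-up titles, used only to exercise the function.
titles = [
    "Graph drawing by force-directed placement",
    "Force-directed graph drawing algorithms",
    "Latent topic models for text",
    "Visualizing topic models as maps",
]
terms = ["graph drawing", "force-directed", "topic models"]
sim = cooccurrence_similarity(titles, terms)
```

Here "graph drawing" and "force-directed" appear in exactly the same titles, so they would be placed close together, while "topic models" shares no title with either. In a map built this way, scores like these would serve as edge weights for the layout and for grouping terms into countries.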
