Visualization of Relational Structure Among Scientific Articles

This paper describes an ongoing technique to collecting, mining, clustering and visualizing scientific articles and their relations in information science. We aim to provide a valuable tool for researchers in quick analyzing the relationship and retrieving the relevant documents. Our system, called CAVis, first automatically searches and retrieves articles from the Internet using given keywords. These articles are next converted into readable text documents. The system next analyzes these documents and it creates similarity matrix. A clustering algorithm is then applied to group the relevant papers into corresponding clusters. Finally, we provide a visual interface so that users can easily view the structure and the citing relations among articles. From the view, they can navigate through the collection as well as retrieve a particular article.

[1]  Katherine W. McCain,et al.  Visualizing a discipline: an author co-citation analysis of information science, 1972–1995 , 1998 .

[2]  Michalis Faloutsos,et al.  BIVTECI: a bibliographic visualization tool , 1996, CASCON.

[3]  M E J Newman,et al.  Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[4]  J. H. Ward Hierarchical Grouping to Optimize an Objective Function , 1963 .

[5]  Niklas Elmqvist,et al.  CiteWiz: a tool for the visualization of scientific citation networks , 2007 .

[6]  Chaomei Chen,et al.  Visualising Semantic Spaces and Author Co-Citation Networks in Digital Libraries , 1999, Inf. Process. Manag..

[7]  Mao Lin Huang,et al.  Visualization of large citation networks with space-efficient multi-layer optimization , 2007 .

[8]  Vijay V. Raghavan,et al.  Visualization of document co-citation counts , 2002, Proceedings Sixth International Conference on Information Visualisation.

[9]  Miguel A. Andrade-Navarro,et al.  Automatic extraction of keywords from scientific text: application to the knowledge domain of protein families , 1998, Bioinform..

[10]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[11]  Kang Zhang,et al.  FAÇADE: a fast and effective approach to the discovery of dense clusters in noisy spatial data , 2004, SIGMOD '04.

[12]  Chaomei Chen,et al.  Searching for intellectual turning points: Progressive knowledge domain visualization , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Ulrik Brandes,et al.  Visualization of Bibliographic Networks with a Reshaped Landscape Metaphor , 2002, VisSym.

[14]  Ramana Rao,et al.  An organic user interface for searching citation links , 1995, CHI '95.

[15]  Benjamin King Step-Wise Clustering Procedures , 1967 .

[16]  Ying Liu,et al.  Text Mining Biomedical Literature for Discovering Gene-to-Gene Relationships: A Comparative Study of Algorithms , 2005, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[17]  Chaomei Chen,et al.  Web site design with the patron in mind: A step-by-step guide for libraries , 2006 .

[18]  Peter Eades,et al.  A Heuristic for Graph Drawing , 1984 .

[19]  Mao Lin Huang,et al.  EncCon: An Approach to Constructing Interactive Visualization of Large Hierarchical Data , 2005, Inf. Vis..