Visualizing Bibliographic Databases as Graphs and Mining Potential Research Synergies

Bibliographic databases are a prosperous field for data mining research and social network analysis. They contain rich information, which can be analyzed across different dimensions(e.g., author, year, venue, topic) and can be exploited in multiple ways. The representation and visualization of bibliographic databases as graphs and the application of data mining techniques can help us uncover interesting knowledge concerning potential synergies between researchers, possible matchings between researchers and venues, or even the ideal venue for presenting a research work. In this paper, we propose a novel representation model for bibliographic data, which combines co-authorship and content similarity information, and allows for the formation of scientific networks. Using a graph visualization tool from the biological domain, we are able to provide comprehensive visualizations that help us uncover hidden relations between authors and suggest potential synergies between researchers or groups.

[1]  Iraklis Varlamis,et al.  Text Relatedness Based on a Word Thesaurus , 2010, J. Artif. Intell. Res..

[2]  Alan F. Smeaton,et al.  Analysis of papers from twenty-five years of SIGIR conferences: what have we been doing for the last quarter of a century? , 2002, SIGIR Forum.

[3]  Mao Lin Huang,et al.  Visualization of Individual's Knowledge by Analyzing the Citation Networks , 2007, Computer Graphics, Imaging and Visualisation (CGIV 2007).

[4]  George Karypis,et al.  Multi.Objective Hypergraph Partitioning Algorithms for Cut and Maximum Subdomain Degree Minimization , 2003, ICCAD.

[5]  Jati K. Sengupta,et al.  Introduction to Information , 1993 .

[6]  Jie Tang,et al.  ArnetMiner: extraction and mining of academic social networks , 2008, KDD.

[7]  Weimao Ke,et al.  Major Information Visualization Authors, Papers and Topics in the ACM Library , 2004 .

[8]  Jian Pei,et al.  Understanding Importance of Collaborations in Co-authorship Networks: A Supportiveness Analysis Approach , 2009, SDM.

[9]  Alan F. Smeaton,et al.  Analysis of papers from twenty-five years of SIGIR conferences: what have we been doing for the last quarter of a century? , 2002, SIGF.

[10]  Michael Schroeder,et al.  Semantic Search with GoPubMed , 2009, REWERSE.

[11]  Jörg Sander,et al.  Analysis of SIGMOD's co-authorship graph , 2003, SGMD.

[12]  Thomas L. Griffiths,et al.  The Author-Topic Model for Authors and Documents , 2004, UAI.

[13]  Luis Alfonso Ureña López,et al.  Text Categorization using bibliographic records: beyond document content , 2005, Proces. del Leng. Natural.

[14]  Sofia Stamou,et al.  Semantic relatedness hits bibliographic data , 2009, WIDM.

[15]  Randy Goebel,et al.  DBconnect: mining research community on DBLP data , 2007, WebKDD/SNA-KDD '07.

[16]  Gerhard Weikum,et al.  Graph-based text classification: learn from your neighbors , 2006, SIGIR.

[17]  Yizhou Sun,et al.  BibNetMiner: mining bibliographic information networks , 2008, SIGMOD Conference.

[18]  Mao Lin Huang,et al.  Analysis and Visualization of Co-authorship Networks for Understanding Academic Collaboration and Knowledge Domain of Individual Researchers , 2006, International Conference on Computer Graphics, Imaging and Visualisation (CGIV'06).

[19]  Michael Schroeder,et al.  Unraveling Protein Networks with Power Graph Analysis , 2008, PLoS Comput. Biol..

[20]  Jano Moreira de Souza,et al.  Competence mining for virtual scientific community creation , 2004, Int. J. Web Based Communities.

[21]  Loïc Royer Unraveling the Structure and Assessing the Quality of Protein Interaction Networks with Power Graph Analysis , 2010, Ausgezeichnete Informatikdissertationen.

[22]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[23]  Takashi Yukawa,et al.  An Automated Research Paper Classification Method for the IPC system with the Concept Base , 2008, NTCIR.

[24]  Ryutaro Ichise,et al.  Community mining tool using bibliography data , 2005, Ninth International Conference on Information Visualisation (IV'05).