Bibliometric analysis of CiteSeer data for countries

This article describes the results of our analysis of data from the CiteSeer digital library. First, we examined the data by the top-level Internet domains of the sources from which they were collected. Second, we measured country shares of the publications indexed by CiteSeer and compared them with shares based on mainstream bibliographic data from the Web of Science and Scopus. Third, we analyzed publications and their citations aggregated by country. In this way, we generated rankings of the most influential countries in computer science using several non-recursive and recursive methods, such as citation counts and PageRank, respectively. We conclude that, even though East Asian countries are underrepresented in CiteSeer, its data may well be used alongside conventional bibliographic databases for comparing the computer science research productivity and performance of countries.
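
To illustrate the difference between a non-recursive and a recursive ranking, here is a minimal Python sketch on a toy country-level citation graph. It is not the procedure used in the study: the country labels `A`, `B`, `C`, the edge weights, the damping factor of 0.85, and the function names `citation_counts` and `pagerank` are all assumptions made for illustration.

```python
from collections import defaultdict

# Hypothetical data: (citing country, cited country) -> number of citations.
EDGES = {
    ("A", "B"): 3,
    ("B", "A"): 1,
    ("C", "A"): 2,
    ("C", "B"): 2,
}


def citation_counts(edges):
    """Non-recursive score: total citations received by each country."""
    counts = defaultdict(int)
    for (src, dst), weight in edges.items():
        counts[src] += 0  # make sure countries that are never cited appear
        counts[dst] += weight
    return dict(counts)


def pagerank(edges, damping=0.85, iterations=100):
    """Recursive score: weighted PageRank computed by power iteration."""
    nodes = sorted({node for pair in edges for node in pair})
    n = len(nodes)
    out_weight = {node: 0.0 for node in nodes}
    for (src, _), weight in edges.items():
        out_weight[src] += weight

    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        # Rank held by countries that cite nobody is spread over all nodes.
        dangling = sum(rank[node] for node in nodes if out_weight[node] == 0)
        base = (1.0 - damping) / n + damping * dangling / n
        new_rank = {node: base for node in nodes}
        for (src, dst), weight in edges.items():
            # Each citing country passes on its rank in proportion
            # to the share of its citations that go to the cited country.
            new_rank[dst] += damping * rank[src] * weight / out_weight[src]
        rank = new_rank
    return rank


if __name__ == "__main__":
    counts = citation_counts(EDGES)
    ranks = pagerank(EDGES)
    for country in sorted(ranks, key=ranks.get, reverse=True):
        print(country, counts[country], round(ranks[country], 4))
```

In this sketch, citation counts depend only on how many citations a country receives, whereas PageRank also weighs who the citations come from and how many other countries the citing country cites, so the two rankings need not agree.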
