Hybrid citation-word representations in science mapping: Portolan charts of research fields?

The mapping of scientific fields, based on principles established in the 1970s, has developed remarkably in recent years, and applications are now booming as computing becomes more efficient. We examine here the convergence of two thematic mapping approaches, citation-based and word-based, which rest on quite different sociological backgrounds. A corpus in the nanoscience field was broken down into research themes by applying the same clustering technique to the two networks separately. The tool for comparison is the table of intersections of the M clusters (here M = 50) built on each side. A classical visual exploitation of such contingency tables is based on correspondence analysis. We investigate a rearrangement of the intersection table (block modeling), which results in a pseudo-map. The value of this representation for comparing the two breakdowns is discussed. The amount of convergence found is, in our view, a strong argument in favor of the reliability of bibliometric mapping. However, the outcomes do not converge to the degree that they can be substituted for each other. The differences highlight the complementarity between approaches based on different networks. In contrast with the strong informetric posture found in recent literature, where lexical and citation markers are treated as miscible tokens, the framework proposed here does not mix the two elements at an early stage, in keeping with their contrasting logics. © 2011 Wiley Periodicals, Inc.
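
As an illustration of the comparison tool described in the abstract, the following minimal sketch builds a table of intersections between two partitions of the same documents. It assumes that each document carries one cluster label per side, encoded as an integer from 0 to M-1; the function name `intersection_table` and the toy labels are hypothetical and are not taken from the article, which does not specify an implementation.

```python
import numpy as np


def intersection_table(citation_labels, word_labels, m):
    """Return an m x m table whose cell (i, j) counts the documents placed in
    citation cluster i and word cluster j (hypothetical helper)."""
    a = np.asarray(citation_labels)
    b = np.asarray(word_labels)
    if a.shape != b.shape:
        raise ValueError("both partitions must label the same documents")
    table = np.zeros((m, m), dtype=int)
    # np.add.at accumulates correctly when the same (i, j) pair occurs several times.
    np.add.at(table, (a, b), 1)
    return table


# Toy data (assumed): six documents, three clusters on each side.
citation_labels = [0, 0, 1, 1, 2, 2]
word_labels = [0, 0, 1, 2, 2, 2]
table = intersection_table(citation_labels, word_labels, m=3)
print(table)
```

In this toy case the two partitions agree on citation clusters 0 and 2, while citation cluster 1 is split between two word clusters. Off-diagonal mass of this kind is what a correspondence analysis or a block rearrangement of the table would make visible.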
