A comparative study of first and all-author co-citation counting, and two different matrix generation approaches applied for author co-citation analyses

AimThe present article contributes to the current methodological debate concerning author co-citation analyses. (ACA) The study compares two different units of analyses, i.e. first- versus inclusive all-author co-citation counting, as well as two different matrix generation approaches, i.e. a conventional multivariate and the so-called Drexel approach, in order to investigate their influence upon mapping results. The aim of the present study is therefore to provide more methodological awareness and empirical evidence concerning author co-citation studies.MethodThe study is based on structured XML documents extracted from the IEEE collection. These data allow the construction of ad-hoc citation indexes, which enables us to carry out the hitherto largest all-author co-citation study. Four ACA are made, combining the different units of analyses with the different matrix generation approaches. The results are evaluated quantitatively by means of multidimensional scaling, factor analysis, Procrustes and Mantel statistics.ResultsThe results show that the inclusion of all cited authors can provide a better fit of data in two-dimensional mappings based on MDS, and that inclusive all-author co-citation counting may lead to stronger groupings in the maps. Further, the two matrix generation approaches produce maps that have some resemblances, but also many differences at the more detailed levels. The Drexel approach produces results that have noticeably lower stress values and are more concentrated into groupings. Finally, the study also demonstrates the importance of sparse matrices and their potential problems in connection with factor analysis.ConclusionWe can confirm that inclusive all-ACA produce more coherent groupings of authors, whereas the present study cannot clearly confirm previous findings that first-ACA identifies more specialties, though some vague indication is given. Most crucially, strong evidence is given to the determining effect that matrix generation approaches have on the mapping of author co-citation data and thus the interpretation of such maps. Evidence is provided for the seemingly advantages of the Drexel approach.

[1]  Peter Ingwersen,et al.  Comparative study between first and all-author co-citation analysis based on citation indexes generated from XML data , 2007 .

[2]  D. Kendall,et al.  Mathematics in the Archaeological and Historical Sciences , 1971, The Mathematical Gazette.

[3]  Kevin W. Boyack,et al.  Identifying a better measure of relatedness for mapping science , 2006 .

[4]  Olle Persson All author citations versus first author citations , 2004, Scientometrics.

[5]  P. Groenen,et al.  Modern Multidimensional Scaling: Theory and Applications , 1999 .

[6]  Stephen J. Bensman,et al.  Classification and powerlaws: The logarithmic transformation , 2006, J. Assoc. Inf. Sci. Technol..

[7]  J. Nunnally Psychometric Theory (2nd ed), New York: McGraw-Hill. , 1978 .

[8]  Ronald Rousseau,et al.  Requirements for a cocitation similarity measure, with special reference to Pearson's correlation coefficient , 2003, J. Assoc. Inf. Sci. Technol..

[9]  Claude E. Shannon,et al.  The Mathematical Theory of Communication , 1950 .

[10]  S. Mulaik Foundations of Factor Analysis , 1975 .

[11]  Ronald Rousseau,et al.  Author cocitation analysis and Pearson's r , 2004, J. Assoc. Inf. Sci. Technol..

[12]  W. G Zel THE NEED FOR STANDARDS IN BIBLIOMETRIC RESEARCH AND TECHNOLOGY , 1996 .

[13]  Pia Borlund,et al.  Matrix comparison, Part 2: Measuring the resemblance between proximity measures or ordination results by use of the mantel and procrustes statistics , 2007, J. Assoc. Inf. Sci. Technol..

[14]  John C. Gower,et al.  Statistical methods of comparing different multivariate analyses of the same data , 1971 .

[15]  L. Cronbach Coefficient alpha and the internal structure of tests , 1951 .

[16]  Katherine W. McCain,et al.  Visualizing a discipline: an author co-citation analysis of information science, 1972–1995 , 1998 .

[17]  Jesper W. Schneider,et al.  Matrix comparison, Part 1: Motivation and important issues for measuring the resemblance between proximity measures or ordination results , 2007 .

[18]  C. Lee Giles,et al.  CiteSeer: an automatic citation indexing system , 1998, DL '98.

[19]  Chaomei Chen,et al.  Visualising Semantic Spaces and Author Co-Citation Networks in Digital Libraries , 1999, Inf. Process. Manag..

[20]  Kevin W Boyack,et al.  Mapping knowledge domains: Characterizing PNAS , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[21]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[22]  Gabriella Kazai,et al.  Overview of INEX 2005 , 2005, INEX.

[23]  S. Mulaik,et al.  Foundations of Factor Analysis , 1975 .

[24]  Loet Leydesdorff,et al.  Co-occurrence matrices and their applications in information science: Extending ACA to the Web environment , 2006 .

[25]  Dangzhi Zhao,et al.  Towards all-author co-citation analysis , 2006, Inf. Process. Manag..

[26]  Wolfgang Glänzel,et al.  The need for standards in bibliometric research and technology , 2005, Scientometrics.

[27]  P. Schönemann,et al.  Fitting one matrix to another under choice of a central dilation and a rigid motion , 1970 .

[28]  Loet Leydesdorff,et al.  Co-occurrence matrices and their applications in information science: Extending ACA to the Web environment , 2006, J. Assoc. Inf. Sci. Technol..

[29]  M. Chan,et al.  Risk factors for citation errors in peer-reviewed nursing journals. , 2001, Journal of advanced nursing.

[30]  Howard D. White,et al.  Pathfinder networks and author cocitation analysis: A remapping of paradigmatic information scientists , 2003, J. Assoc. Inf. Sci. Technol..

[31]  Derek J. de Solla Price,et al.  The analysis of square matrices of scientometric transactions , 2005, Scientometrics.

[32]  Patrick J. F. Groenen,et al.  Modern Multidimensional Scaling: Theory and Applications , 2003 .

[33]  Katherine W. McCain,et al.  Visualizing a Discipline: An Author Co-Citation Analysis of Information Science, 1972-1995 , 1998, J. Am. Soc. Inf. Sci..

[34]  P. Green,et al.  Analyzing multivariate data , 1978 .

[35]  Howard D. White,et al.  Author cocitation: A literature measure of intellectual structure , 1981, J. Am. Soc. Inf. Sci..

[36]  Ronald Rousseau,et al.  A classification of author co-citations: Definitions and search strategies , 2004, J. Assoc. Inf. Sci. Technol..

[37]  Katherine W. McCain,et al.  Mapping authors in intellectual space: A technical overview , 1990, J. Am. Soc. Inf. Sci..

[38]  Andreas Strotmann,et al.  Can citation analysis of Web publications better detect research fronts? , 2007, J. Assoc. Inf. Sci. Technol..

[39]  N. Mantel The detection of disease clustering and a generalized regression approach. , 1967, Cancer research.