Introduction to bibliometrics for construction and maintenance of thesauri: Methodical considerations

The paper introduces bibliometrics to the research area of knowledge organization – more precisely in relation to construction and maintenance of thesauri. As such, the paper reviews related work that has been of inspiration for the assembly of a semi‐automatic, bibliometric‐based, approach for construction and maintenance. Similarly, the paper discusses the methodical considerations behind the approach. Eventually, the semi‐automatic approach is used to verify the applicability of bibliometric methods as a supplement to construction and maintenance of thesauri. In the context of knowledge organization, the paper outlines two fundamental approaches to knowledge organization, that is, the manual intellectual approach and the automatic algorithmic approach. Bibliometric methods belong to the automatic algorithmic approach, though bibliometrics do have special characteristics that are substantially different from other methods within this approach.

[1]  Birger Hjørland The classification of psychology : A case study in the classification of a knowledge field , 1998 .

[2]  Henk F. Moed,et al.  Mapping of science by combined co-citation and word analysis: II: Dynamical aspects , 1991, J. Am. Soc. Inf. Sci..

[3]  R. E. Burton,et al.  The “half‐life” of some scientific and technical literatures , 1960 .

[4]  Howard D. White,et al.  Author cocitation: A literature measure of intellectual structure , 1981, J. Am. Soc. Inf. Sci..

[5]  Henk F. Moed,et al.  Combining Mapping and Citation Analysis for Evaluative Bibliometric Purposes: A Bibliometric Study , 1999, J. Am. Soc. Inf. Sci..

[6]  Henry G. Small,et al.  Citation context analysis of a co-citation cluster: Recombinant-DNA , 1980, Scientometrics.

[7]  A. F. J. VAN RAAN,et al.  In matters of quantitative studies of science the fault of theorists is offering too little and asking too much , 1998, Scientometrics.

[8]  James D. Anderson,et al.  The nature of indexing: how humans and machines analyze messages and texts for retrieval - Part I: Research, and the nature of human indexing , 2001, Inf. Process. Manag..

[9]  Lorna K. Rees-Potter Dynamic thesaural systems: A bibliometric study of terminological and conceptual change in sociology and economics with application to the design of dynamic thesaural systems , 1989, Inf. Process. Manag..

[10]  Chava Nachmias,et al.  Research Methods in the Social Sciences , 1976 .

[11]  James D. Anderson,et al.  The nature of indexing: how humans and machines analyze messages and texts for retrieval - Part II: Machine indexing, and the allocation of human versus machine effort , 2001, Inf. Process. Manag..

[12]  Marti A. Hearst Automated Discovery of WordNet Relations , 2004 .

[13]  Chaomei Chen,et al.  Visualizing knowledge domains , 2005, Annu. Rev. Inf. Sci. Technol..

[14]  Donald Hindle,et al.  Noun Classification From Predicate-Argument Structures , 1990, ACL.

[15]  Padmini Srinivasan,et al.  Thesaurus Construction , 1992, Information Retrieval: Data Structures & Algorithms.

[16]  Lorna Katherine Rees-potter A Bibliometric Analysis Of Terminological And Conceptual Change In Sociology And Economics: With Application To The Design Of Dynamic Thesaural Systems (volumes I And Ii) , 1987 .

[17]  Henry G. Small,et al.  Co-citation in the scientific literature: A new measure of the relationship between two documents , 1973, J. Am. Soc. Inf. Sci..

[18]  Henry G. Small,et al.  Macro-level changes in the structure of co-citation clusters: 1983–1989 , 2005, Scientometrics.

[19]  Eugene Garfield,et al.  THE USE OF CITATION DATA IN WRITING THE HISTORY OF SCIENCE , 1964 .

[20]  T. Kuhn,et al.  The Structure of Scientific Revolutions. , 1964 .

[21]  J. Kruskal The Relationship between Multidimensional Scaling and Clustering , 1977 .

[22]  Virginia A. Lingle,et al.  Indexing and Abstracting in Theory and Practice , 2005 .

[23]  Bluma C. Peritz,et al.  On the Objectives of Citation Analysis: Problems of Theory and Method , 1992, J. Am. Soc. Inf. Sci..

[24]  Howard D. White,et al.  Pathfinder networks and author cocitation analysis: A remapping of paradigmatic information scientists , 2003, J. Assoc. Inf. Sci. Technol..

[25]  M. M. Kessler Comparison of the results of bibliographic coupling and analytic subject indexing , 1965 .

[26]  C. Lee Giles,et al.  Digital Libraries and Autonomous Citation Indexing , 1999, Computer.

[27]  Loet Leydesdorff,et al.  Theories of citation? , 1998, Scientometrics.

[28]  Birger Hjørland,et al.  Information Seeking and Subject Representation: An Activity-Theoretical Approach to Information Science , 1997 .

[29]  Henry G. Small,et al.  Citations and consilience in science , 1998, Scientometrics.

[30]  Plergiorgio Strata,et al.  Citation analysis , 1995, Nature.

[31]  Ronald Rousseau,et al.  Requirements for a cocitation similarity measure, with special reference to Pearson's correlation coefficient , 2003, J. Assoc. Inf. Sci. Technol..

[32]  John Scott Social Network Analysis , 1988 .

[33]  Kui-Lam Kwok,et al.  A probabilistic theory of indexing and similarity measure based on cited and citing documents , 1985, J. Am. Soc. Inf. Sci..

[34]  Sanjaya Mishra,et al.  Research methods in the social sciences , 2005 .

[35]  E. Garfield,et al.  Citation indexes for science. , 1956, Science.

[36]  Peter Willett,et al.  The limitations of term co-occurrence data for query expansion in document retrieval systems , 1991, J. Am. Soc. Inf. Sci..

[37]  W. Bruce Croft,et al.  Deriving concept hierarchies from text , 1999, SIGIR '99.

[38]  Mengxiong Liu,et al.  Progress in Documentation the Complexities of citation Practice: a Review of citation studies , 1993, J. Documentation.

[39]  Timothy Cribbin,et al.  Visualizing and tracking the growth of competing paradigms: Two case studies , 2002, J. Assoc. Inf. Sci. Technol..

[40]  Terttu Luukkonen,et al.  Why has Latour's theory of citations been ignored by the bibliometric community? discussion of sociological interpretations of citation analysis , 2006, Scientometrics.

[41]  William M. Shaw,et al.  Subject indexing and citation indexing--part I: Clustering structure in the cystic fibrosis document collection , 1990, Inf. Process. Manag..

[42]  Mark T. Maybury,et al.  Information Storage and Retrieval Systems , 2002, The Information Retrieval Series.

[43]  Jean King A review of bibliometric and other science indicators and their role in research evaluation , 1987, J. Inf. Sci..

[44]  Wolfgang Glänzel,et al.  A new methodological approach to bibliographic coupling and its application to the national, regional and institutional level , 2005, Scientometrics.

[45]  Giulio Sergio Roi,et al.  Contents, Vol. 41, 1994 , 1994 .

[46]  Dagobert Soergel,et al.  Indexing languages and thesauri : construction and maintenance , 1974 .

[47]  Jean Tague-Sutcliffe,et al.  An Introduction to Informetrics , 1992, Inf. Process. Manag..

[48]  D J PRICE,et al.  NETWORKS OF SCIENTIFIC PAPERS. , 1965, Science.

[49]  U. Miller,et al.  Thesaurus construction: problems and their roots , 1997, Inf. Process. Manag..

[50]  Takenobu Tokunaga,et al.  Combining multiple evidence from different types of thesaurus for query expansion , 1999, SIGIR '99.

[51]  F. Narin,et al.  Bibliometrics/Theory, Practice and Problems , 1994 .

[52]  M. M. Kessler Bibliographic coupling between scientific papers , 1963 .

[53]  W. Bruce Croft,et al.  An Association Thesaurus for Information Retrieval , 1994, RIAO.

[54]  O. Persson The intellectual base and research fronts of JASIS 1986–1990 , 1994 .

[55]  Katherine W. McCain,et al.  Visualizing a Discipline: An Author Co-Citation Analysis of Information Science, 1972-1995 , 1998, J. Am. Soc. Inf. Sci..

[56]  Peter Ingwersen,et al.  Data set isolation for bibliometric online analyses of research publications: fundamental methodological issues , 1997 .

[57]  Jesper W. Schneider,et al.  Mapping scientific frontiers: The quest for knowledge visualization , 2004, J. Assoc. Inf. Sci. Technol..

[58]  William M. Shaw,et al.  Subject indexing and citation indexing-- part II: An evaluation and comparison , 1990, Inf. Process. Manag..

[59]  E GARFIELD,et al.  Citation indexes for science; a new dimension in documentation through association of ideas. , 2006, Science.

[60]  Katherine W. McCain,et al.  Visualizing a discipline: an author co-citation analysis of information science, 1972–1995 , 1998 .

[61]  C. Borgman,et al.  Scholarly Communication and Bibliometrics. , 1992 .

[62]  Thed N. van Leeuwen,et al.  Critical comments on Institute for Scientific Information impact factors: a sample of inorganic molecular chemistry journals , 1999, J. Inf. Sci..

[63]  Anthony F. J. van Raan,et al.  Monitoring Scientific Developments from a Dynamic Perspective: Self-Organized Structuring to Map Neural Network Research , 1998, Journal of the American Society for Information Science.

[64]  Gerda Ruge,et al.  Experiments on Linguistically-Based Term Associations , 1992, Inf. Process. Manag..

[65]  Anthony F. J. van Raan,et al.  Advanced mapping of science and technology , 2006, Scientometrics.

[66]  L. R. Rasmussen,et al.  In information retrieval: data structures and algorithms , 1992 .

[67]  Michael B. Usher,et al.  Science in action , 1993, Nature.

[68]  David Bawden,et al.  Thesaurus Construction and Use: A Practical Manual , 2000 .

[69]  Howard D. White,et al.  Author cocitation analysis and Pearson's r , 2003, J. Assoc. Inf. Sci. Technol..

[70]  W. C. Adair,et al.  Citation indexes for scientific literature , 1955 .

[71]  William A. Woods,et al.  Conceptual Indexing: A Better Way to Organize Knowledge , 1997 .

[72]  Henk F. Moed,et al.  Mapping of science by combined co-citation and word analysis, I. Structural aspects , 1991, J. Am. Soc. Inf. Sci..

[73]  Michael H. MacRoberts,et al.  Problems of citation analysis , 1992, Scientometrics.

[74]  Carolyn J. Crouch,et al.  An approach to the automatic construction of global thesauri , 1990, Inf. Process. Manag..

[75]  P. Seglen,et al.  Citation rates and journal impact factors are not suitable for evaluation of research. , 1998, Acta orthopaedica Scandinavica.

[76]  Loet Leydesdorff,et al.  Between texts and contexts: Advances in theories of citation? (A rejoinder) , 1999, Scientometrics.

[77]  D. C. Blair,et al.  Language and Representation in Information Retrieval , 1990 .

[78]  Ray J. Paul,et al.  Fitting the jigsaw of citation: Information visualization in domain analysis , 2001, J. Assoc. Inf. Sci. Technol..

[79]  Hinrich Schütze,et al.  A Cooccurrence-Based Thesaurus and Two Applications to Information Retrieval , 1994, Inf. Process. Manag..

[80]  Peter Vinkler,et al.  Comparative investigation of frequency and strength of motives toward referencing. The reference threshold model , 1998, Scientometrics.

[81]  Gregory Grefenstette,et al.  Explorations in automatic thesaurus discovery , 1994 .

[82]  P. Sopp Cluster analysis. , 1996, Veterinary immunology and immunopathology.

[83]  Jonathan Furner,et al.  Scholarly communication and bibliometrics , 2005, Annu. Rev. Inf. Sci. Technol..

[84]  Mary Elizabeth Stevens,et al.  Statistical Association Methods for Mechanized Documentation. , 1967 .

[85]  Keith V. Trickey Thesaurus Construction and Use: A Practical Manual (4th ed.) , 2001 .

[86]  B. C. Griffith,et al.  The Structure of Scientific Literatures I: Identifying and Graphing Specialties , 1974 .

[87]  Maurice B. Line,et al.  Changes in the Use of Literature with Time--Obsolescence Revisited , 1993, Libr. Trends.

[88]  H. Small A Co-Citation Model of a Scientific Specialty: A Longitudinal Study of Collagen Research , 1977 .

[89]  Miranda Lee Pao,et al.  Retrieval effectiveness by semantic and citation searching , 1989, JASIS.

[90]  Susan E. Cozzens Split Citation Identity: A Case Study from Economics , 1982, J. Am. Soc. Inf. Sci..

[91]  Edie M. Rasmussen,et al.  Clustering Algorithms , 1992, Information Retrieval: Data Structures & Algorithms.

[92]  Michael H. MacRoberts,et al.  Problems of citation analysis: A critical review , 1989, JASIS.

[93]  M. White,et al.  A Qualitative Study of Citing Behavior: Contributions, Criteria, and Metalevel Documentation Concerns , 1997, The Library Quarterly.

[94]  Henry Black,et al.  Indexing and Abstracting , 1940, The Library Quarterly.

[95]  Vincent Kanade,et al.  Clustering Algorithms , 2021, Wireless RF Energy Transfer in the Massive IoT Era.

[96]  K. McCain,et al.  Visualization of Literatures. , 1997 .

[97]  Henry G. Small,et al.  Visualizing Science by Citation Mapping , 1999, J. Am. Soc. Inf. Sci..

[98]  Susan E. Cozzens,et al.  What do citations count? the rhetoric-first model , 1989, Scientometrics.

[99]  John O'Connor Biomedical citing statements: Computer recognition and use to aid full-text retrieval , 1983, Inf. Process. Manag..

[100]  Henry Small,et al.  Cited Documents as Concept Symbols , 1978 .

[101]  John A. Hartigan,et al.  Clustering Algorithms , 1975 .

[102]  Olle Persson,et al.  The Intellectual Base and Research Fronts of JASIS 1986-1990 , 1994, J. Am. Soc. Inf. Sci..

[103]  Norman Kaplan,et al.  The Sociology of Science: Theoretical and Empirical Investigations , 1974 .

[104]  Roger W. Schvaneveldt,et al.  Pathfinder associative networks: studies in knowledge organization , 1990 .

[105]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[106]  Michael H. MacRoberts,et al.  Another test of the normative theory of citing , 1987, J. Am. Soc. Inf. Sci..

[107]  Katherine W. McCain,et al.  Mapping authors in intellectual space: A technical overview , 1990, Journal of the American Society for Information Science.

[108]  J. Ravetz Sociology of Science , 1972, Nature.

[109]  Hsinchun Chen,et al.  Automatic Thesaurus Generation for an Electronic Community System , 1995, J. Am. Soc. Inf. Sci..

[110]  E. Garfield,et al.  Can Citation Indexing Be Automated ? , 1964 .

[111]  Blaise Cronin,et al.  The citation process: The role and significance of citations in scientific communication , 1984 .

[112]  Kevin W. Boyack,et al.  Mapping scientific frontiers : the quest for knowledge visualization. , 2003 .

[113]  Kevin W. Boyack,et al.  Domain visualization using VxInsight® for science and technology management , 2002, J. Assoc. Inf. Sci. Technol..

[114]  Eugene Garfield,et al.  Random thoughts on citationology its theory and practice , 1998, Scientometrics.

[115]  Anthony F. J. van Raan Little scientometrics, big scientometrics ... and beyond , 2005, Scientometrics.

[116]  Birger Hjørland,et al.  Domain analysis in information science Eleven approaches traditional as well as innovative , 2002 .

[117]  Ronald N. Kostoff,et al.  The use and misuse of citation analysis in research evaluation , 1998, Scientometrics.

[118]  W. Magnus,et al.  Organization of Knowledge , 1982 .

[119]  Loet Leydesdorff,et al.  Why Words and Co-Words Cannot Map the Development of the Sciences , 1997, J. Am. Soc. Inf. Sci..

[120]  Loet Leydesdorff,et al.  Various methods for the mapping of science , 1987, Scientometrics.

[121]  S. Cole Making Science: Between Nature and Society , 1992 .

[122]  Henk F. Moed,et al.  Mapping of Science : Critical elaboration and new approaches, a case study in agricultural biochemistry , 1988 .

[123]  Endre Száva-Kováts,et al.  Unfounded attribution of the "half-life" index-number of literature obsolescence to Burton and Kebler: A literature science study , 2002, J. Assoc. Inf. Sci. Technol..

[124]  Mark T. Maybury,et al.  Information Storage and Retrieval Systems: Theory and Implementation , 2000 .

[125]  Eugene Garfield,et al.  Validation of citation analysis , 1997 .

[126]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[127]  Janet Carson Scholarly Communication and Bibliometrics , 1993 .

[128]  Leo Egghe,et al.  Little science, big science... and beyond , 1994, Scientometrics.

[129]  F L Hoffman,et al.  The Organization of Knowledge , 1938, Nature.

[130]  M. Callon,et al.  From translations to problematic networks: An introduction to co-word analysis , 1983 .