Evaluations of context-based co-citation searching

Since machine-readable documents have become widespread, some recent studies have proposed retrieval methods using a combination of citation linkage and its context. In the case of co-citation linkage, there have been attempts to discern ‘strong’ co-citations from ‘weak’ ones by examining the positions of citations in a document. However, this promising concept has not yet been sufficiently evaluated, and it remains unclear whether search performance is significantly improved. Therefore, this paper explores the effects of using co-citation context more deeply and more widely by comparing the search performance of six retrieval methods, which differ as to whether co-citation context and normalization using cited frequency are used. For empirically evaluating the effects, a special test collection was created from CiteSeer Metadata, and the search performances of the six retrieval methods were compared by two IR metrics (AP and nDCG). The main conclusions of this paper are: (1) co-citation context has a positive effect on co-citation searching; (2) the normalization technique using cited frequency is useful for context-based co-citation searching; (3) approaches of using co-citation context tend to affect the characteristics of search performance.

[1]  Oliver A. McBryan,et al.  GENVL and WWWW: Tools for taming the Web , 1994, WWW Spring 1994.

[2]  Miranda Lee Pao,et al.  Concepts of Information Retrieval , 1989 .

[3]  Achim G. Hoffmann,et al.  A New Approach for Scientific Citation Classification Using Cue Phrases , 2003, Australian Conference on Artificial Intelligence.

[4]  Henk F. Moed,et al.  Mapping of Science : Critical elaboration and new approaches, a case study in agricultural biochemistry , 1988 .

[5]  Jon M. Kleinberg,et al.  Automatic Resource Compilation by Analyzing Hyperlink Structure and Associated Text , 1998, Comput. Networks.

[6]  Tim Brody,et al.  Evaluating Research Impact through Open Access to Scholarly Communication , 2006 .

[7]  Marc Najork,et al.  Computing Information Retrieval Performance Measures Efficiently in the Presence of Tied Scores , 2008, ECIR.

[8]  Ben-Ami Lipetz,et al.  Improvement of the selectivity of citation indexes to science literature through inclusion of citation relationship indicators , 1965 .

[9]  Miranda Lee Pao,et al.  Term and Citation Retrieval: A Field Study , 1993, Inf. Process. Manag..

[10]  E. Nadel,et al.  Citation and co-citation indicators of a phased impact of the BCS theory in the physics of superconductivity , 1981, Scientometrics.

[11]  E. B. Duncan,et al.  Qualified Citation Indexing: Its Relevance to Educational Technology. , 1981 .

[12]  Yoshihiko Nankaku,et al.  A trainable singing voice synthesis system capable of representing personal characteristics and singing styles , 2008 .

[13]  Blaise Cronin,et al.  The citation process: The role and significance of citations in scientific communication , 1984 .

[14]  G. Salton,et al.  A citation study of computer science literature , 1979, IEEE Transactions on Professional Communication.

[15]  Jöran Beel,et al.  Citation Proximity Analysis (CPA) : A New Approach for Identifying Related Work Based on Co-Citation Analysis , 2009 .

[16]  John Tait,et al.  Proceedings of the Workshop on How Can Computational Linguistics Improve Information Retrieval , 2006 .

[17]  John O'Connor,et al.  Citing statements: Computer recognition and use to improve retrieval , 1982, Inf. Process. Manag..

[18]  Yoshiteru Nakamori,et al.  Detecting Citation Types Using Finite-State Machines , 2006, PAKDD.

[19]  Jan M. Rabaey,et al.  Comparison of Methods , 2004 .

[20]  Howard D. White,et al.  Author cocitation: A literature measure of intellectual structure , 1981, J. Am. Soc. Inf. Sci..

[21]  Stephen E. Robertson,et al.  Comparing citation contexts for information retrieval , 2008, CIKM '08.

[22]  Henry G. Small,et al.  Co-citation in the scientific literature: A new measure of the relationship between two documents , 1973, J. Am. Soc. Inf. Sci..

[23]  Gerard Salton,et al.  Associative Document Retrieval Techniques Using Bibliographic Information , 1963, JACM.

[24]  Henry G. Small,et al.  Clustering thescience citation index® using co-citations , 1985, Scientometrics.

[25]  Shannon Bradshaw,et al.  Reference Directed Indexing: Redeeming Relevance for Subject Search in Citation Indexes , 2003, ECDL.

[26]  Simone Teufel,et al.  Automatic classification of citation function , 2006, EMNLP.

[27]  Simone Teufel,et al.  How to Find Better Index Terms Through Citations , 2006 .

[28]  H. D. White Citation Analysis and Discourse Analysis Revisited. , 2004 .

[29]  Karen Spärck Jones A statistical interpretation of term specificity and its application in retrieval , 2021, J. Documentation.

[30]  Ronald Rousseau,et al.  Similarity measures in scientometric research: The Jaccard index versus Salton's cosine formula , 1989, Inf. Process. Manag..

[31]  B. C. Griffith,et al.  The Structure of Scientific Literatures I: Identifying and Graphing Specialties , 1974 .

[32]  Henry G. Small,et al.  Clustering thescience citation index® using co-citations - I. A comparison of methods , 1985, Scientometrics.

[33]  John O'Connor Biomedical citing statements: Computer recognition and use to aid full-text retrieval , 1983, Inf. Process. Manag..

[34]  Sara D. Knapp Cocitation Searching: Some Useful Strategies. , 1984 .

[35]  Noriko Kando,et al.  Classification of research papers using citation links and citation types: Towards automatic review article generation. , 2011 .

[36]  E. Garfield,et al.  The geography of science: disciplinary and national mappings , 1985 .

[37]  Irena V. Marshakova-shaikevich System of Document Connections Based on References , 2009 .

[38]  Stephen E. Robertson,et al.  Using Terms from Citations for IR: Some First Results , 2008, ECIR.

[39]  Julie Bichteler,et al.  The combined use of bibliographic coupling and cocitation for document retrieval , 1980, J. Am. Soc. Inf. Sci..

[40]  Henry Voos,et al.  Are All Citations Equal? Or, Did We Op. Cit. Your Idem?. , 1976 .

[41]  Naoki Shibata,et al.  Comparative study on methods of detecting research fronts using different types of citation , 2009, J. Assoc. Inf. Sci. Technol..

[42]  Henry G. Small,et al.  Visualizing Science by Citation Mapping , 1999, J. Am. Soc. Inf. Sci..

[43]  Alison Callahan,et al.  Contextual cocitation: Augmenting cocitation analysis and its applications , 2010, J. Assoc. Inf. Sci. Technol..

[44]  Birger Larsen,et al.  References and citations in automatic indexing and retrieval systems - experiments with the boomerang effect , 2004 .

[45]  Henry G. Small,et al.  Critical thresholds for co-citation clusters and emergence of the giant component , 2009, J. Informetrics.

[46]  Dragomir R. Radev,et al.  Blind men and elephants: What do citation summaries tell us about a research article? , 2008, J. Assoc. Inf. Sci. Technol..

[47]  Jörg Sander,et al.  Focused Co-citation: Improving the Retrieval of Related Pages on the Web , 2003, WWW.

[48]  Henry G. Small,et al.  Update on science mapping: Creating large document spaces , 1997, Scientometrics.

[49]  Michael H. MacRoberts,et al.  Problems of citation analysis: A critical review , 1989, JASIS.

[50]  Peter Ingwersen,et al.  Using citations for ranking in digital libraries , 2006, Proceedings of the 6th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL '06).

[51]  Jöran Beel,et al.  Identifying Related Documents For Research Paper Recommender By CPA And COA , 2009, WCE 2009.

[52]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[53]  Jaana Kekäläinen,et al.  Cumulated gain-based evaluation of IR techniques , 2002, TOIS.

[54]  M. M. Kessler Bibliographic coupling between scientific papers , 1963 .

[55]  Paul Nicholls,et al.  Introduction to informetrics: Quantitative methods in library, documentation and information science , 1991 .

[56]  Robert E. Mercer,et al.  Towards an Automated Citation Classifier , 2000, Canadian Conference on AI.

[57]  Michel Beigbeder,et al.  Web Co-citation: Discovering Relatedness Between Scientific Papers , 2007, AWIC.

[58]  Günter Krampen,et al.  On the validity of citation counting in science evaluation: Content analyses of references and citations in psychological publications , 2007, Scientometrics.

[59]  Christina Courtright,et al.  Context in information behavior research , 2007 .

[60]  Plergiorgio Strata,et al.  Citation analysis , 1995, Nature.