Updating a bibliography using the related articles function within PubMed

Comprehensive bibliographies are useful for conducting reviews of the literature, and for assessing the progress within a field. These bibliographies may be broad and inclusive, or focused and precise in their inclusion criteria. In either case, the task of maintaining a complete bibliography within a particular area of research is made difficult by the diversity, complexity and huge volume of newly published literature. In an effort to effectively and automatically retrieve relevant literature, different search strategies and indexing tools have been developed, including the RELATED ARTICLES function provided with the PubMed system. In this paper, we report a program for incremental updates of a bibliography using the PubMed RELATED ARTICLES function. Given a highly specialized starting bibliography of experimental measurements of the structure of the 30S bacterial ribosomal subunit, the system was applied to find additional relevant references. For this particular task, the system has a recall of 75%, a strict precision of 32% and a partial precision of 42%. Our results are notable because although the RELATED ARTICLES function is purely statistical, it is nonetheless able to select a very narrowly defined set of articles from the literature. We discuss the tradeoffs between having a user to evaluate many articles of possible interest in a single session, versus asking a user to evaluate a small set of articles on a periodic basis.

[1]  M M Wagner,et al.  An automatic indexing method for medical documents. , 1991, Proceedings. Symposium on Computer Applications in Medical Care.

[2]  Burton J. Bogitsh,et al.  A COMPARISON OF TWO METHODS , 1975 .

[3]  C G Chute,et al.  An evaluation of concept based latent semantic indexing for clinical information retrieval. , 1992, Proceedings. Symposium on Computer Applications in Medical Care.

[4]  W R Hersh Evaluation of Meta-1 for a Concept-based Approach to the Automated Indexing and Retrieval of Bibliographic and Full-text Databases , 1991, Medical decision making : an international journal of the Society for Medical Decision Making.

[5]  William R. Hersh,et al.  MedWeaver: integrating decision support, literature searching, and Web exploration using the UMLS Metathesaurus , 1997, AMIA.

[6]  Y Yang,et al.  An analysis of statistical term strength and its use in the indexing and retrieval of molecular biology texts , 1996, Comput. Biol. Medicine.

[7]  E N Efthimiadis,et al.  Population groups: indexing, coverage, and retrieval effectiveness of ethnically related health care issues in health sciences databases. , 1996, Bulletin of the Medical Library Association.

[8]  R A Greenes,et al.  SAPHIRE--an information retrieval system featuring concept matching, automatic indexing, probabilistic retrieval, and hierarchical relationships. , 1990, Computers and biomedical research, an international journal.

[9]  R Rada,et al.  A vocabulary for medical informatics. , 1987, Computers and biomedical research, an international journal.

[10]  W R Hersh,et al.  A Comparison of Two Methods for Indexing and Retrieval from a Full-text Medical Database , 1992, Medical decision making : an international journal of the Society for Medical Decision Making.

[11]  Gp Purcell Contextual document models for searching the clinical literature , 1996 .

[12]  W R Hersh,et al.  Words, concepts, or both: optimal indexing units for automated information retrieval. , 1992, Proceedings. Symposium on Computer Applications in Medical Care.

[13]  W R Hersh,et al.  A comparison of retrieval effectiveness for three methods of indexing medical literature. , 1992, The American journal of the medical sciences.

[14]  C G Chute,et al.  Latent Semantic Indexing of medical diagnoses using UMLS semantic structures. , 1991, Proceedings. Symposium on Computer Applications in Medical Care.

[15]  R. Shalini,et al.  "Citation Profiles" to Improve Relevance in a Two-Stage Retrieval System: A Proposal , 1993, Inf. Process. Manag..

[16]  C G Chute,et al.  Words or concepts: the features of indexing units and their optimal use in information retrieval. , 1993, Proceedings. Symposium on Computer Applications in Medical Care.