PubFocus: semantic MEDLINE/PubMed citations analytics through integration of controlled biomedical dictionaries and ranking algorithm

Background: Understanding research activity within any given biomedical field is important. Search outputs generated by MEDLINE/PubMed are not well classified and require lengthy manual citation analysis. Automating citation analytics can save considerable time for both novices and experts.

Results: The PubFocus web server automates the analysis of MEDLINE/PubMed search queries by enriching them with two widely used human factor-based bibliometric indicators of publication quality: journal impact factor and volume of forward references. In addition to providing basic volumetric statistics, PubFocus prioritizes citations and evaluates authors' impact on the field of search. It also analyses the presence and occurrence of biomedical key terms within citations by utilizing controlled vocabularies.

Conclusion: We have developed a citation prioritisation algorithm based on journal impact factor, forward referencing volume, referencing dynamics, and the author's contribution level. It can be applied either to the primary set of PubMed search results or to subsets of these results identified through key terms from controlled biomedical vocabularies and ontologies. The NCI (National Cancer Institute) Thesaurus and the MGD (Mouse Genome Database) mammalian gene orthology have been implemented for key term analytics. PubFocus provides a scalable platform for the integration of multiple available ontology databases. PubFocus analytics can be adapted for input sources of biomedical citations other than PubMed.
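
The abstract names the inputs to the prioritisation (journal impact factor, forward referencing volume, referencing dynamics, and author's contribution level) but not the formula that combines them. The following is a minimal illustrative sketch, not the PubFocus algorithm itself: the additive score, the per-year normalisation used as a stand-in for "referencing dynamics", the 1.0/0.5 author-position weights, and all names (`Citation`, `citation_score`, `rank_citations`, `with_key_term`, `author_impact`) are assumptions introduced here for illustration.

```python
from collections import defaultdict
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Citation:
    """One search result; all fields are hypothetical, not a PubMed schema."""
    pmid: str
    impact_factor: float   # journal impact factor
    forward_refs: int      # number of later papers citing this one
    pub_year: int
    authors: List[str] = field(default_factory=list)
    title: str = ""


def citation_score(c: Citation, current_year: int) -> float:
    """Assumed score: impact factor plus forward references per year."""
    age_years = max(current_year - c.pub_year, 0) + 1
    return c.impact_factor + c.forward_refs / age_years


def rank_citations(citations: List[Citation], current_year: int) -> List[Citation]:
    """Return citations ordered from highest to lowest assumed score."""
    return sorted(citations,
                  key=lambda c: citation_score(c, current_year),
                  reverse=True)


def with_key_term(citations: List[Citation], term: str) -> List[Citation]:
    """Subset whose title mentions a key term (case-insensitive substring)."""
    needle = term.lower()
    return [c for c in citations if needle in c.title.lower()]


def author_weight(position: int, n_authors: int) -> float:
    """Assumed contribution level: first and last authors count fully."""
    return 1.0 if position in (0, n_authors - 1) else 0.5


def author_impact(citations: List[Citation], current_year: int) -> Dict[str, float]:
    """Sum each author's position-weighted share of the citation scores."""
    totals: Dict[str, float] = defaultdict(float)
    for c in citations:
        score = citation_score(c, current_year)
        for pos, name in enumerate(c.authors):
            totals[name] += author_weight(pos, len(c.authors)) * score
    return dict(totals)
```

Under these assumptions, the same ranking can be run on the full result set or on a key-term subset, for example `rank_citations(with_key_term(results, "BRCA1"), 2007)`, where `results` is a list of `Citation` objects built by the caller.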
