Searching biotechnology information: A case study

The elaboration of strategies for the effective search of biotechnology information is a challenging task. In fact, the large amount of data in the public domain on biotechnology products and technologies is scattered among many databases and provided in different formats of document. This situation can make particularly difficult the identification, the extraction, and the aggregation of the information that are needed for performing detailed patent or scientific analyses. The article presents a case study presenting different text-based approaches for searching and analyzing biotechnology information in patent and scientific literature using a series of exemplary searches on antibodies, a class of biological products having broad scientific and commercial interest. The results show the complexity of defining how biotechnology information is actually searchable through the variety of available resources and to what extent. Some major factors that should be taken into consideration when searches are performed for evaluating scientific/patent trends, selecting documents potentially relevant for patentability, or identifying valuable technical information.

[1]  Monika Henzinger,et al.  Search Technologies for the Internet , 2007, Science.

[2]  Michael Y. Galperin The Molecular Biology Database Collection: 2008 update , 2007, Nucleic Acids Res..

[3]  Dominique Guellec,et al.  Patent inflation in Europe , 2008 .

[4]  M. Waldrop,et al.  Science 2.0. , 2008, Scientific American.

[5]  Thomas E. Vanhecke,et al.  PubMed vs. HighWire Press: A head-to-head comparison of two medical literature search engines , 2007, Comput. Biol. Medicine.

[6]  Mike Tansey,et al.  The challenge of sustaining the research and innovation process , 2005 .

[7]  A. Valencia,et al.  Text-mining and information-retrieval services for molecular biology , 2005, Genome Biology.

[8]  R. Sodoyer,et al.  Monoclonal and recombinant antibodies, 30 years after ... , 2006, Human antibodies.

[9]  Michael Schroeder,et al.  GoPubMed: exploring PubMed with the Gene Ontology , 2005, Nucleic Acids Res..

[10]  Walter W. Powell,et al.  Biotechnology: Its origins, organization, and outputs , 2007 .

[11]  Christian Sternitzke,et al.  Reducing uncertainty in the patent application procedure – Insights from invalidating prior art in European patent applications , 2009 .

[12]  Anthony Arundel,et al.  OECD Biotechnology Statistics 2009 , 2006 .

[13]  G. Gann Xu,et al.  Patent sequence databases , 2002 .

[14]  Hyo Jeong Hong,et al.  Antibody engineering for the development of therapeutic antibodies. , 2005, Molecules and cells.

[15]  Jacob Köhler,et al.  Addressing the problems with life-science databases for traditional uses and systems biology , 2006, Nature Reviews Genetics.

[16]  Philip E. Bourne,et al.  Will a Biological Database Be Different from a Biological Journal? , 2005, PLoS Comput. Biol..

[17]  Cynthia Barcelon-Yang,et al.  Intellectual property management of biosequence information from a patent searching perspective , 2005 .

[18]  Gregory D. Schuler,et al.  Database resources of the National Center for Biotechnology Information: update , 2004, Nucleic acids research.

[19]  Monya Baker,et al.  Upping the ante on antibodies , 2005, Nature Biotechnology.

[20]  Alan F. Smeaton,et al.  On Combining Text and MeSH Searches to Improve the Retrieval of MEDLINE documents , 2006, CORIA.

[21]  Jeremy R.M. Scott When is a search not a search? The EPO approach , 2007 .

[22]  Anthony Arundel,et al.  OECD Biotechnology Statistics , 2009 .

[23]  Laura M. Felter,et al.  Google scholar, scirus, and the scholarly search revolution , 2005 .

[24]  K. Bretonnel Cohen,et al.  Getting Started in Text Mining , 2008, PLoS Comput. Biol..

[25]  Sophia Ananiadou,et al.  Text mining and its potential applications in systems biology. , 2006, Trends in biotechnology.

[26]  Evert Nijhof Subject analysis and search strategies – Has the searcher become the bottleneck in the search process? , 2007 .

[27]  Eugenio Archontopoulos Prior art search tools on the Internet and legal status of the results: a European Patent Office perspective , 2004 .

[28]  Pasquale Foglia,et al.  Patentability search strategies and the reformed IPC: A patent office perspective , 2007 .

[29]  Mounir Errami,et al.  eTBLAST: a web server to identify expert reviewers, appropriate journals and similar publications , 2007, Nucleic Acids Res..

[30]  Brandon Keim News feature: WikiMedia , 2007, Nature Medicine.

[31]  Mark Harper,et al.  A comparative study of patent sequence databases , 2008 .