Empowering industrial research with shared biomedical vocabularies.

The life science industries (including pharmaceuticals, agrochemicals and consumer goods) are exploring new business models for research and development that focus on external partnerships. In parallel, there is a desire to make better use of data obtained from sources such as human clinical samples to inform and support early research programmes. Success in both areas depends upon the successful integration of heterogeneous data from multiple providers and scientific domains, something that is already a major challenge within the industry. This issue is exacerbated by the absence of agreed standards that unambiguously identify the entities, processes and observations within experimental results. In this article we highlight the risks to future productivity that are associated with incomplete biological and chemical vocabularies and suggest a new model to address this long-standing issue.

[1]  Eric K. Neumann,et al.  Identifying disease-causal genes using Semantic Web-based representation of integrated genomic and phenomic knowledge , 2008, J. Biomed. Informatics.

[2]  Ted Slater,et al.  Beyond data integration. , 2008, Drug discovery today.

[3]  Kay Nieselt,et al.  Integrative Systems Biology Visualization with MAYDAY , 2010, J. Integr. Bioinform..

[4]  Werner Ceusters,et al.  Towards a Reference Terminology for Ontology Research and Development in the Biomedical Domain , 2006, KR-MED.

[5]  S. Friend,et al.  The Need for Precompetitive Integrative Bionetwork Disease Model Building , 2010, Clinical pharmacology and therapeutics.

[6]  John Quackenbush,et al.  Data reporting standards: making the things we use better , 2009, Genome Medicine.

[7]  Amit P. Sheth,et al.  An ontology-driven semantic mashup of gene and biological pathway information: Application to the domain of nicotine dependence , 2008, J. Biomed. Informatics.

[8]  Pantelis Sopasakis,et al.  Collaborative development of predictive toxicology applications , 2010, J. Cheminformatics.

[9]  Stephan C Schürer,et al.  BioAssay Ontology Annotations Facilitate Cross-Analysis of Diverse High-Throughput Screening Data Sets , 2011, Journal of biomolecular screening.

[10]  C. Bizer,et al.  Enabling Tailored Therapeutics with Linked Data , 2009 .

[11]  Damian Smedley,et al.  Sustaining the Data and Bioresource Commons , 2010, Science.

[12]  Alan Mcnaught,et al.  The IUPAC international chemical identifier : InChl-A new standard for molecular informatics , 2006 .

[13]  Klaar Vanopstal,et al.  Vocabularies and Retrieval Tools in Biomedicine: Disentangling the Terminological Knot , 2011, Journal of Medical Systems.

[14]  F. Collins Reengineering Translational Science: The Time Is Right , 2011, Science Translational Medicine.

[15]  John Wilbanks,et al.  'Omics Data Sharing , 2009, Science.

[16]  Janice Branson,et al.  Epigenetic Modification of the FMR1 Gene in Fragile X Syndrome Is Associated with Differential Response to the mGluR5 Antagonist AFQ056 , 2011, Science Translational Medicine.

[17]  Lee Harland,et al.  Lowering industry firewalls: pre-competitive informatics initiatives in drug discovery , 2009, Nature Reviews Drug Discovery.

[18]  Nicolas Le Novère,et al.  MIRIAM Resources: tools to generate and resolve robust cross-references in Systems Biology , 2007, BMC Systems Biology.

[19]  Barry Smith,et al.  An Ontology for Carcinoma Classification for Clinical Bioinformatics , 2005, MIE.

[20]  Huajun Chen,et al.  The use of web ontology languages and other semantic web tools in drug discovery , 2010, Expert opinion on drug discovery.

[21]  P. Bork,et al.  Drug Target Identification Using Side-Effect Similarity , 2008, Science.

[22]  Oliver Hofmann,et al.  ISA software suite: supporting standards-compliant experimental annotation and enabling curation at the community level , 2010, Bioinform..

[23]  S. J. Campbell,et al.  Visualizing the drug target landscape. , 2010, Drug discovery today.

[24]  Marco D Sorani,et al.  Clinical and biological data integration for biomarker discovery. , 2010, Drug discovery today.

[25]  Bin Chen,et al.  Chem2Bio2RDF: a semantic framework for linking and data mining chemogenomic and systems chemical biology data , 2010, BMC Bioinformatics.

[26]  David B. Searls,et al.  Can literature analysis identify innovation drivers in drug discovery? , 2009, Nature Reviews Drug Discovery.

[27]  David B. Searls,et al.  Literature mining in support of drug discovery , 2008, Briefings Bioinform..

[28]  Ron Kikinis,et al.  Computational neuroanatomy: ontology-based representation of neural components and connectivity , 2009, BMC Bioinformatics.

[29]  Ross D. King,et al.  An Ontology for Description of Drug Discovery Investigations , 2010, J. Integr. Bioinform..

[30]  David B. Searls,et al.  Data integration: challenges for drug discovery , 2005, Nature Reviews Drug Discovery.

[31]  Carole A. Goble,et al.  RightField: embedding ontology annotation in spreadsheets , 2011, Bioinform..

[32]  Carole Goble,et al.  The Montagues and the Capulets , 2004, Comparative and functional genomics.

[33]  Susie Stephens,et al.  Is open innovation the way forward for big pharma? , 2010, Nature Reviews Drug Discovery.

[34]  Tom Heath,et al.  Linked Data: Evolving the Web into a Global Data Space , 2011, Linked Data.

[35]  Putting data to work. , 2002, Lippincott's case management : managing the process of patient care.

[36]  J. Cimino Desiderata for Controlled Medical Vocabularies in the Twenty-First Century , 1998, Methods of Information in Medicine.

[37]  Arguments Against Precompetitive Collaboration , 2010, Clinical pharmacology and therapeutics.

[38]  Rob W.W. Hooft,et al.  The value of data , 2011, Nature Genetics.