Finding Our Way Through Phenotypes

Despite a large and multifaceted effort to understand the vast landscape of phenotypic data, their current form inhibits productive data analysis. The lack of a community-wide, consensus-based, human- and machine-interpretable language for describing phenotypes and their genomic and environmental contexts is perhaps the most pressing scientific bottleneck to integration across many key fields in biology, including genomics, systems biology, development, medicine, evolution, ecology, and systematics. Here we survey the current phenomics landscape, including data resources and handling, and the progress that has been made to accurately capture relevant data descriptions for phenotypes. We present an example of the kind of integration across domains that computable phenotypes would enable, and we call upon the broader biology community, publishers, and relevant funding agencies to support efforts to surmount today’s data barriers and facilitate analytical reproducibility.
