PCAN: phenotype consensus analysis to support disease-gene association

BackgroundBridging genotype and phenotype is a fundamental biomedical challenge that underlies more effective target discovery and patient-tailored therapy. Approaches that can flexibly and intuitively, integrate known gene-phenotype associations in the context of molecular signaling networks are vital to effectively prioritize and biologically interpret genes underlying disease traits of interest.ResultsWe describe Phenotype Consensus Analysis (PCAN); a method to assess the consensus semantic similarity of phenotypes in a candidate gene’s signaling neighborhood. We demonstrate that significant phenotype consensus (p < 0.05) is observable for ~67% of 4,549 OMIM disease-gene associations, using a combination of high quality String interactions + Metabase pathways and use Joubert Syndrome to demonstrate the ease with which a significant result can be interrogated to highlight discriminatory traits linked to mechanistically related genes.ConclusionsWe advocate phenotype consensus as an intuitive and versatile method to aid disease-gene association, which naturally lends itself to the mechanistic deconvolution of diverse phenotypes. We provide PCAN to the community as an R package (http://bioconductor.org/packages/PCAN/) to allow flexible configuration, extension and standalone use or integration to supplement existing gene prioritization workflows.

[1]  Henning Hermjakob,et al.  The Reactome pathway knowledgebase , 2013, Nucleic Acids Res..

[2]  Shashidhar Pai,et al.  Genotype–phenotype correlation in CC2D2A-related Joubert syndrome reveals an association with ventriculomegaly and seizures , 2012, Journal of Medical Genetics.

[3]  Damian Smedley,et al.  Improved exome prioritization of disease genes through cross-species phenotype comparison , 2014, Genome research.

[4]  D. Galas,et al.  Bone dysplasia sclerosteosis results from loss of the SOST gene product, a novel cystine knot-containing protein. , 2001, American journal of human genetics.

[5]  P. Bork,et al.  A method and server for predicting damaging missense mutations , 2010, Nature Methods.

[6]  Jonathan C. Cohen,et al.  Molecular characterization of loss-of-function mutations in PCSK9 and identification of a compound heterozygote. , 2006, American journal of human genetics.

[7]  Deanna M. Church,et al.  ClinVar: public archive of relationships among sequence variation and human phenotype , 2013, Nucleic Acids Res..

[8]  Janan T Eppig,et al.  The mammalian phenotype ontology: enabling robust annotation and comparative analysis , 2009, Wiley interdisciplinary reviews. Systems biology and medicine.

[9]  Tomas W. Fitzgerald,et al.  Large-scale discovery of novel genetic causes of developmental disorders , 2014, Nature.

[10]  Michael Elhadad,et al.  CSI-OMIM - Clinical Synopsis Search in OMIM , 2011, BMC Bioinformatics.

[11]  D. Goldstein,et al.  Genic Intolerance to Functional Variation and the Interpretation of Personal Genomes , 2013, PLoS genetics.

[12]  Philip Resnik,et al.  Using Information Content to Evaluate Semantic Similarity in a Taxonomy , 1995, IJCAI.

[13]  Christian von Mering,et al.  STRING 8—a global view on proteins and their functional interactions in 630 organisms , 2008, Nucleic Acids Res..

[14]  Damian Smedley,et al.  PhenoDigm: analyzing curated annotations to associate animal models with human diseases , 2013, Database J. Biol. Databases Curation.

[15]  Peter N. Robinson,et al.  The Human Phenotype Ontology: Semantic Unification of Common and Rare Disease , 2015, American journal of human genetics.

[16]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[17]  Damian Smedley,et al.  The Human Phenotype Ontology project: linking molecular biology and disease through phenotype data , 2014, Nucleic Acids Res..

[18]  Damian Smedley,et al.  Walking the interactome for candidate prioritization in exome sequencing studies of Mendelian diseases , 2014, Bioinform..

[19]  Carolyn M Hutter,et al.  CC2D2A is mutated in Joubert syndrome and interacts with the ciliopathy-associated basal body protein CEP290. , 2008, American journal of human genetics.

[20]  Henning Hermjakob,et al.  The Reactome pathway Knowledgebase , 2015, Nucleic acids research.

[21]  Raphael Gottardo,et al.  Orchestrating high-throughput genomic analysis with Bioconductor , 2015, Nature Methods.

[22]  M. Vidal,et al.  Edgotype: a fundamental link between genotype and phenotype. , 2013, Current opinion in genetics & development.

[23]  Andrew D Paterson,et al.  CC2D2A, encoding a coiled-coil and C2 domain protein, causes autosomal-recessive mental retardation with retinitis pigmentosa. , 2008, American journal of human genetics.

[24]  I. Kola,et al.  A call to reform the taxonomy of human disease , 2011, Nature Reviews Drug Discovery.

[25]  Ian M. Carr,et al.  OVA: integrating molecular and physical phenotype data from multiple biomedical domain ontologies with variant filtering for enhanced variant prioritization , 2015, Bioinform..

[26]  Christian von Mering,et al.  STRING: known and predicted protein–protein associations, integrated and transferred across organisms , 2004, Nucleic Acids Res..

[27]  Tatiana A. Tatusova,et al.  Entrez Gene: gene-centered information at NCBI , 2004, Nucleic Acids Res..

[28]  Marcel H. Schulz,et al.  Clinical diagnostics in human genetics with semantic similarity searches in ontologies. , 2009, American journal of human genetics.

[29]  I. Glass,et al.  Joubert syndrome (and related disorders) (OMIM 213300) , 2007, European Journal of Human Genetics.

[30]  Eric Vilain,et al.  Clinical exome sequencing for genetic identification of rare Mendelian disorders. , 2014, JAMA.

[31]  Avitan Gefen,et al.  Syndrome to gene (S2G): in‐silico identification of candidate genes for human diseases , 2010, Human mutation.

[32]  Magalie S Leduc,et al.  Clinical whole-exome sequencing for the diagnosis of mendelian disorders. , 2013, The New England journal of medicine.

[33]  O. Griffith,et al.  OMIM (Online Mendelian Inheritance in Man) , 2014 .

[34]  D. Valle,et al.  Online Mendelian Inheritance In Man (OMIM) , 2000, Human mutation.