Clinical interpretation of CNVs with cross-species phenotype data

Background Clinical evaluation of CNVs identified via techniques such as array comparative genome hybridisation (aCGH) involves the inspection of lists of known and unknown duplications and deletions with the goal of distinguishing pathogenic from benign CNVs. A key step in this process is the comparison of the individual's phenotypic abnormalities with those associated with Mendelian disorders of the genes affected by the CNV. However, because often there is not much known about these human genes, an additional source of data that could be used is model organism phenotype data. Currently, almost 6000 genes in mouse and zebrafish are, when knocked out, associated with a phenotype in the model organism, but no disease is known to be caused by mutations in the human ortholog. Yet, searching model organism databases and comparing model organism phenotypes with patient phenotypes for identifying novel disease genes and medical evaluation of CNVs is hindered by the difficulty in integrating phenotype information across species and the lack of appropriate software tools. Methods Here, we present an integrated ranking scheme based on phenotypic matching, degree of overlap with known benign or pathogenic CNVs and the haploinsufficiency score for the prioritisation of CNVs responsible for a patient's clinical findings. Results We show that this scheme leads to significant improvements compared with rankings that do not exploit phenotypic information. We provide a software tool called PhenogramViz, which supports phenotype-driven interpretation of aCGH findings based on multiple data sources, including the integrated cross-species phenotype ontology Uberpheno, in order to visualise gene-to-phenotype relations. Conclusions Integrating and visualising cross-species phenotype information on the affected genes may help in routine diagnostics of CNVs.

[1]  Heidi L Rehm,et al.  New approaches to molecular diagnosis. , 2013, JAMA.

[2]  Paul N. Schofield,et al.  Improving ontologies by automatic reasoning and evaluation of logical definitions , 2011, BMC Bioinformatics.

[3]  Damian Smedley,et al.  Phenotypic overlap in the contribution of individual genes to CNV pathogenicity revealed by cross-species computational analysis of single-gene mutations in humans, mice and zebrafish , 2012, Disease Models & Mechanisms.

[4]  Monte Westerfield,et al.  ZFIN, the Zebrafish Model Organism Database: increased support for mutants and transgenics , 2012, Nucleic Acids Res..

[5]  Michael Wigler,et al.  Rare De Novo Variants Associated with Autism Implicate a Large Functional Network of Genes Involved in Formation and Function of Synapses , 2011, Neuron.

[6]  Damian Smedley,et al.  The Human Phenotype Ontology project: linking molecular biology and disease through phenotype data , 2014, Nucleic Acids Res..

[7]  Gautier Koscielny,et al.  The International Mouse Phenotyping Consortium Web Portal, a unified point of access for knockout mice and related phenotyping data , 2013, Nucleic Acids Res..

[8]  F. Dhombres,et al.  Representation of rare diseases in health information systems: The orphanet approach to serve a wide range of end users , 2012, Human mutation.

[9]  Judith A. Blake,et al.  The Mouse Genome Database: integration of and access to knowledge about the laboratory mouse , 2013, Nucleic Acids Res..

[10]  Gary D Bader,et al.  A travel guide to Cytoscape plugins , 2012, Nature Methods.

[11]  M. Hurles,et al.  Copy number variation in human health, disease, and evolution. , 2009, Annual review of genomics and human genetics.

[12]  Christina A. Castellani,et al.  Biological relevance of CNV calling methods using familial relatedness including monozygotic twins , 2014, BMC Bioinformatics.

[13]  Lars Feuk,et al.  The Database of Genomic Variants: a curated collection of structural variation in the human genome , 2013, Nucleic Acids Res..

[14]  H. Lähdesmäki,et al.  The genome-wide landscape of copy number variations in the MUSGEN study provides evidence for a founder effect in the isolated Finnish population , 2013, European Journal of Human Genetics.

[15]  Damian Smedley,et al.  Construction and accessibility of a cross-species phenotype ontology along with gene annotations for biomedical research. , 2013, F1000Research.

[16]  C. Fowler,et al.  Williams-Beuren syndrome. , 2010, The New England journal of medicine.

[17]  L. Feuk,et al.  Diagnostic interpretation of array data using public databases and internet sources , 2012, Human mutation.

[18]  Insuk Lee,et al.  Characterising and Predicting Haploinsufficiency in the Human Genome , 2010, PLoS genetics.

[19]  Caleb Webber,et al.  Phenotype Ontologies and Cross-Species Analysis for Translational Research , 2014, PLoS genetics.

[20]  Steven A. Harvey,et al.  A systematic genome-wide analysis of zebrafish protein-coding gene function , 2013, Nature.

[21]  Philip Resnik,et al.  Using Information Content to Evaluate Semantic Similarity in a Taxonomy , 1995, IJCAI.

[22]  Peter T. Fox,et al.  Recurrent interstitial deletions of proximal 18q: A new syndrome involving expressive speech delay , 2007, American journal of medical genetics. Part A.

[23]  Leslie G Biesecker,et al.  Consensus statement: chromosomal microarray is a first-tier clinical diagnostic test for individuals with developmental disabilities or congenital anomalies. , 2010, American journal of human genetics.

[24]  R. Hochstenbach,et al.  A three-step workflow procedure for the interpretation of array-based comparative genome hybridization results in patients with idiopathic mental retardation and congenital anomalies , 2010, Genetics in Medicine.

[25]  Sebastian Köhler,et al.  Ontological phenotype standards for neurogenetics , 2012, Human mutation.

[26]  A. Valsesia,et al.  The Growing Importance of CNVs: New Insights for Detection and Clinical Interpretation , 2013, Front. Genet..

[27]  D. Conrad,et al.  Global variation in copy number in the human genome , 2006, Nature.

[28]  Carol A. Bocchini,et al.  A new face and new challenges for Online Mendelian Inheritance in Man (OMIM®) , 2011, Human mutation.

[29]  Darlene Riethmaier,et al.  Towards a Universal Clinical Genomics Database: The 2012 International Standards for Cytogenomic Arrays Consortium Meeting , 2013, Human mutation.

[30]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[31]  S. Schwartz,et al.  Variability in interpreting and reporting copy number changes detected by array-based technology in clinical laboratories , 2009, Genetics in Medicine.

[32]  Caleb Webber,et al.  Forging Links between Human Mental Retardation–Associated CNVs and Mouse Gene Knockout Models , 2009, PLoS genetics.

[33]  Caleb Webber,et al.  Accurate Distinction of Pathogenic from Benign CNVs in Mental Retardation , 2010, PLoS Comput. Biol..