Lessons Learned from Whole Exome Sequencing in Multiplex Families Affected by a Complex Genetic Disorder, Intracranial Aneurysm

Genetic risk factors for intracranial aneurysm (IA) are not yet fully understood. Genomewide association studies have been successful at identifying common variants; however, the role of rare variation in IA susceptibility has not been fully explored. In this study, we report the use of whole exome sequencing (WES) in seven densely-affected families (45 individuals) recruited as part of the Familial Intracranial Aneurysm study. WES variants were prioritized by functional prediction, frequency, predicted pathogenicity, and segregation within families. Using these criteria, 68 variants in 68 genes were prioritized across the seven families. Of the genes that were expressed in IA tissue, one gene (TMEM132B) was differentially expressed in aneurysmal samples (n=44) as compared to control samples (n=16) (false discovery rate adjusted p-value=0.023). We demonstrate that sequencing of densely affected families permits exploration of the role of rare variants in a relatively common disease such as IA, although there are important study design considerations for applying sequencing to complex disorders. In this study, we explore methods of WES variant prioritization, including the incorporation of unaffected individuals, multipoint linkage analysis, biological pathway information, and transcriptome profiling. Further studies are needed to validate and characterize the set of variants and genes identified in this study.

[1]  J. Shendure,et al.  A general framework for estimating the relative pathogenicity of human genetic variants , 2014, Nature Genetics.

[2]  Murat Gunel,et al.  Mapping a Mendelian form of intracranial aneurysm to 1p34.3-p36.13. , 2005, American journal of human genetics.

[3]  Daniel L. Koller,et al.  Genome-Wide Association Study of Intracranial Aneurysms Confirms Role of Anril and SOX17 in Disease Risk , 2012, Stroke.

[4]  D. Goldstein,et al.  Uncovering the roles of rare variants in common disease through whole-genome sequencing , 2010, Nature Reviews Genetics.

[5]  Murim Choi,et al.  Susceptibility loci for intracranial aneurysm in European and Japanese populations , 2008, Nature Genetics.

[6]  M. DePristo,et al.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. , 2010, Genome research.

[7]  H. Stefánsson,et al.  The same sequence variant on 9p21 associates with myocardial infarction, abdominal aortic aneurysm and intracranial aneurysm , 2008, Nature Genetics.

[8]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[9]  Gabor T. Marth,et al.  Haplotype-based variant detection from short-read sequencing , 2012, 1207.3907.

[10]  D. Botstein,et al.  Discovering genotypes underlying human phenotypes: past successes for mendelian disease, future approaches for complex disease , 2003, Nature Genetics.

[11]  Daniel L. Koller,et al.  Genome screen in familial intracranial aneurysm , 2009, BMC Medical Genetics.

[12]  Genome-Wide Association Study of Intracranial Aneurysms Confirms Role of Anril and SOX17 in Disease Risk , 2012, Stroke.

[13]  Davis J. McCarthy,et al.  Count-based differential expression analysis of RNA sequencing data using R and Bioconductor , 2013, Nature Protocols.

[14]  Chuong B. Do,et al.  Large-scale meta-analysis of genome-wide association data identifies six new risk loci for Parkinson’s disease , 2014, Nature Genetics.

[15]  B. Nahed,et al.  Molecular Genetic Analysis of Two Large Kindreds With Intracranial Aneurysms Demonstrates Linkage to 11q24-25 and 14q23-31 , 2006, Stroke.

[16]  A. Algra,et al.  Prevalence of unruptured intracranial aneurysms, with emphasis on sex, age, comorbidity, country, and time period: a systematic review and meta-analysis , 2011, The Lancet Neurology.

[17]  I. Adzhubei,et al.  Predicting Functional Effect of Human Missense Mutations Using PolyPhen‐2 , 2013, Current protocols in human genetics.

[18]  Eleftheria Zeggini,et al.  In search of low-frequency and rare variants affecting complex traits , 2013, Human molecular genetics.

[19]  Steven Henikoff,et al.  SIFT: predicting amino acid changes that affect protein function , 2003, Nucleic Acids Res..

[20]  Hugo Y. K. Lam,et al.  Performance comparison of exome DNA sequencing technologies , 2011, Nature Biotechnology.

[21]  P. Ng,et al.  Predicting the effects of frameshifting indels , 2012, Genome Biology.

[22]  C. Anderson,et al.  Greater Rupture Risk for Familial as Compared to Sporadic Unruptured Intracranial Aneurysms , 2009, Stroke.

[23]  B. Nahed,et al.  Molecular Genetic Analysis of Two Large Kindreds With Intracranial Aneurysms Demonstrates Linkage to 11 q 24-25 and 14 q 23-31 , 2006 .

[24]  M. DePristo,et al.  A framework for variation discovery and genotyping using next-generation DNA sequencing data , 2011, Nature Genetics.

[25]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[26]  Richard Durbin,et al.  Fast and accurate long-read alignment with Burrows–Wheeler transform , 2010, Bioinform..

[27]  Alejandro F. Frangi,et al.  Genome-wide association study of intracranial aneurysm identifies three new risk loci , 2010, Nature genetics.

[28]  M. Limburg,et al.  Subarachnoid haemorrhage in first and second degree relatives of patients with subarachnoid haemorrhage , 1995, BMJ.

[29]  Joseph P Broderick,et al.  The Familial Intracranial Aneurysm (FIA) study protocol , 2005, BMC Medical Genetics.

[30]  G. Abecasis,et al.  Merlin—rapid analysis of dense genetic maps using sparse gene flow trees , 2002, Nature Genetics.

[31]  Thomas R. Gingeras,et al.  STAR: ultrafast universal RNA-seq aligner , 2013, Bioinform..

[32]  Yusuke Nakamura,et al.  Common variant near the endothelin receptor type A (EDNRA) gene is associated with intracranial aneurysm risk , 2011, Proceedings of the National Academy of Sciences.

[33]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[34]  V. Feigin,et al.  Worldwide stroke incidence and early case fatality reported in 56 population-based studies: a systematic review , 2009, The Lancet Neurology.

[35]  E. Wijsman,et al.  A statistical framework to guide sequencing choices in pedigrees. , 2014, American journal of human genetics.

[36]  Yuedong Yang,et al.  DDIG-in: discriminating between disease-associated and neutral non-frameshifting micro-indels , 2013, Genome Biology.

[37]  Jacob A. Tennessen,et al.  Evolution and Functional Impact of Rare Coding Variation from Deep Sequencing of Human Exomes , 2012, Science.

[38]  D. Altshuler,et al.  A map of human genome variation from population-scale sequencing , 2010, Nature.

[39]  Helga Thorvaldsdóttir,et al.  Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration , 2012, Briefings Bioinform..

[40]  C. Anderson,et al.  Unruptured intracranial aneurysms in the Familial Intracranial Aneurysm and International Study of Unruptured Intracranial Aneurysms cohorts: differences in multiplicity and location. , 2012, Journal of neurosurgery.

[41]  Craig S. Anderson,et al.  Risk Factors for Subarachnoid Hemorrhage: An Updated Systematic Review of Epidemiological Studies , 2005, Stroke.

[42]  D. Nickerson,et al.  Utilizing Graph Theory to Select the Largest Set of Unrelated Individuals for Genetic Analysis , 2013, Genetic epidemiology.

[43]  Emily H Turner,et al.  Exome sequencing identifies MLL2 mutations as a cause of Kabuki syndrome , 2010, Nature Genetics.

[44]  M. Vihinen,et al.  Performance of mutation pathogenicity prediction methods on missense variants , 2011, Human mutation.

[45]  Paul Theodor Pyl,et al.  HTSeq—a Python framework to work with high-throughput sequencing data , 2014, bioRxiv.

[46]  P. Shannon,et al.  Exome sequencing identifies the cause of a Mendelian disorder , 2009, Nature Genetics.

[47]  Daniel L. Koller,et al.  Genome Screen to Detect Linkage to Common Susceptibility Genes for Intracranial and Aortic Aneurysms , 2009, Stroke.

[48]  H. Hakonarson,et al.  ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data , 2010, Nucleic acids research.

[49]  G. McVean,et al.  Differential confounding of rare and common variants in spatially structured populations , 2011, Nature Genetics.

[50]  D. Milewicz,et al.  Identification of a Chromosome 11q23.2-q24 Locus for Familial Aortic Aneurysm Disease, a Genetically Heterogeneous Disorder , 2001, Circulation.

[51]  G. Abecasis,et al.  Rare-variant association analysis: study designs and statistical tests. , 2014, American journal of human genetics.

[52]  T. Foroud Whole exome sequencing of intracranial aneurysm. , 2013, Stroke.

[53]  G. Rinkel,et al.  Genetics of Intracranial Aneurysms , 2008, Stroke.

[54]  Life Technologies,et al.  A map of human genome variation from population-scale sequencing , 2011 .

[55]  C. Kimchi-Sarfaty,et al.  Understanding the contribution of synonymous mutations to human disease , 2011, Nature Reviews Genetics.