Length of Uninterrupted CAG, Independent of Polyglutamine Size, Results in Increased Somatic Instability, Hastening Onset of Huntington Disease.

Huntington disease (HD) is caused by a CAG repeat expansion in the huntingtin (HTT) gene. Although the length of this repeat is inversely correlated with age of onset (AOO), it does not fully explain the variability in AOO. We assessed the sequence downstream of the CAG repeat in HTT [reference: (CAG)n-CAA-CAG], since variants within this region have been previously described, but no study of AOO has been performed. These analyses identified a variant that results in complete loss of interrupting (LOI) adenine nucleotides in this region [(CAG)n-CAG-CAG]. Analysis of multiple HD pedigrees showed that this LOI variant is associated with dramatically earlier AOO (average of 25 years) despite the same polyglutamine length as in individuals with the interrupting penultimate CAA codon. This LOI allele is particularly frequent in persons with reduced penetrance alleles who manifest with HD and increases the likelihood of presenting clinically with HD with a CAG of 36-39 repeats. Further, we show that the LOI variant is associated with increased somatic repeat instability, highlighting this as a significant driver of this effect. These findings indicate that the number of uninterrupted CAG repeats, which is lengthened by the LOI, is the most significant contributor to AOO of HD and is more significant than polyglutamine length, which is not altered in these individuals. In addition, we identified another variant in this region, where the CAA-CAG sequence is duplicated, which was associated with later AOO. Identification of these cis-acting modifiers have potentially important implications for genetic counselling in HD-affected families.

[1]  M. Hayden,et al.  Huntingtin Haplotypes Provide Prioritized Target Panels for Allele-specific Silencing in Huntington Disease Patients of European Ancestry. , 2015, Molecular therapy : the journal of the American Society of Gene Therapy.

[2]  M. Hayden,et al.  High frequency of intermediate alleles on huntington disease‐associated haplotypes in British Columbia's general population , 2013, American journal of medical genetics. Part B, Neuropsychiatric genetics : the official publication of the International Society of Psychiatric Genetics.

[3]  M. Hayden,et al.  Somatic and gonadal mosaicism of the Huntington disease gene CAG repeat in brain and sperm , 1994, Nature Genetics.

[4]  Alan M. Kwong,et al.  Next-generation genotype imputation service and methods , 2016, Nature Genetics.

[5]  A. Destée,et al.  Are interrupted SCA2 CAG repeat expansions responsible for parkinsonism? , 2007, Neurology.

[6]  Jane S. Paulsen,et al.  A new model for prediction of the age of onset and penetrance for Huntington's disease based on CAG length , 2004, Clinical genetics.

[7]  G. van den Engh,et al.  Contribution of DNA sequence and CAG size to mutation frequencies of intermediate alleles for Huntington disease: evidence from single sperm analyses. , 1997, Human molecular genetics.

[8]  Ryan L. Collins,et al.  Variation across 141,456 human exomes and genomes reveals the spectrum of loss-of-function intolerance across human protein-coding genes , 2019, bioRxiv.

[9]  B. Kremer,et al.  Somatic mosaicism in sperm is associated with intergenerational (CAG)n changes in Huntington disease. , 1995, Human molecular genetics.

[10]  M. Hayden,et al.  Familial influence on age of onset among siblings with Huntington disease. , 2001, American journal of medical genetics.

[11]  M. Hayden,et al.  Huntington disease reduced penetrance alleles occur at high frequency in the general population , 2016, Neurology.

[12]  M. Hayden,et al.  Marked differences in neurochemistry and aggregates despite similar behavioural and neuropathological features of Huntington disease in the full-length BACHD and YAC128 mice. , 2012, Human molecular genetics.

[13]  B. Frey,et al.  Whole genome sequencing resource identifies 18 new candidate genes for autism spectrum disorder , 2017, Nature Neuroscience.

[14]  Enrico Amico,et al.  Identification of genetic variants associated with Huntington's disease progression: a genome-wide association study. , 2017, The Lancet. Neurology.

[15]  M. Hegde,et al.  Null alleles at the Huntington disease locus: implications for diagnostics and CAG repeat instability. , 2000, Genetic testing.

[16]  Wenya Linda Bi,et al.  Triplet repeat mutation length gains correlate with cell-type specific vulnerability in Huntington disease brain. , 2007, Human molecular genetics.

[17]  David J. Arenillas,et al.  A SNP in the HTT promoter alters NF-κB binding and is a bidirectional genetic modifier of Huntington disease , 2015, Nature Neuroscience.

[18]  Konrad Scheffler,et al.  ExpansionHunter: a sequence-graph-based tool to analyze variation in short tandem repeat regions , 2019, bioRxiv.

[19]  D. Sillence,et al.  Increased instability of intermediate alleles in families with sporadic Huntington disease compared to similar sized intermediate alleles in the general population. , 1995, Human molecular genetics.

[20]  Edith T. Lopez,et al.  Mismatch Repair Genes Mlh1 and Mlh3 Modify CAG Instability in Huntington's Disease Mice: Genome-Wide and Candidate Approaches , 2013, PLoS genetics.

[21]  S. Yu,et al.  Polymorphisms in the CAG repeat – a source of error in Huntington disease DNA testing , 2000, Clinical genetics.

[22]  Y. Agid,et al.  Sequence analysis of the CCG polymorphic region adjacent to the CAG triplet repeat of the HD gene in normal and HD chromosomes. , 1995, Journal of medical genetics.

[23]  M. Hayden,et al.  Unstable familial transmissions of Huntington disease alleles with 27–35 CAG repeats (intermediate alleles) , 2009, American journal of medical genetics. Part B, Neuropsychiatric genetics : the official publication of the International Society of Psychiatric Genetics.

[24]  Jane S. Paulsen,et al.  Identification of Genetic Factors that Modify Clinical Onset of Huntington’s Disease , 2015, Cell.

[25]  Elizabeth Evans,et al.  Dramatic tissue-specific mutation length increases are an early molecular event in Huntington disease pathogenesis. , 2003, Human molecular genetics.

[26]  C. Broeckhoven,et al.  CAG repeat expansion in the TATA box-binding protein gene causes autosomal dominant cerebellar ataxia. , 2001, Brain : a journal of neurology.

[27]  Chris Shaw,et al.  Detection of long repeat expansions from PCR-free whole-genome sequence data , 2016, bioRxiv.

[28]  T. Ashizawa,et al.  An interrupted 34-CAG repeat SCA-2 allele in patients with sporadic spinocerebellar ataxia , 2000, Neurology.

[29]  G. van Ommen,et al.  New problems in testing for Huntington's disease: the issue of intermediate and reduced penetrance alleles , 2001, Journal of medical genetics.

[30]  Karen Marder,et al.  Venezuelan kindreds reveal that genetic and environmental factors modulate Huntington's disease age of onset. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[31]  Audrey E Hendricks,et al.  Somatic expansion of the Huntington's disease CAG repeat in the brain is associated with an earlier age of disease onset. , 2009, Human molecular genetics.

[32]  I. Kanazawa,et al.  SCA17, a novel autosomal dominant cerebellar ataxia caused by an expanded polyglutamine in TATA-binding protein. , 2001, Human molecular genetics.

[33]  Peter Holmans,et al.  The HTT CAG-Expansion Mutation Determines Age at Death but Not Disease Duration in Huntington Disease. , 2016, American journal of human genetics.

[34]  M. Hayden,et al.  CAG size-specific risk estimates for intermediate allele repeat instability in Huntington disease , 2013, Journal of Medical Genetics.

[35]  M. MacDonald,et al.  Huntington's disease: the case for genetic modifiers , 2009, Genome Medicine.