Predicting Disease Onset from Mutation Status Using Proband and Relative Data with Applications to Huntington's Disease.

Huntington's disease (HD) is a progressive neurodegenerative disorder caused by an expansion of CAG repeats in the IT15 gene. The age-at-onset (AAO) of HD is inversely related to the CAG repeat length and the minimum length thought to cause HD is 36. Accurate estimation of the AAO distribution based on CAG repeat length is important for genetic counseling and the design of clinical trials. In the Cooperative Huntington's Observational Research Trial (COHORT) study, the CAG repeat length is known for the proband participants. However, whether a family member shares the huntingtin gene status (CAG expanded or not) with the proband is unknown. In this work, we use the expectation-maximization (EM) algorithm to handle the missing huntingtin gene information in first-degree family members in COHORT, assuming that a family member has the same CAG length as the proband if the family member carries a huntingtin gene mutation. We perform simulation studies to examine performance of the proposed method and apply the methods to analyze COHORT proband and family combined data. Our analyses reveal that the estimated cumulative risk of HD symptom onset obtained from the combined data is slightly lower than the risk estimated from the proband data alone.

[1]  A. MacDonald,et al.  Huntington's disease, critical illness insurance and life insurance , 2004 .

[2]  M. Hayden,et al.  Sex-dependent mechanisms for expansions and contractions of the CAG repeat on affected Huntington disease chromosomes. , 1995, American journal of human genetics.

[3]  Jane S. Paulsen,et al.  A new model for prediction of the age of onset and penetrance for Huntington's disease based on CAG length , 2004, Clinical genetics.

[4]  C. McMurray Mechanisms of trinucleotide repeat instability during human development , 2010, Nature Reviews Genetics.

[5]  C A Ross,et al.  When more is less: Pathogenesis of glutamine repeat neurodegenerative diseases , 1995, Neuron.

[6]  C. Ross,et al.  Huntington's disease: from molecular pathogenesis to clinical treatment , 2011, The Lancet Neurology.

[7]  D. Falush,et al.  Measurement of mutational flow implies both a high new-mutation rate for Huntington disease and substantial underascertainment of late-onset cases. , 2001, American journal of human genetics.

[8]  C A Ross,et al.  Correlation between the onset age of Huntington's disease and length of the trinucleotide repeat in IT-15. , 1993, Human molecular genetics.

[9]  E. Ray Dorsey,et al.  Characterization of a Large Group of Individuals with Huntington Disease and Their Relatives Enrolled in the COHORT Study , 2012, PloS one.

[10]  T. Foroud,et al.  Differences in duration of Huntington’s disease based on age at onset , 1999, Journal of neurology, neurosurgery, and psychiatry.

[11]  Hongyu Zhao,et al.  Practical Issues in Building Risk-Predicting Models for Complex Diseases , 2010, Journal of biopharmaceutical statistics.

[12]  P S Harper,et al.  Phenotypic characterization of individuals with 30-40 CAG repeats in the Huntington disease (HD) gene reveals HD cases with 36 repeats and apparently normal elderly individuals with 36-39 repeats. , 1996, American journal of human genetics.

[13]  E D Louis,et al.  Accuracy of family history data on Parkinson’s disease , 2003, Neurology.

[14]  Jane S. Paulsen,et al.  Unified Huntington's disease rating scale: Reliability and consistency , 1996, Movement disorders : official journal of the Movement Disorder Society.

[15]  M. Hayden,et al.  The likelihood of being affected with Huntington disease by a particular age, for a specific CAG size. , 1997, American journal of human genetics.

[16]  T. Louis Finding the Observed Information Matrix When Using the EM Algorithm , 1982 .

[17]  Jane S. Paulsen,et al.  CAG‐repeat length and the age of onset in Huntington disease (HD): A review and validation study of statistical approaches , 2010, American journal of medical genetics. Part B, Neuropsychiatric genetics : the official publication of the International Society of Psychiatric Genetics.

[18]  Karen Marder,et al.  Risk of Parkinson disease in carriers of parkin mutations: estimation using the kin-cohort method. , 2008, Archives of neurology.

[19]  E Ray Dorsey,et al.  Communicating clinical trial results to research participants. , 2008, Archives of neurology.

[20]  J. Ware,et al.  Random-effects models for longitudinal data. , 1982, Biometrics.