Bayesian Genetic Association Test when Secondary Phenotypes Are Available Only in the Case Group

In many case-control genetic association studies, a secondary phenotype that may have common genetic factors with disease status can be identified. When information on the secondary phenotype is available only for the case group due to cost and different data sources, a fitting linear regression model ignoring supplementary phenotype data may provide limited knowledge regarding genetic association. We set up a joint model and use a Bayesian framework to estimate and test the effect of genetic covariates on disease status considering the secondary phenotype as an instrumental variable. The application of our proposed procedure is demonstrated through the rheumatoid arthritis data provided by the 16th Genetic Analysis Workshop.

[1]  C. Amos,et al.  A genome-wide association scan for rheumatoid arthritis data by Hotelling ’ s T 2 tests , 2009 .

[2]  Richard F. Gunst,et al.  Applied Regression Analysis , 1999, Technometrics.

[3]  Wei Chen,et al.  Refining the complex rheumatoid arthritis phenotype based on specificity of the HLA-DRB1 shared epitope for antibodies to citrullinated proteins. , 2005, Arthritis and rheumatism.

[4]  Jinfeng Xu,et al.  Bayes Factor Based on the Trend Test Incorporating Hardy–Weinberg Disequilibrium: More Power to Detect Genetic Association , 2012, Annals of human genetics.

[5]  Christopher I Amos,et al.  Data for Genetic Analysis Workshop 16 Problem 1, association analysis of rheumatoid arthritis data , 2009, BMC proceedings.

[6]  Steven J. Schrodi,et al.  A missense single-nucleotide polymorphism in a gene encoding a protein tyrosine phosphatase (PTPN22) is associated with rheumatoid arthritis. , 2004, American journal of human genetics.

[7]  G. Casella,et al.  Reconciling Bayesian and Frequentist Evidence in the One-Sided Testing Problem , 1987 .

[8]  D. Andrews The Large Sample Correspondence between Classical Hypothesis Tests and Bayesian Posterior Odds Tests , 1994 .

[9]  Alan Agresti,et al.  Categorical Data Analysis , 2003 .

[10]  Arnab Maity,et al.  Testing in semiparametric models with interaction, with applications to gene-environment interactions. , 2009, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[11]  R. Westhovens,et al.  Technical and diagnostic performance of 6 assays for the measurement of citrullinated protein/peptide antibodies in the diagnosis of rheumatoid arthritis. , 2007, Clinical chemistry.

[12]  M. Stephens,et al.  Bayesian statistical methods for genetic association studies , 2009, Nature Reviews Genetics.

[13]  L. A. Goodman,et al.  Latent Structure Analysis of a Set of Multidimensional Contingency Tables , 1984 .

[14]  Simon C. Potter,et al.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls , 2007, Nature.

[15]  Susan E. Hodge,et al.  Likelihood formulation of parent-of-origin effects on segregation analysis, including ascertainment. , 2002, American journal of human genetics.

[16]  A. Barton,et al.  The epidemiology of rheumatoid arthritis and the use of linkage and association studies to identify disease genes , 2006 .

[17]  S. Tishkoff,et al.  SNP ascertainment bias in population genetic analyses: Why it is important, and how to correct it , 2013, BioEssays : news and reviews in molecular, cellular and developmental biology.

[18]  Yusuke Nakamura,et al.  Functional haplotypes of PADI4, encoding citrullinating enzyme peptidylarginine deiminase 4, are associated with rheumatoid arthritis , 2003, Nature Genetics.

[19]  J. Berger,et al.  Testing a Point Null Hypothesis: The Irreconcilability of P Values and Evidence , 1987 .

[20]  M. V. van Leeuwen,et al.  The prognostic value of anti-cyclic citrullinated peptide antibody in patients with recent-onset rheumatoid arthritis. , 2000, Arthritis and rheumatism.

[21]  N. Chatterjee,et al.  Powerful multilocus tests of genetic association in the presence of gene-gene and gene-environment interactions. , 2006, American journal of human genetics.

[22]  A genome-wide association scan for rheumatoid arthritis data by Hotelling's T2 tests , 2009, BMC proceedings.

[23]  D. Falconer The inheritance of liability to certain diseases, estimated from the incidence among relatives , 1965 .

[24]  Nilanjan Chatterjee,et al.  Semiparametric maximum likelihood estimation exploiting gene-environment independence in case-control studies , 2005 .

[25]  J. Gastwirth,et al.  Robust genomic control for association studies. , 2006, American journal of human genetics.

[26]  Hongzhe Li,et al.  A Gaussian copula approach for the analysis of secondary phenotypes in case-control genetic association studies. , 2012, Biostatistics.

[27]  P. Sasieni From genotypes to genes: doubling the sample size. , 1997, Biometrics.

[28]  Colin O. Wu,et al.  A Joint Regression Analysis for Genetic Association Studies with Outcome Stratified Samples , 2013, Biometrics.