A unified stepwise regression procedure for evaluating the relative effects of polymorphisms within a gene using case/control or family data: application to HLA in type 1 diabetes.

A stepwise logistic-regression procedure is proposed for evaluation of the relative importance of variants at different sites within a small genetic region. By fitting statistical models with main effects, rather than modeling the full haplotype effects, we generate tests, with few degrees of freedom, that are likely to be powerful for detecting primary etiological determinants. The approach is applicable to either case/control or nuclear-family data, with case/control data modeled via unconditional and family data via conditional logistic regression. Four different conditioning strategies are proposed for evaluation of effects at multiple, closely linked loci when family data are used. The first strategy results in a likelihood that is equivalent to analysis of a matched case/control study with each affected offspring matched to three pseudocontrols, whereas the second strategy is equivalent to matching each affected offspring with between one and three pseudocontrols. Both of these strategies require you be able to infer parental phase (i.e., those haplotypes present in the parents). Families in which phase cannot be determined must be discarded, which can considerably reduce the effective size of a data set, particularly when large numbers of loci that are not very polymorphic are being considered. Therefore, a third strategy is proposed in which knowledge of parental phase is not required, which allows those families with ambiguous phase to be included in the analysis. The fourth and final strategy is to use conditioning method 2 when parental phase can be inferred and to use conditioning method 3 otherwise. The methods are illustrated using nuclear-family data to evaluate the contribution of loci in the HLA region to the development of type 1 diabetes.

[1]  J. Todd,et al.  A correlation between the relative predisposition of MHC class II alleles to type 1 diabetes and the structure of their proteins. , 2001, Human molecular genetics.

[2]  J. Todd,et al.  The HLA-DPB1--associated component of the IDDM1 and its relationship to the major loci HLA-DQB1, -DQA1, and -DRB1. , 2001, Diabetes.

[3]  J. Todd,et al.  Conditional linkage disequilibrium analysis of a complex disease superlocus, IDDM1 in the HLA region, reveals the presence of independent modifying gene effects influencing the type 1 diabetes risk encoded by the major HLA-DQB1, -DRB1 disease loci. , 2001, Human molecular genetics.

[4]  H. Li,et al.  A permutation procedure for the haplotype method for identification of disease‐predisposing variants , 2001 .

[5]  N Risch,et al.  Complex HLA-DR and -DQ interactions confer risk of narcolepsy-cataplexy in three ethnic groups. , 2001, American journal of human genetics.

[6]  J. Todd,et al.  Major factors influencing linkage disequilibrium by analysis of different chromosome regions in distinct populations: demography, chromosome recombination frequency and selection. , 2000, Human molecular genetics.

[7]  J. Ilonen,et al.  Non-class II HLA gene associated with type 1 diabetes maps to the 240-kb region near HLA-B. , 2000, Diabetes.

[8]  D. Clayton,et al.  Unbiased application of the transmission/disequilibrium test to multilocus haplotypes. , 2000, American journal of human genetics.

[9]  J. Todd,et al.  Adaptation of the extended transmission/disequilibrium test to distinguish disease associations of multiple loci: the Conditional Extended Transmission/Disequilibrium Test , 2000 .

[10]  David E. Booth,et al.  Analysis of Incomplete Multivariate Data , 2000, Technometrics.

[11]  P. Brunetti,et al.  Microsatellite polymorphism of the MHC class I chain-related (MIC-A and MIC-B) genes marks the risk for autoimmune Addison's disease. , 1999, The Journal of clinical endocrinology and metabolism.

[12]  D. Clayton,et al.  A generalization of the transmission/disequilibrium test for uncertain-haplotype transmission. , 1999, American journal of human genetics.

[13]  H. Inoko,et al.  The critical region for Behçet disease in the human major histocompatibility complex is reduced to a 46-kb segment centromeric of HLA-B, by association analysis using refined microsatellite mapping. , 1999, American journal of human genetics.

[14]  J. Todd,et al.  The predisposition to type 1 diabetes linked to the human leukocyte antigen complex includes at least one non-class II gene. , 1999, American journal of human genetics.

[15]  M Knapp,et al.  The transmission/disequilibrium test and parental-genotype reconstruction: the reconstruction-combined transmission/ disequilibrium test. , 1999, American journal of human genetics.

[16]  S. Beck,et al.  Sequence organisation of the class II region of the human MHC , 1999, Immunological reviews.

[17]  B S Weir,et al.  Tests for linkage and association in nuclear families. , 1997, American journal of human genetics.

[18]  A. Valdés,et al.  Detecting disease-predisposing variants: the haplotype method. , 1997, American journal of human genetics.

[19]  D. Schaid General score tests for associations of genetic markers with disease using cases and their parents , 1996, Genetic epidemiology.

[20]  G. Thomson Mapping disease genes: family-based association studies. , 1995, American journal of human genetics.

[21]  Bryan Langholz,et al.  Variation in HLA‐associated risks of childhood insulin‐dependent diabetes in the finnish population: II. Haplotype effects , 1995, Genetic epidemiology.

[22]  G. Thomson,et al.  Homozygous parent affected sib pair method for detecting disease predisposing variants: Application to insulin dependent diabetes mellitus , 1993, Genetic epidemiology.

[23]  K. Liang,et al.  On estimating HLA/disease association with application to a study of aplastic anemia. , 1991, Biometrics.

[24]  M. Baur,et al.  Genetic heterogeneity, modes of inheritance, and risk estimates for a joint study of Caucasians with insulin-dependent diabetes mellitus. , 1988, American journal of human genetics.

[25]  K F Hirji,et al.  Exact inference for matched case-control studies. , 1988, Biometrics.

[26]  Nitin R. Patel,et al.  Computing Distributions for Exact Logistic Regression , 1987 .

[27]  C. Falk,et al.  Haplotype relative risks: an easy reliable way to construct a proper control sample for risk calculations , 1987, Annals of human genetics.

[28]  H. White Maximum Likelihood Estimation of Misspecified Models , 1982 .

[29]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[30]  H. L. Le Roy,et al.  Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability; Vol. IV , 1969 .

[31]  P. J. Huber The behavior of maximum likelihood estimates under nonstandard conditions , 1967 .