Cox proportional hazards survival regression in haplotype-based association analysis using the Stochastic-EM algorithm

It is now widely recognized that haplotype information inferred from genotypes can be of great interest to better characterize the role of a candidate gene in the etiology of a complex trait in the context of association studies. Several works have recently advocated the simultaneous estimation of haplotype frequencies and haplotype effects in order to get a better efficiency in parameter estimation. Most of the available models can deal with a binary or a quantitative phenotype, but none has yet discussed the application of haplotype-based association analysis to a survival outcome. We describe how the recently proposed Stochastic-EM (SEM) algorithm can be applied to estimate haplotype effects in censored data analysis using a standard Cox proportional hazards formulation. This model has been implemented in the THESIAS software freely available at http://www.genecanvas.org

[1]  D. Tregouet,et al.  SELPLG Gene Polymorphisms in Relation to Plasma SELPLG Levels and Coronary Artery Disease , 2003, Annals of human genetics.

[2]  Lue Ping Zhao,et al.  A method for the assessment of disease associations with single-nucleotide polymorphism haplotypes and environmental variables in case-control studies. , 2003, American journal of human genetics.

[3]  Edward H. Ip,et al.  Stochastic EM: method and application , 1996 .

[4]  D. Cox,et al.  Analysis of Survival Data. , 1986 .

[5]  D. Trégouët,et al.  Investigation of the human ANP gene in type 1 diabetic nephropathy: case-control and follow-up studies. , 2004, Diabetes.

[6]  Jean-Louis Golmard,et al.  Specific haplotypes of the P-selectin gene are associated with myocardial infarction. , 2002, Human molecular genetics.

[7]  G. Satten,et al.  Inference on haplotype effects in case-control studies using unphased genotype data. , 2003, American journal of human genetics.

[8]  B Rosner,et al.  Regression calibration method for correcting measurement-error bias in nutritional epidemiology. , 1997, The American journal of clinical nutrition.

[9]  R. Prentice,et al.  Regression calibration in failure time regression. , 1997, Biometrics.

[10]  A. Zwinderman,et al.  Haplotype analysis of the CETP gene: not TaqIB, but the closely linked -629C-->A polymorphism and a novel promoter variant are independently associated with CETP concentration. , 2003, Human molecular genetics.

[11]  D. Lin,et al.  Haplotype‐based association analysis in cohort studies of unrelated individuals , 2004, Genetic epidemiology.

[12]  T. Nakamura,et al.  Proportional hazards model with covariates subject to measurement error. , 1992, Biometrics.

[13]  Peter H. Westfall,et al.  Testing Association of Statistically Inferred Haplotypes with Discrete and Continuous Traits in Samples of Unrelated Individuals , 2002, Human Heredity.

[14]  G. Celeux,et al.  Asymptotic properties of a stochastic EM algorithm for estimating mixing proportions , 1993 .

[15]  D. Tregouet,et al.  Platelet-activating factor-acetylhydrolase and PAF-receptor gene haplotypes in relation to future cardiovascular event in patients with coronary artery disease. , 2004, Human molecular genetics.

[16]  D. Tregouet,et al.  A new algorithm for haplotype‐based association analysis: the Stochastic‐EM algorithm , 2004, Annals of human genetics.

[17]  P. Donnelly,et al.  A new statistical method for haplotype reconstruction from population data. , 2001, American journal of human genetics.

[18]  Zhaohui S. Qin,et al.  Bayesian haplotype inference for multiple linked single-nucleotide polymorphisms. , 2002, American journal of human genetics.

[19]  A. Zwinderman,et al.  Estimation of Multilocus Haplotype Effects Using Weighted Penalised Log‐Likelihood: Analysis of Five Sequence Variations at the Cholesteryl Ester Transfer Protein Gene Locus , 2003, Annals of human genetics.

[20]  D. Schaid,et al.  Score tests for association between traits and haplotypes when linkage phase is ambiguous. , 2002, American journal of human genetics.

[21]  G. Lathrop,et al.  High-resolution genetic mapping of the ACE-linked QTL influencing circulating ACE activity , 2002, European Journal of Human Genetics.

[22]  R S Judson,et al.  Complex promoter and coding region beta 2-adrenergic receptor haplotypes alter receptor expression and predict in vivo responsiveness. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[23]  David R. Cox,et al.  Regression models and life tables (with discussion , 1972 .

[24]  N. Laird,et al.  Estimation and Tests of Haplotype-Environment Interaction when Linkage Phase Is Ambiguous , 2003, Human Heredity.

[25]  Daniel O. Stram,et al.  Modeling and E-M Estimation of Haplotype-Specific Relative Risks from Genotype Data for a Case-Control Study of Unrelated Individuals , 2003, Human Heredity.