Haplotypes with Copy Number and Single Nucleotide Polymorphisms in CYP2A6 Locus Are Associated with Smoking Quantity in a Japanese Population

Smoking is a major public health problem, but the genetic factors associated with smoking behaviors are not fully elucidated. Here, we have conducted an integrated genome-wide association study to identify common copy number polymorphisms (CNPs) and single nucleotide polymorphisms (SNPs) associated with the number of cigarettes smoked per day (CPD) in Japanese smokers ( = 17,158). Our analysis identified a common CNP with a strong effect on CPD (rs8102683; ) in the 19q13 region, encompassing the CYP2A6 locus. After adjustment for the associated CNP, we found an additional associated SNP (rs11878604; ) located 30 kb downstream of the CYP2A6 gene. Imputation of the CYP2A6 locus revealed that haplotypes underlying the CNP and the SNP corresponded to classical, functional alleles of CYP2A6 gene that regulate nicotine metabolism and explained 2% of the phenotypic variance of CPD (ANOVA -test ). These haplotypes were also associated with smoking-related diseases, including lung cancer, chronic obstructive pulmonary disease and arteriosclerosis obliterans.

[1]  Yusuke Nakamura,et al.  PlatinumCNV: A Bayesian Gaussian mixture model for genotyping copy number polymorphisms using SNP array signal intensity data , 2011, Genetic epidemiology.

[2]  Santhosh Girirajan,et al.  Human copy number variation and complex genetic disease. , 2011, Annual review of genetics.

[3]  T. Ninomiya,et al.  Smoking cessation improves mortality in Japanese men: the Hisayama study , 2011, Tobacco Control.

[4]  Yusuke Nakamura,et al.  Identification of Nine Novel Loci Associated with White Blood Cell Subtypes in a Japanese Population , 2011, PLoS genetics.

[5]  Suzanne M. Leal,et al.  A Novel Adaptive Method for the Analysis of Next-Generation Sequencing Data to Detect Complex Trait Associations with Rare Variants Due to Gene Main Effects and Interactions , 2010, PLoS genetics.

[6]  M. DePristo,et al.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. , 2010, Genome research.

[7]  Yusuke Nakamura,et al.  Establishment of a standardized system to perform population structure analyses with limited sample size or with different sets of SNP genotypes , 2010, Journal of Human Genetics.

[8]  Tariq Ahmad,et al.  Meta-analysis and imputation refines the association of 15q25 with smoking quantity , 2010, Nature Genetics.

[9]  Ming D. Li,et al.  Genome-wide meta-analyses identify multiple loci associated with smoking behavior , 2010, Nature Genetics.

[10]  C. Gieger,et al.  Sequence variants at CHRNB3–CHRNA6 and CYP2A6 affect smoking behavior , 2010, Nature Genetics.

[11]  Jake K. Byrnes,et al.  Genome-wide association study of copy number variation in 16,000 cases of eight common diseases and 3,000 shared controls , 2010 .

[12]  Tomas W. Fitzgerald,et al.  Origins and functional impact of copy number variation in the human genome , 2010, Nature.

[13]  E. Zeggini,et al.  An Evaluation of Statistical Approaches to Rare Variant Analysis in Genetic Association Studies , 2009, Genetic epidemiology.

[14]  Jake K. Byrnes,et al.  Genome-wide association study of copy number variation in 16,000 cases of eight common diseases and 3,000 shared controls , 2010, Nature.

[15]  C. Gieger,et al.  Sequence variants at CHRNB 3 – CHRNA 6 and CYP 2 A 6 affect smoking behavior , 2010 .

[16]  Judy H. Cho,et al.  Finding the missing heritability of complex diseases , 2009, Nature.

[17]  Yusuke Nakamura,et al.  CYP2D6 genotyping for functional-gene dosage analysis by allele copy number detection. , 2009, Clinical chemistry.

[18]  B. Browning,et al.  A unified approach to genotype imputation and haplotype-phase inference for large data sets of trios and unrelated individuals. , 2009, American journal of human genetics.

[19]  S. Browning,et al.  A Groupwise Association Test for Rare Mutations Using a Weighted Sum Statistic , 2009, PLoS genetics.

[20]  Yusuke Nakamura,et al.  Japanese population structure, based on SNP genotypes from 7003 individuals compared to other ethnic groups: effects on population-based association studies. , 2008, American journal of human genetics.

[21]  Joshua M. Korn,et al.  Integrated genotype calling and association analysis of SNPs, common copy number polymorphisms and rare CNVs , 2008, Nature Genetics.

[22]  S. Leal,et al.  Methods for detecting associations with rare variants for common diseases: application to analysis of sequence data. , 2008, American journal of human genetics.

[23]  Tomas W. Fitzgerald,et al.  A robust statistical method for case-control association testing with copy number variation , 2008, Nature Genetics.

[24]  K. Mossman The Wellcome Trust Case Control Consortium, U.K. , 2008 .

[25]  Zhaohui S. Qin,et al.  A second generation human haplotype map of over 3.1 million SNPs , 2007, Nature.

[26]  D. Clayton,et al.  A Method to Address Differential Bias in Genotyping in Large-Scale Association Studies , 2007, PLoS genetics.

[27]  D. Reich,et al.  Principal components analysis corrects for stratification in genome-wide association studies , 2006, Nature Genetics.

[28]  Pardis C Sabeti,et al.  Positive Natural Selection in the Human Lineage , 2006, Science.

[29]  Pardis C Sabeti,et al.  Positive Natural Selection in the Human , 2006 .

[30]  Yusuke Nakamura,et al.  [BioBank Japan project]. , 2005, Nihon rinsho. Japanese journal of clinical medicine.

[31]  G. Swan,et al.  Nicotine metabolism: the impact of CYP2A6 on estimates of additive genetic influence , 2005, Pharmacogenetics and genomics.

[32]  Laurent Bodin,et al.  Determination of Cytochrome P450 2D6 (CYP2D6) Gene Copy Number by Real-Time Quantitative PCR , 2005, Journal of biomedicine & biotechnology.

[33]  J. Yokota,et al.  Evaluation of CYP2A6 genetic polymorphisms as determinants of smoking behavior and tobacco-related lung cancer risk in male Japanese smokers. , 2004, Carcinogenesis.

[34]  R. Tyndale,et al.  Ethnic variation in CYP2A6 and association of genetically slow nicotine metabolism and smoking in adult Caucasians. , 2004, Pharmacogenetics.

[35]  Daniel O. Stram,et al.  Modeling and E-M Estimation of Haplotype-Specific Relative Risks from Genotype Data for a Case-Control Study of Unrelated Individuals , 2003, Human Heredity.

[36]  Ming D. Li,et al.  A meta-analysis of estimated genetic and environmental effects on smoking behavior in male and female adult twins. , 2003, Addiction.

[37]  K. Roeder,et al.  Genomic Control for Association Studies , 1999, Biometrics.

[38]  M. Neale,et al.  The Genetics of Smoking Initiation and Quantity Smoked in Dutch Adolescent and Young Adult Twins , 1999, Behavior genetics.

[39]  G. McLachlan,et al.  The EM algorithm and extensions , 1996 .

[40]  C. Quesenberry,et al.  Concentration bands for uniformity plots , 1980 .

[41]  L. Baum,et al.  A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains , 1970 .

[42]  E. Dempster,et al.  Heritability of Threshold Characters. , 1950, Genetics.