Genome-wide association study of chronic sputum production implicates loci involved in mucus production and infection

Background Chronic sputum production impacts on quality of life and is a feature of many respiratory diseases. Identification of the genetic variants associated with chronic sputum production in a disease agnostic sample could improve understanding of its causes and identify new molecular targets for treatment. Methods We conducted a genome-wide association study (GWAS) of chronic sputum production in UK Biobank. Signals meeting genome-wide significance (p<5×10−8) were investigated in additional independent studies, were fine-mapped and putative causal genes identified by gene expression analysis. GWASs of respiratory traits were interrogated to identify whether the signals were driven by existing respiratory disease among the cases and variants were further investigated for wider pleiotropic effects using phenome-wide association studies (PheWASs). Results From a GWAS of 9714 cases and 48 471 controls, we identified six novel genome-wide significant signals for chronic sputum production including signals in the human leukocyte antigen (HLA) locus, chromosome 11 mucin locus (containing MUC2, MUC5AC and MUC5B) and FUT2 locus. The four common variant associations were supported by independent studies with a combined sample size of up to 2203 cases and 17 627 controls. The mucin locus signal had previously been reported for association with moderate-to-severe asthma. The HLA signal was fine-mapped to an amino acid change of threonine to arginine (frequency 36.8%) in HLA-DRB1 (HLA-DRB1*03:147). The signal near FUT2 was associated with expression of several genes including FUT2, for which the direction of effect was tissue dependent. Our PheWAS identified a wide range of associations including blood cell traits, liver biomarkers, infections, gastrointestinal and thyroid-associated diseases, and respiratory disease. Conclusions Novel signals at the FUT2 and mucin loci suggest that mucin fucosylation may be a driver of chronic sputum production even in the absence of diagnosed respiratory disease and provide genetic support for this pathway as a target for therapeutic intervention. Genome-wide association study in UK Biobank identifies six novel loci associated with chronic sputum production at genome-wide significance in a disease agnostic population. These include a FUT2 locus, highlighting a possible target for drug development. https://bit.ly/3IRVJeT

[1]  M. Tobin,et al.  Deep-PheWAS: a pipeline for phenotype generation and association analysis for phenome-wide association studies. , 2022, medRxiv.

[2]  Camille M. Moore,et al.  Nasal airway transcriptome-wide association study of asthma reveals genetically driven mucus pathobiology , 2022, Nature Communications.

[3]  Eurie L. Hong,et al.  Whole-genome sequencing reveals host factors underlying critical COVID-19 , 2022, Nature.

[4]  Emma L Adams,et al.  Extended Cohort for E-health, Environment and DNA (EXCEED) COVID-19 focus , 2021, Wellcome Open Research.

[5]  Laurent F. Thomas,et al.  Genome-wide association study of susceptibility to hospitalised respiratory infections , 2021, Wellcome open research.

[6]  M. Tobin,et al.  Improving ethnic diversity in respiratory genomics research , 2021, European Respiratory Journal.

[7]  Mattia G. Bergomi,et al.  Mapping the human genetic architecture of COVID-19 , 2021, Nature.

[8]  C. Sudlow,et al.  CovidLife: a resource to understand mental health, well-being and behaviour during the COVID-19 pandemic in the UK , 2021, Wellcome open research.

[9]  E. Hoffman,et al.  Airway mucin MUC5AC and MUC5B concentrations and the initiation and progression of chronic obstructive pulmonary disease: an analysis of the SPIROMICS cohort. , 2021, The Lancet. Respiratory medicine.

[10]  J. Baines,et al.  The role of the blood group-related glycosyltransferases FUT2 and B4GALNT2 in susceptibility to infectious disease. , 2021, International journal of medical microbiology : IJMM.

[11]  Trevor Hastie,et al.  Genetics of 35 blood and urine biomarkers in the UK Biobank , 2020, Nature Genetics.

[12]  Gautier Koscielny,et al.  Open Targets Genetics: systematic identification of trait-associated genes using large-scale genetics and functional genomics , 2020, Nucleic Acids Res..

[13]  G. Koppelman,et al.  The genetics of asthma and the promise of genomics-guided drug target discovery. , 2020, The Lancet. Respiratory medicine.

[14]  Trends and risk factors of mortality and disability adjusted life years for chronic respiratory diseases from 1990 to 2017: systematic analysis for the Global Burden of Disease Study 2017 , 2020, BMJ.

[15]  D. Nicolae,et al.  Shared and distinct genetic risk factors for childhood-onset and adult-onset asthma: genome-wide and transcriptome-wide studies. , 2019, The Lancet. Respiratory medicine.

[16]  Vilmundur Gudnason,et al.  Genome-Wide Association Study of Susceptibility to Idiopathic Pulmonary Fibrosis , 2019, bioRxiv.

[17]  P. Visscher,et al.  Genome-wide association study of medication-use and associated disease in the UK Biobank , 2019, Nature Communications.

[18]  L. Wain,et al.  Moderate-to-severe asthma in individuals of European ancestry: a genome-wide association study , 2019, The Lancet. Respiratory medicine.

[19]  G. Guidicelli,et al.  Characterization of the novel HLA‐DRB1*03:147 allele by sequencing‐based typing , 2018, HLA.

[20]  Benjamin B. Sun,et al.  New genetic signals for lung function highlight pathways and chronic obstructive pulmonary disease associations across multiple ancestries. , 2018, Nature Genetics.

[21]  Dajiang J. Liu,et al.  Association studies of up to 1.2 million individuals yield new insights into the genetic etiology of tobacco and alcohol use , 2018, Nature Genetics.

[22]  D. Gudbjartsson,et al.  Genome-wide association meta-analysis yields 20 loci associated with gallstone disease , 2018, Nature Communications.

[23]  Ivana V. Yang,et al.  FUT2 Variants Confer Susceptibility to Familial Otitis Media. , 2018, American journal of human genetics.

[24]  C. Cooper,et al.  FUT2 Genetic Variants and Reported Respiratory and Gastrointestinal Illnesses During Infancy , 2018, The Journal of infectious diseases.

[25]  Tanya M. Teslovich,et al.  Genetics of Blood Lipids Among ~300,000 Multi-Ethnic Participants of the Million Veteran Program , 2018, Nature Genetics.

[26]  Christopher E Brightling,et al.  Cohort Profile Cohort Profile : Extended Cohort for E-health , Environment and DNA ( EXCEED ) , 2019 .

[27]  Kewu Huang,et al.  Management of airway mucus hypersecretion in chronic airway inflammatory disease: Chinese expert consensus (English edition) , 2018, International journal of chronic obstructive pulmonary disease.

[28]  N. Risch,et al.  A large electronic health record-based genome-wide study of serum lipids , 2018, Nature Genetics.

[29]  Manuel A. R. Ferreira,et al.  Multiancestry association study identifies new asthma risk loci that colocalize with immune cell enhancer marks , 2017, Nature Genetics.

[30]  Lars G Fritsche,et al.  Efficiently controlling for case-control imbalance and sample relatedness in large-scale genetic association studies , 2017, Nature Genetics.

[31]  Nicola J. Rinaldi,et al.  Genetic effects on gene expression across human tissues , 2017, Nature.

[32]  N. Allen,et al.  Occupational self-coding and automatic recording (OSCAR): a novel web-based tool to collect and code lifetime job histories in large population-based studies. , 2017, Scandinavian journal of work, environment & health.

[33]  G. Chandak,et al.  GWAS identifies population-specific new regulatory variants in FUT6 associated with plasma B12 concentrations in Indians , 2017, Human molecular genetics.

[34]  M. R. Siddiqui,et al.  Global Initiative for Chronic Obstructive Lung Disease (GOLD) , 2017 .

[35]  Christian Gieger,et al.  Genome-wide association analyses for lung function and chronic obstructive pulmonary disease identify new loci and potential druggable targets , 2017, Nature Genetics.

[36]  J. Vonk,et al.  No convincing association between genetic markers and respiratory symptoms: results of a GWA study , 2017, Respiratory Research.

[37]  Matthew T. Maurano,et al.  Genetic Drivers of Epigenetic and Transcriptional Variation in Human Immune Cells , 2016, Cell.

[38]  N. Eriksson,et al.  Genome-wide association and HLA region fine-mapping studies identify susceptibility loci for multiple common infections , 2016, Nature Communications.

[39]  S. Wesselingh,et al.  FUT2 genotype influences lung function, exacerbation frequency and airway microbiota in non-CF bronchiectasis , 2016, Thorax.

[40]  David C. Wilson,et al.  Genome-wide association study implicates immune activation of multiple integrin genes in inflammatory bowel disease , 2016, Nature Genetics.

[41]  C. Reis,et al.  Muc5ac gastric mucin glycosylation is shaped by FUT2 activity and functionally impacts Helicobacter pylori binding , 2016, Scientific Reports.

[42]  F. Cunningham,et al.  The Ensembl Variant Effect Predictor , 2016, bioRxiv.

[43]  Y. Bossé,et al.  Genome-wide interaction study of gene-by-occupational exposure and effects on FEV1 levels. , 2015, The Journal of allergy and clinical immunology.

[44]  P. Szilagyi,et al.  Epidemiologic Association Between FUT2 Secretor Status and Severe Rotavirus Gastroenteritis in Children in the United States. , 2015, JAMA pediatrics.

[45]  Gabor T. Marth,et al.  A global reference for human genetic variation , 2015, Nature.

[46]  C. Wijmenga,et al.  Cohort Profile Cohort Profile : LifeLines , a three-generation cohort study and biobank , 2015 .

[47]  Judy H. Cho,et al.  Association analyses identify 38 susceptibility loci for inflammatory bowel disease and highlight shared genetic risk across populations , 2015, Nature Genetics.

[48]  Manolis Kellis,et al.  Fine mapping of type 1 diabetes susceptibility loci and evidence for colocalization of causal variants with lymphoid gene enhancers , 2015, Nature Genetics.

[49]  Carson C Chow,et al.  Second-generation PLINK: rising to the challenge of larger and richer datasets , 2014, GigaScience.

[50]  Edwin K Silverman,et al.  Genetic susceptibility for chronic bronchitis in chronic obstructive pulmonary disease , 2014, Respiratory Research.

[51]  Y. Bossé,et al.  Dissecting the genetics of chronic mucus hypersecretion in smokers with and without COPD , 2014, European Respiratory Journal.

[52]  H. Völzke,et al.  Fucosyltransferase 2 (FUT2) non-secretor status and blood group B are associated with elevated serum lipase activity in asymptomatic subjects, and an increased risk for chronic pancreatitis: a genetic association study , 2014, Gut.

[53]  D. Postma,et al.  Association of occupational pesticide exposure with accelerated longitudinal decline in lung function. , 2014, American journal of epidemiology.

[54]  J. Le Pendu,et al.  A FUT2 gene common polymorphism determines resistance to rotavirus A of the P[8] genotype. , 2014, Journal of Infectious Diseases.

[55]  S. Tims,et al.  Faecal Microbiota Composition in Adults Is Associated with the FUT2 Gene Determining the Secretor Status , 2014, PloS one.

[56]  D. Postma,et al.  Risk factors for chronic mucus hypersecretion in individuals with and without COPD: influence of smoking and job exposure on CMH , 2014, Occupational and Environmental Medicine.

[57]  J. Le Pendu,et al.  Noroviruses and histo‐blood groups: the impact of common host genetic polymorphisms on virus transmission and evolution , 2013, Reviews in medical virology.

[58]  Tanya M. Teslovich,et al.  Discovery and refinement of loci associated with lipid levels , 2013, Nature Genetics.

[59]  Buhm Han,et al.  Imputing Amino Acid Polymorphisms in Human Leukocyte Antigens , 2013, PloS one.

[60]  Archie Campbell,et al.  Cohort Profile: Generation Scotland: Scottish Family Health Study (GS:SFHS). The study, its participants and their potential for genetic research on health and illness. , 2013, International journal of epidemiology.

[61]  C. Wallace,et al.  Bayesian Test for Colocalisation between Pairs of Genetic Association Studies Using Summary Statistics , 2013, PLoS genetics.

[62]  P. Burke,et al.  Development of orally active inhibitors of protein and cellular fucosylation , 2013, Proceedings of the National Academy of Sciences.

[63]  V. Kim,et al.  Chronic bronchitis and chronic obstructive pulmonary disease. , 2013, American journal of respiratory and critical care medicine.

[64]  Tom R. Gaunt,et al.  Predicting the Functional, Molecular, and Phenotypic Consequences of Amino Acid Substitutions using Hidden Markov Models , 2012, Human mutation.

[65]  Jing Hu,et al.  SIFT web server: predicting effects of amino acid substitutions on proteins , 2012, Nucleic Acids Res..

[66]  Philip Rosenstiel,et al.  Colonic mucosa-associated microbiota is influenced by an interaction of Crohn disease and FUT2 (Secretor) genotype , 2011, Proceedings of the National Academy of Sciences.

[67]  Cleo C. van Diemen,et al.  Susceptibility to Chronic Mucus Hypersecretion, a Genome Wide Association Study , 2011, PloS one.

[68]  J. Hokanson,et al.  The chronic bronchitic phenotype of COPD: an analysis of the COPDGene Study. , 2011, Chest.

[69]  J. Nikkilä,et al.  Secretor Genotype (FUT2 gene) Is Strongly Associated with the Composition of Bifidobacteria in the Human Intestine , 2011, PloS one.

[70]  Ivana V. Yang,et al.  A common MUC5B promoter polymorphism and pulmonary fibrosis. , 2011, The New England journal of medicine.

[71]  C. McCulloch,et al.  The H antigen at epithelial surfaces is associated with susceptibility to asthma exacerbation. , 2011, American journal of respiratory and critical care medicine.

[72]  P. Visscher,et al.  GCTA: a tool for genome-wide complex trait analysis. , 2011, American journal of human genetics.

[73]  Tariq Ahmad,et al.  Genome-wide meta-analysis increases to 71 the number of confirmed Crohn's disease susceptibility loci , 2010, Nature Genetics.

[74]  Tanya M. Teslovich,et al.  LocusZoom: regional visualization of genome-wide association scan results , 2010, Bioinform..

[75]  Tanya M. Teslovich,et al.  Biological, Clinical, and Population Relevance of 95 Loci for Blood Lipids , 2010, Nature.

[76]  S. Chanock,et al.  Genome-wide significant predictors of metabolites in the one-carbon metabolism pathway. , 2009, Human molecular genetics.

[77]  A. Varki,et al.  ABO blood group glycans modulate sialic acid recognition on erythrocytes. , 2009, Blood.

[78]  Reem Abu Mallouh,et al.  The G428A Nonsense Mutation in FUT2 Provides Strong but Not Absolute Protection against Symptomatic GII.4 Norovirus Infection , 2009, PloS one.

[79]  Toshiko Tanaka,et al.  Genome-wide association study of vitamin B6, vitamin B12, folate, and homocysteine blood concentrations. , 2009, American journal of human genetics.

[80]  S. Chanock,et al.  Common variants of FUT2 are associated with plasma vitamin B12 levels , 2008, Nature Genetics.

[81]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[82]  B. Rubin Mucolytics, expectorants, and mucokinetic medications. , 2007, Respiratory care.

[83]  Cleo C. van Diemen,et al.  A Disintegrin and Metalloprotease 33 polymorphisms and lung function decline in the general population , 2006, European Respiratory Review.

[84]  Gustaf E. Rydell,et al.  Antibody prevalence and titer to norovirus (genogroup II) correlate with secretor (FUT2) but not with ABO phenotype or Lewis (FUT3) genotype. , 2006, The Journal of infectious diseases.

[85]  Harry Campbell,et al.  Generation Scotland: the Scottish Family Health Study; a new resource for researching genes and heritability , 2006, BMC Medical Genetics.

[86]  A. Nissinen,et al.  Thirty-year cumulative incidence of chronic bronchitis and COPD in relation to 30-year pulmonary function and 40-year mortality: a follow-up in middle-aged rural men. , 2006, Chest.

[87]  S. Domino,et al.  Gastrointestinal mucins of Fut2-null mice lack terminal fucosylation without affecting colonization by Candida albicans. , 2005, Glycobiology.

[88]  D. Johns,et al.  Biological dust exposure in the workplace is a risk factor for chronic obstructive pulmonary disease , 2005, Thorax.

[89]  L. Trupin,et al.  The occupational burden of chronic obstructive pulmonary disease , 2003, European Respiratory Journal.

[90]  Xi Jiang,et al.  Human susceptibility and resistance to Norwalk virus infection , 2003, Nature Medicine.

[91]  J. Chiorini,et al.  Secreted and Transmembrane Mucins Inhibit Gene Transfer with AAV4 More Efficiently than AAV5* , 2002, The Journal of Biological Chemistry.

[92]  Y. Kodera,et al.  Polymorphisms of two fucosyltransferase genes (Lewis and Secretor genes) involving type I Lewis antigens are associated with the presence of anti-Helicobacter pylori IgG antibody. , 2001, Cancer epidemiology, biomarkers & prevention : a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology.

[93]  G. Lennon,et al.  Sequence and expression of a candidate for the human Secretor blood group alpha(1,2)fucosyltransferase gene (FUT2). Homozygosity for an enzyme-inactivating nonsense mutation commonly correlates with the non-secretor phenotype. , 1995, The Journal of biological chemistry.

[94]  S. Normark,et al.  Attachment of Helicobacter pylori to human gastric epithelium mediated by blood group antigens. , 1993, Science.

[95]  C. Blackwell,et al.  NON-SECRETION OF ABO ANTIGENS PREDISPOSING TO INFECTION BY NEISSERIA MENINGITIDIS AND STREPTOCOCCUS PNEUMONIAE , 1986, The Lancet.

[96]  I. Deary,et al.  Genome-Wide Association Study Meta-Analysis of the Alcohol Use Disorders Identification Test (AUDIT) in Two Population-Based Cohorts. , 2019, The American journal of psychiatry.

[97]  Judy H. Cho Host-microbe interactions have shaped the genetic architecture of inflammatory bowel disease , 2016 .

[98]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[99]  D. Jarvis,et al.  The European Community Respiratory Health Survey. , 1994, The European respiratory journal.