A Functional Genomics Pipeline to Identify High-Value CpG Dinucleotides Related to Allergy or Asthma in the Human Methylome

DNA methylation of cytosines at CpG dinucleotides is a widespread epigenetic mark that is responsive to environmental exposures and associated with a wide range of human pathologies. However, compared to genetic variation, genome wide epigenetic variation has been relatively unexplored due to the limited representation of variable CpGs on commercial high throughput platforms. To address this gap, we developed a pipeline to identify high value asthma and allergy associated CpGs among the >28 million CpG sites in the human genome. We focused on epithelial cells: sentinels in the airway that respond to inhaled microbes, pollution and allergens and mediate their downstream effects on asthma and allergic disease risk. We combined whole genome bisulfite sequencing in these cells from children with and without allergic asthma and in silico evidence of functionality to identify high value CpGs for a custom DNA methylation array. Compared to commercial arrays, the Asthma&Allergy Custom array was enriched for CpGs with intermediate methylation levels, which are more likely to be correlated with the expression of their nearest or target genes and associated with allergic sensitization in children of diverse ancestries compared to CpGs on the commercial array. Our study revealed signature features of functional CpGs and unveiled a wealth of DNA methylation variation at functional CpG sites that are missed by existing high throughput arrays, indicating that inter individual variation in DNA methylation may contribute more to disease risk than previously appreciated.

[1]  C. Akdis The epithelial barrier hypothesis proposes a comprehensive understanding of the origins of allergic and other chronic non-communicable diseases. , 2021, The Journal of allergy and clinical immunology.

[2]  Matthew C. Altman,et al.  Multi-omics colocalization with genome-wide association studies reveals a context-specific genetic mechanism at a childhood onset asthma risk locus , 2021, Genome medicine.

[3]  C. McEvoy,et al.  Impact of vitamin C supplementation on placental DNA methylation changes related to maternal smoking: association with gene expression and respiratory outcomes , 2021, Clinical epigenetics.

[4]  Lin S. Chen,et al.  Genetic regulation of DNA methylation across tissues reveals thousands of molecular links to complex traits , 2021 .

[5]  C. Ward‐Caviness,et al.  Controlled human exposures to diesel exhaust: a human epigenome-wide experiment of target bronchial epithelial cells , 2021, Environmental epigenetics.

[6]  Scott R. Presnell,et al.  Endotype of Allergic Asthma with Airway Obstruction in Urban Children. , 2021, The Journal of allergy and clinical immunology.

[7]  M. Desai,et al.  Air pollution exposure is linked with methylation of immunoregulatory genes, altered immune cell profiles, and increased blood pressure in children , 2021, Scientific Reports.

[8]  D. Nicolae,et al.  Altered transcriptional and chromatin responses to rhinovirus in bronchial epithelial cells from adults with asthma , 2020, Communications Biology.

[9]  Chris McKennan,et al.  Factor analysis in high dimensional biological data with dependent observations , 2020, 2009.11134.

[10]  R. Krishnan,et al.  Cytokine-induced molecular responses in airway smooth muscle cells inform genome-wide association studies of asthma , 2020, Genome Medicine.

[11]  Michael J. Purcaro,et al.  Expanded encyclopaedias of DNA elements in the human and mouse genomes , 2020, Nature.

[12]  C. Ober,et al.  CDHR3 Asthma-Risk Genotype Affects Susceptibility of Airway Epithelium to Rhinovirus C Infections. , 2019, American journal of respiratory cell and molecular biology.

[13]  Brian D. Bennett,et al.  Comparison of smoking-related DNA methylation between newborns from prenatal exposure and adults from personal smoking , 2019, Epigenomics.

[14]  Anshul Kundaje,et al.  The ENCODE Blacklist: Identification of Problematic Regions of the Genome , 2019, Scientific Reports.

[15]  D. Nicolae,et al.  Shared and distinct genetic risk factors for childhood-onset and adult-onset asthma: genome-wide and transcriptome-wide studies. , 2019, The Lancet. Respiratory medicine.

[16]  I. Ruczinski,et al.  The MALT1 locus and peanut avoidance in the risk for peanut allergy. , 2019, The Journal of allergy and clinical immunology.

[17]  Tao Zhang,et al.  EWAS Atlas: a curated knowledgebase of epigenome-wide association studies , 2018, Nucleic Acids Res..

[18]  Genome-wide association analysis of 350,000 Caucasians from the UK Biobank identifies novel loci for asthma, hay fever and eczema , 2018 .

[19]  P. Donnelly,et al.  The UK Biobank resource with deep phenotyping and genomic data , 2018, Nature.

[20]  S. London,et al.  DNA methylation signature of smoking in lung cancer is enriched for exposure signatures in newborn and adult blood , 2018, Scientific Reports.

[21]  Sara A. Grimm,et al.  DNA methylation and transcriptome aberrations mediated by ERα in mouse seminal vesicles following developmental DES exposure , 2018, Proceedings of the National Academy of Sciences.

[22]  D. Nicolae,et al.  Accounting for unobserved covariates with varying degrees of estimability in high dimensional experimental data , 2018 .

[23]  P. Eline Slagboom,et al.  DNA methylation as a mediator of the association between prenatal adversity and risk factors for metabolic disease in adulthood , 2018, Science Advances.

[24]  Alexander E. Kel,et al.  cutPrimers: A New Tool for Accurate Cutting of Primers from Reads of Targeted Next Generation Sequencing , 2017, J. Comput. Biol..

[25]  Francesco Salvatore,et al.  Epigenetic features of FoxP3 in children with cow’s milk allergy , 2016, Clinical Epigenetics.

[26]  J. Gern,et al.  The contributions of allergic sensitization and respiratory pathogens to asthma inception , 2016, Journal of Allergy and Clinical Immunology.

[27]  G. Smyth,et al.  ROBUST HYPERPARAMETER ESTIMATION PROTECTS AGAINST HYPERVARIABLE GENES AND IMPROVES POWER TO DETECT DIFFERENTIAL EXPRESSION. , 2016, The annals of applied statistics.

[28]  D. Nicolae,et al.  Genome-Wide Methylation Study Identifies an IL-13-induced Epigenetic Signature in Asthmatic Airways. , 2016, American journal of respiratory and critical care medicine.

[29]  Warren A. Cheung,et al.  Population whole-genome bisulfite sequencing across two tissues highlights the environment as the principal source of human methylome variation , 2015, Genome Biology.

[30]  Gabor T. Marth,et al.  A global reference for human genetic variation , 2015, Nature.

[31]  Timothy A Thornton,et al.  Robust Inference of Population Structure for Ancestry Prediction and Correction of Stratification in the Presence of Relatedness , 2015, Genetic epidemiology.

[32]  Donald H. Arnold,et al.  Objectives, design and enrollment results from the Infant Susceptibility to Pulmonary Infections and Asthma Following RSV Exposure Study (INSPIRE) , 2015, BMC Pulmonary Medicine.

[33]  Differences in DNA methylation profile of Th1 and Th2 cytokine genes are associated with tolerance acquisition in children with IgE-mediated cow’s milk allergy , 2015, Clinical Epigenetics.

[34]  Miao Yu,et al.  Bacterial infection remodels the DNA methylation landscape of human dendritic cells , 2015, bioRxiv.

[35]  Ting Wang,et al.  Intermediate DNA methylation is a conserved signature of genome regulation , 2015, Nature Communications.

[36]  Michael Q. Zhang,et al.  Integrative analysis of 111 reference human epigenomes , 2015, Nature.

[37]  Matthew E. Ritchie,et al.  limma powers differential expression analyses for RNA-sequencing and microarray studies , 2015, Nucleic acids research.

[38]  Rafael A. Irizarry,et al.  Minfi: a flexible and comprehensive Bioconductor package for the analysis of Infinium DNA methylation microarrays , 2014, Bioinform..

[39]  L. Hummelshoj,et al.  Allergic sensitization: host-immune factors , 2014, Clinical and Translational Allergy.

[40]  B. Langmead,et al.  BSmooth: from whole genome bisulfite sequencing reads to differentially methylated regions , 2012, Genome Biology.

[41]  Weiliang Qiu,et al.  Cigarette smoking behaviors and time since quitting are associated with differential DNA methylation across the human genome. , 2012, Human molecular genetics.

[42]  A. Oshlack,et al.  SWAN: Subset-quantile Within Array Normalization for Illumina Infinium HumanMethylation450 BeadChips , 2012, Genome Biology.

[43]  S. Holgate,et al.  The sentinel role of the airway epithelium in asthma pathogenesis , 2011, Immunological reviews.

[44]  Felix Krueger,et al.  Bismark: a flexible aligner and methylation caller for Bisulfite-Seq applications , 2011, Bioinform..

[45]  David M. Simcha,et al.  Tackling the widespread and critical impact of batch effects in high-throughput data , 2010, Nature Reviews Genetics.

[46]  Rosalind J Wright,et al.  The Urban Environment and Childhood Asthma (URECA) birth cohort study: design, methods, and study population , 2009, BMC pulmonary medicine.

[47]  Cheng Li,et al.  Adjusting batch effects in microarray expression data using empirical Bayes methods. , 2007, Biostatistics.

[48]  R. Lemanske The Childhood Origins of Asthma (COAST) study , 2002, Pediatric allergy and immunology : official publication of the European Society of Pediatric Allergy and Immunology.

[49]  B. Niggemann,et al.  The pattern of atopic sensitization is associated with the development of asthma in childhood. , 2001, The Journal of allergy and clinical immunology.