Learning functional conservation between pig and human to decipher evolutionary mechanisms underlying gene expression and complex trait

The assessment of genomic conservation between human and pig at the functional level can help understand and improve the potential of pig as a human biomedical model. To address this, we developed a Deep learning-based approach to learn the Genomic Conservation at the Functional level (DeepGCF) between species by integrating 386 and 374 epigenome and transcriptome profiles from human and pig, respectively. DeepGCF demonstrated a better prediction performance compared to the previous functional conservation prediction method. In addition, we showed that the resulting DeepGCF score captures the functional conservation by examining DeepGCF on chromatin states, sequence ontologies, and regulatory variants. Regions with higher DeepGCF score play a more important role in regulatory activities and show heritability enrichment in human complex traits and diseases. Our DeepGCF approach shows a promising application on the comparison of cross-species functional conservation, and the model framework can be easily adapted to other species. By expanding the model to integrate the functional profiles of multiple species, including human, mouse, pig, cattle, and other livestock animals in the future, the functional conservation information will provide additional insight into the genetic and evolutionary mechanisms behind complex traits and diseases.

[1]  Xiangdong Ding,et al.  A compendium of genetic regulatory effects across pig tissues , 2022, bioRxiv.

[2]  P. Visscher,et al.  Leveraging functional genomic annotations and genome coverage to improve polygenic prediction of complex traits within and between ancestries , 2022, European Neuropsychopharmacology.

[3]  J. Prendergast,et al.  The conservation of human functional variants and their effects across livestock species , 2022, Communications Biology.

[4]  Ying Yu,et al.  Comparative transcriptome in large-scale human and cattle populations , 2022, Genome Biology.

[5]  P. VanRaden,et al.  A multi-tissue atlas of regulatory variants in cattle , 2022, Nature Genetics.

[6]  J. Boeke,et al.  Results of Two Cases of Pig-to-Human Kidney Xenotransplantation. , 2022, The New England journal of medicine.

[7]  J. Tena,et al.  Multidimensional chromatin profiling of zebrafish pancreas to uncover and investigate disease-relevant enhancers , 2022, Nature Communications.

[8]  Michael F. Green,et al.  Mapping genomic loci implicates genes and synaptic biology in schizophrenia , 2022, Nature.

[9]  C. Lindgren,et al.  Modelling the genetic aetiology of complex disease: human–mouse conservation of noncoding features and disease-associated loci , 2022, Biology Letters.

[10]  Kathleen M. Chen,et al.  A sequence-based global map of regulatory activity for deciphering human genetics , 2021, Nature Genetics.

[11]  J. Lunney,et al.  Importance of the pig as a human biomedical model , 2021, Science Translational Medicine.

[12]  M. Lund,et al.  Pig genome functional annotation enhances the biological interpretation of complex traits and human disease , 2021, Nature Communications.

[13]  Shuhong Zhao,et al.  A compendium and comparative epigenomics analysis of cis-regulatory elements in the pig genome , 2021, Nature Communications.

[14]  Soo Bin Kwon,et al.  Learning a genome-wide score of human–mouse conservation at the functional genomics level , 2020, Nature Communications.

[15]  Voichita D. Marinescu,et al.  A comparative genomics multitool for scientific discovery and conservation , 2020, Nature.

[16]  P. Visscher,et al.  Using prior information from humans to prioritize genes and gene-associated variants for complex traits in livestock , 2020, bioRxiv.

[17]  Huanming Yang,et al.  An atlas of the protein-coding genes in the human, pig, and mouse brain , 2020, Science.

[18]  Siska Croubels,et al.  Comparative toxicokinetics of Fusarium mycotoxins in pigs and humans. , 2020, Food and chemical toxicology : an international journal published for the British Industrial Biological Research Association.

[19]  Matti Pirinen,et al.  Functionally-informed fine-mapping and polygenic localization of complex trait heritability , 2019, Nature Genetics.

[20]  Paul Flicek,et al.  Variant calling on the GRCh38 assembly with the data from phase three of the 1000 Genomes Project , 2019, Wellcome open research.

[21]  F. Hormozdiari,et al.  Disease heritability enrichment of regulatory elements is concentrated in elements with ancient sequence age and conserved function across species , 2018, bioRxiv.

[22]  Xiao Xiang Zhu,et al.  Identifying Corresponding Patches in SAR and Optical Images With a Pseudo-Siamese CNN , 2018, IEEE Geoscience and Remote Sensing Letters.

[23]  M. Stephens,et al.  Flexible statistical methods for estimating and testing effects in genomic studies with multiple conditions , 2016, bioRxiv.

[24]  Stephane E. Castel,et al.  Quantifying the regulatory effect size of cis-acting genetic variation using allelic fold change , 2016, bioRxiv.

[25]  Shawn M. Gillespie,et al.  Insulator dysfunction and oncogene activation in IDH mutant gliomas , 2015, Nature.

[26]  Yakir A Reshef,et al.  Partitioning heritability by functional annotation using genome-wide association summary statistics , 2015, Nature Genetics.

[27]  O. Troyanskaya,et al.  Predicting effects of noncoding variants with deep learning–based sequence model , 2015, Nature Methods.

[28]  Michael Q. Zhang,et al.  CRISPR Inversion of CTCF Sites Alters Genome Topology and Enhancer/Promoter Function , 2015, Cell.

[29]  Hans H. Cheng,et al.  Coordinated international action to accelerate genome-to-phenome with FAANG, the Functional Annotation of Animal Genomes project , 2015, Genome Biology.

[30]  Michael Q. Zhang,et al.  Integrative analysis of 111 reference human epigenomes , 2015, Nature.

[31]  J. T. Erichsen,et al.  Enhancer Evolution across 20 Mammalian Species , 2015, Cell.

[32]  K. Lindblad-Toh,et al.  Comparative genomics as a tool to understand evolution and disease , 2013, Genome research.

[33]  E. Wolf,et al.  Permanent Neonatal Diabetes in INSC94Y Transgenic Pigs , 2013, Diabetes.

[34]  Dan Xie,et al.  Comparative Epigenomic Annotation of Regulatory DNA , 2012, Cell.

[35]  Lucy J. Colwell,et al.  A core subunit of Polycomb repressive complex 1 is broadly conserved in function but not primary sequence , 2012, Proceedings of the National Academy of Sciences.

[36]  G. Bourque,et al.  Transposable elements have rewired the core regulatory network of human embryonic stem cells , 2010, Nature Genetics.

[37]  A. Visel,et al.  Limits of sequence and functional conservation , 2010, Nature Genetics.

[38]  Cory Y. McLean,et al.  GREAT improves functional interpretation of cis-regulatory regions , 2010, Nature Biotechnology.

[39]  K. Pollard,et al.  Detection of nonneutral substitution rates on mammalian phylogenies. , 2010, Genome research.

[40]  Lars Bolund,et al.  Hemizygous minipigs produced by random gene insertion and handmade cloning express the Alzheimer’s disease-causing dominant mutation APPsw , 2009, Transgenic Research.

[41]  S. Batzoglou,et al.  Distribution and intensity of constraint in mammalian genomic sequence. , 2005, Genome research.

[42]  D. Haussler,et al.  Human-mouse alignments with BLASTZ. , 2003, Genome research.