Identification and annotation of conserved promoters and macrophage-expressed genes in the pig genome

Background: The FANTOM5 consortium used Cap Analysis of Gene Expression (CAGE) tag sequencing to produce a comprehensive atlas of promoters and enhancers within the human and mouse genomes. We reasoned that mapping these regulatory elements to the pig genome could provide useful annotation and evidence to support the assignment of orthology.

Results: For human transcription start sites (TSS) associated with annotated human-mouse orthologs, 17% mapped to the pig genome but not to the mouse, 10% mapped only to the mouse, and 55% mapped to both pig and mouse. Around 17% did not map to either species. The mapping percentages were lower where there was no clear orthology relationship, but in every case, mapping to pig was greater than to mouse, and the degree of homology was also greater. Combined mapping of mouse and human CAGE-defined promoters identified at least one putative conserved TSS for >16,000 protein-coding genes. About 54% of the predicted locations of regulatory elements in the pig genome were supported by CAGE and/or RNA-Seq analysis from pig macrophages.

Conclusions: Comparative mapping of promoters and enhancers from humans and mice can provide useful preliminary annotation of other animal genomes. The data also confirm extensive gain and loss of regulatory elements between species, and support the likelihood that pigs provide a better model than mice for human gene regulation and function.
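As a rough worked check of the Results figures for ortholog-associated human TSS, the four reported categories can be summed to give the total share mapping to each species. The short Python sketch below uses only the rounded percentages quoted above; the variable names are illustrative, and attributing the 1% shortfall from 100% to rounding is an assumption, not something the abstract states.

```python
# Rounded percentages of human TSS (annotated human-mouse orthologs),
# copied from the Results text; dictionary keys are illustrative names.
mapped = {"pig_only": 17, "mouse_only": 10, "both": 55, "neither": 17}

# Total share mapping to each species = species-only share + shared share.
pig_total = mapped["pig_only"] + mapped["both"]
mouse_total = mapped["mouse_only"] + mapped["both"]

# The rounded categories do not sum to exactly 100 (assumed rounding effect).
accounted = sum(mapped.values())

print(f"mapped to pig:   {pig_total}%")
print(f"mapped to mouse: {mouse_total}%")
print(f"accounted for:   {accounted}%")
```

On these figures, about 72% of the human TSS map to pig and about 65% map to mouse, which is consistent with the statement that mapping to pig exceeded mapping to mouse.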
