The non-human primate reference transcriptome resource (NHPRTR) for comparative functional genomics

RNA-based next-generation sequencing (RNA-Seq) provides a tremendous amount of new information regarding gene and transcript structure, expression and regulation. This is particularly true for non-coding RNAs where whole transcriptome analyses have revealed that the much of the genome is transcribed and that many non-coding transcripts have widespread functionality. However, uniform resources for raw, cleaned and processed RNA-Seq data are sparse for most organisms and this is especially true for non-human primates (NHPs). Here, we describe a large-scale RNA-Seq data and analysis infrastructure, the NHP reference transcriptome resource (http://nhprtr.org); it presently hosts data from12 species of primates, to be expanded to 15 species/subspecies spanning great apes, old world monkeys, new world monkeys and prosimians. Data are collected for each species using pools of RNA from comparable tissues. We provide data access in advance of its deposition at NCBI, as well as browsable tracks of alignments against the human genome using the UCSC genome browser. This resource will continue to host additional RNA-Seq data, alignments and assemblies as they are generated over the coming years and provide a key resource for the annotation of NHP genomes as well as informing primate studies on evolution, reproduction, infection, immunity and pharmacology.

[1]  Jeroen Demmers,et al.  RNF12 initiates X-chromosome inactivation by targeting REX1 for degradation , 2012, Nature.

[2]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[3]  Páll Melsted,et al.  Comparative RNA sequencing reveals substantial genetic variation in endangered primates. , 2012, Genome research.

[4]  J. Thierry-Mieg,et al.  AceView: a comprehensive cDNA-supported gene and transcripts annotation , 2006, Genome Biology.

[5]  Günter Kahl,et al.  Tag-based next generation sequencing / , 2012 .

[6]  Páll Melsted,et al.  A Genome Sequence Resource for the Aye-Aye (Daubentonia madagascariensis), a Nocturnal Lemur from Madagascar , 2011, Genome biology and evolution.

[7]  Sergey Koren,et al.  The bonobo genome compared with the chimpanzee and human genomes , 2012, Nature.

[8]  David G. Knowles,et al.  The GENCODE v7 catalog of human long noncoding RNAs: Analysis of their gene structure, evolution, and expression , 2012, Genome research.

[9]  Yuriy Fofanov,et al.  PIQA: pipeline for Illumina G1 genome analyzer data quality assessment , 2009, Bioinform..

[10]  Richard Durbin,et al.  Fast and accurate long-read alignment with Burrows–Wheeler transform , 2010, Bioinform..

[11]  G. Hong,et al.  Nucleic Acids Research , 2015, Nucleic Acids Research.

[12]  E. Birney,et al.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs. , 2008, Genome research.

[13]  Terrence S. Furey,et al.  Both Noncoding and Protein-Coding RNAs Contribute to Gene Expression Evolution in the Primate Brain , 2010, Genome biology and evolution.

[14]  Albert J. Vilella,et al.  Insights into hominid evolution from the gorilla genome sequence , 2012, Nature.

[15]  Martin Vingron,et al.  Oases: robust de novo RNA-seq assembly across the dynamic range of expression levels , 2012, Bioinform..

[16]  Jenny Tung,et al.  Changes in Gene Expression Associated with Reproductive Maturation in Wild Female Baboons , 2011, Genome biology and evolution.

[17]  S. Bergmann,et al.  The evolution of gene expression levels in mammalian organs , 2011, Nature.

[18]  David Z. Chen,et al.  Architecture of the human regulatory network derived from ENCODE data , 2012, Nature.

[19]  Lior Pachter,et al.  Sequence Analysis , 2020, Definitions.

[20]  Gary P. Schroth,et al.  Total RNA‐Seq: Complete Analysis of the Transcriptome Using Illumina Sequencing‐By‐Synthesis Sequencing , 2012 .

[21]  M. King,et al.  Evolution at two levels in humans and chimpanzees. , 1975, Science.

[22]  Jean L. Chang,et al.  Initial sequence of the chimpanzee genome and comparison with the human genome , 2005, Nature.

[23]  Rahul C. Deo,et al.  Programming human pluripotent stem cells into white and brown adipocytes , 2012, Nature Cell Biology.

[24]  Jian Wang,et al.  Genome sequencing and comparison of two nonhuman primate animal models, the cynomolgus and Chinese rhesus macaques , 2011, Nature Biotechnology.

[25]  Steven J. M. Jones,et al.  De novo assembly and analysis of RNA-seq data , 2010, Nature Methods.

[26]  N. Friedman,et al.  Comprehensive comparative analysis of strand-specific RNA sequencing methods , 2010, Nature Methods.

[27]  David N. Messina,et al.  Evolutionary and Biomedical Insights from the Rhesus Macaque Genome , 2007, Science.

[28]  J. Rinn,et al.  A Large Intergenic Noncoding RNA Induced by p53 Mediates Global Gene Repression in the p53 Response , 2010, Cell.

[29]  N. Friedman,et al.  Trinity: reconstructing a full-length transcriptome without a genome from RNA-Seq data , 2011, Nature Biotechnology.