A comprehensive hybridization model allows whole HERV transcriptome profiling using high density microarray

BackgroundHuman endogenous retroviruses (HERVs) have received much attention for their implications in the etiology of many human diseases and their profound effect on evolution. Notably, recent studies have highlighted associations between HERVs expression and cancers (Yu et al., Int J Mol Med 32, 2013), autoimmunity (Balada et al., Int Rev Immunol 29:351–370, 2010) and neurological (Christensen, J Neuroimmune Pharmacol 5:326–335, 2010) conditions. Their repetitive nature makes their study particularly challenging, where expression studies have largely focused on individual loci (De Parseval et al., J Virol 77:10414–10422, 2003) or general trends within families (Forsman et al., J Virol Methods 129:16–30, 2005; Seifarth et al., J Virol 79:341–352, 2005; Pichon et al., Nucleic Acids Res 34:e46, 2006).MethodsTo refine our understanding of HERVs activity, we introduce here a new microarray, HERV-V3. This work was made possible by the careful detection and annotation of genomic HERV/MaLR sequences as well as the development of a new hybridization model, allowing the optimization of probe performances and the control of cross-reactions.ResultsHERV-V3 offers an almost complete coverage of HERVs and their ancestors (mammalian apparent LTR-retrotransposons, MaLRs) at the locus level along with four other repertoires (active LINE-1 elements, lncRNA, a selection of 1559 human genes and common infectious viruses). We demonstrate that HERV-V3 analytical performances are comparable with commercial Affymetrix arrays, and that for a selection of tissue/pathological specific loci, the patterns of expression measured on HERV-V3 is consistent with those reported in the literature.ConclusionsGiven its large HERVs/MaLRs coverage and additional repertoires, HERV-V3 opens the door to multiple applications such as enhancers and alternative promoters identification, biomarkers identification as well as the characterization of genes and HERVs/MaLRs modulation caused by viral infection.

[1]  T. Christensen HERVs in Neuropathogenesis , 2010, Journal of Neuroimmune Pharmacology.

[2]  G. Young,et al.  Microarray analysis reveals global modulation of endogenous retroelement transcription by microbes , 2014, Retrovirology.

[3]  D. Mager,et al.  Endogenous retroviral LTRs as promoters for human genes: a critical assessment. , 2009, Gene.

[4]  M. Tristem,et al.  The Evolution, Distribution and Diversity of Endogenous Retroviruses , 2003, Virus Genes.

[5]  L. Duret,et al.  The endogenous retroviral locus ERVWE1 is a bona fide gene involved in hominoid placental physiology. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[6]  Gordon K Smyth,et al.  Statistical Applications in Genetics and Molecular Biology Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments , 2011 .

[7]  Robert D. Finn,et al.  Dfam: a database of repetitive DNA based on profile hidden Markov models , 2012, Nucleic Acids Res..

[8]  J. Rinn,et al.  Many human large intergenic noncoding RNAs associate with chromatin-modifying complexes and affect gene expression , 2009, Proceedings of the National Academy of Sciences.

[9]  Richard Durbin,et al.  Fast and accurate long-read alignment with Burrows–Wheeler transform , 2010, Bioinform..

[10]  V. Cheynet,et al.  Microarray-based Identification of Individual HERV Loci Expression: Application to Biomarker Discovery in Prostate Cancer , 2013, Journal of visualized experiments : JoVE.

[11]  Arnaud Doucet,et al.  A boosting approach to structure learning of graphs with and without prior knowledge , 2009, Bioinform..

[12]  T. Heidmann,et al.  Genomewide screening for fusogenic human endogenous retrovirus envelopes identifies syncytin 2, a gene conserved on primate evolution , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[13]  C. Li,et al.  Model-based analysis of oligonucleotide arrays: expression index computation and outlier detection. , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Deepak Grover,et al.  dbRIP: A highly integrated database of retrotransposon insertion polymorphisms in humans , 2006, Human mutation.

[15]  N. Bannert,et al.  Retroelements and the human genome: New perspectives on an old relation , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[16]  Yingying Lian,et al.  Role of human noncoding RNAs in the control of tumorigenesis , 2009, Proceedings of the National Academy of Sciences.

[17]  F. Zhu,et al.  The role of human endogenous retroviral long terminal repeat sequences in human cancer (Review). , 2013, International journal of molecular medicine.

[18]  M. Beckmann,et al.  Reactivation of codogenic endogenous retroviral (ERV) envelope genes in human endometrial carcinoma and prestages: Emergence of new molecular targets , 2012, Oncotarget.

[19]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[20]  Philipp Kapranov,et al.  VlincRNAs controlled by retroviral elements are a hallmark of pluripotency and cancer , 2013, Genome Biology.

[21]  Jun Lu,et al.  Transcript-based redefinition of grouped oligonucleotide probe sets using AceView: High-resolution annotation for microarrays , 2007, BMC Bioinform..

[22]  C. A. Dunn,et al.  Transcription of two human genes from a bidirectional endogenous retrovirus promoter. , 2006, Gene.

[23]  Tiago G Fernandes,et al.  High-throughput cellular microarray platforms: applications in drug discovery, toxicology and stem cell research. , 2009, Trends in biotechnology.

[24]  J. V. Moran,et al.  Initial sequencing and analysis of the human genome. , 2001, Nature.

[25]  J. Hooyberghs,et al.  Probing hybridization parameters from microarray experiments: nearest-neighbor model and beyond , 2012, Nucleic acids research.

[26]  Thomas Dandekar,et al.  L1Base: from functional annotation to prediction of active LINE-1 elements , 2004, Nucleic Acids Res..

[27]  Hiroshi Shimizu,et al.  An improved physico-chemical model of hybridization on high-density oligonucleotide microarrays , 2008, Bioinform..

[28]  Manhong Dai,et al.  HIV infection reveals widespread expansion of novel centromeric human endogenous retroviruses , 2013, Genome research.

[29]  Jonas Blomberg,et al.  Development of broadly targeted human endogenous gammaretroviral pol-based real time PCRs Quantitation of RNA expression in human tissues. , 2005, Journal of virological methods.

[30]  D. Markovitz,et al.  Gitlin and David M. Markovitz Lymphoma and Breast Cancer Elements in the Plasma of People with Human Endogenous Retrovirus K (hml-2) Supplemental Material , 2008 .

[31]  Peter N. Murakami,et al.  Assessing affymetrix GeneChip microarray quality , 2011, BMC Bioinformatics.

[32]  John M. Coffin,et al.  Differential Expression of HERV-K (HML-2) Proviruses in Cells and Virions of the Teratocarcinoma Cell Line Tera-1 , 2015, Viruses.

[33]  J. Jurka,et al.  Repbase Update, a database of eukaryotic repetitive elements , 2005, Cytogenetic and Genome Research.

[34]  Michael Weber,et al.  Reinforcement of STAT3 activity reprogrammes human embryonic stem cells to naive-like pluripotency , 2015, Nature Communications.

[35]  Wei Chu,et al.  Biomarker discovery in microarray gene expression data with Gaussian processes , 2005, Bioinform..

[36]  F. Mallet,et al.  From Viruses to Genes: Syncytins , 2012 .

[37]  S. Hainsworth,et al.  A CRITICAL ASSESSMENT , 2014 .

[38]  M. Vilardell‐Tarrés,et al.  Implication of Human Endogenous Retroviruses in the Development of Autoimmune Diseases , 2010, International reviews of immunology.

[39]  Helen M. Rowe,et al.  Dynamic control of endogenous retroviruses during development. , 2011, Virology.

[40]  J. Kawai,et al.  The regulated retrotransposon transcriptome of mammalian cells , 2009, Nature Genetics.

[41]  Magali Jaillard,et al.  Microarray-Based Sketches of the HERV Transcriptome Landscape , 2012, PloS one.

[42]  A. Sandelin,et al.  Deep transcriptome profiling of mammalian stem cells supports a regulatory role for retrotransposons in pluripotency maintenance , 2014, Nature Genetics.

[43]  Teresa A. Webster,et al.  Probe selection for high-density oligonucleotide arrays , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[44]  F. Mallet,et al.  Custom human endogenous retroviruses dedicated microarray identifies self-induced HERV-W family elements reactivated in testicular cancer upon methylation control , 2010, Nucleic acids research.

[45]  J. Blomberg,et al.  Automated recognition of retroviral sequences in genomic data—RetroTector© , 2007, Nucleic acids research.

[46]  Mark Reimers,et al.  Making Informed Choices about Microarray Data Analysis , 2010, PLoS Comput. Biol..

[47]  C. A. Dunn,et al.  Impact of transposable elements on the evolution of mammalian gene regulation , 2005, Cytogenetic and Genome Research.

[48]  Jonas Blomberg,et al.  Classification and nomenclature of endogenous retroviral sequences (ERVs): problems and recommendations. , 2009, Gene.

[49]  A. Greenwood,et al.  Comprehensive Analysis of Human Endogenous Retrovirus Transcriptional Activity in Human Tissues with a Retrovirus-Specific Microarray , 2005, Journal of Virology.

[50]  Thierry Heidmann,et al.  Survey of Human Genes of Retroviral Origin: Identification and Transcriptome of the Genes with Coding Capacity for Complete Envelope Proteins , 2003, Journal of Virology.

[51]  B. Wullich,et al.  Expression patterns of transcribed human endogenous retrovirus HERV-K(HML-2) loci in human tissues and the need for a HERV Transcriptome Project , 2008, BMC Genomics.

[52]  D. Butterfield,et al.  Human endogenous retrovirus glycoprotein–mediated induction of redox reactants causes oligodendrocyte death and demyelination , 2004, Nature Neuroscience.

[53]  Dixie L. Mager,et al.  Retroviral Repeat Sequences , 2005 .

[54]  Shuzhao Li,et al.  A competitive hybridization model predicts probe signal intensity on high density DNA microarrays , 2008, Nucleic acids research.

[55]  J. Volff,et al.  Expression of young HERV-H loci in the course of colorectal carcinoma and correlation with molecular subtypes , 2015, Oncotarget.

[56]  F. Pedersen,et al.  Utility of next‐generation RNA‐sequencing in identifying chimeric transcription involving human endogenous retroviruses , 2016, APMIS : acta pathologica, microbiologica, et immunologica Scandinavica.

[57]  Jianrong Wang,et al.  A Gibbs sampling strategy applied to the mapping of ambiguous short-sequence tags , 2010, Bioinform..

[58]  Rafael A Irizarry,et al.  Exploration, normalization, and summaries of high density oligonucleotide array probe level data. , 2003, Biostatistics.

[59]  D. Mager,et al.  Endogenous retroviruses provide the primary polyadenylation signal for two new human genes (HHLA2 and HHLA3). , 1999, Genomics.

[60]  L. Hurst,et al.  Primate-specific endogenous retrovirus-driven transcription defines naive-like stem cells , 2014, Nature.

[61]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[62]  W. Schulz,et al.  Selective changes of retroelement expression in human prostate cancer. , 2011, Carcinogenesis.

[63]  F. Mallet,et al.  Multiplex degenerate PCR coupled with an oligo sorbent array for human endogenous retrovirus expression profiling , 2006, Nucleic acids research.

[64]  W. J. Kent,et al.  BLAT--the BLAST-like alignment tool. , 2002, Genome research.

[65]  Samuel Kaski,et al.  Methods for estimating human endogenous retrovirus activities from EST databases , 2007, BMC Bioinformatics.