ISDTool: A computational model for predicting immunosuppressive domain of HERVs

Human endogenous retroviruses (HERVs) have been found to act as etiological cofactors in several chronic diseases, including cancer, autoimmunity and neurological dysfunction. Immunosuppressive domain (ISD) is a conserved region of transmembrane protein (TM) in envelope gene (env) of retroviruses. In vitro and vivo, evidence has shown that retroviral TM is highly immunosuppressive and a synthetic peptide (CKS-17) that shows homology to ISD inhibits immune function. ISD is probably a potential pathogenic element in HERVs. However, only less than one hundred ISDs of HERVs have been annotated by researchers so far, and universal software for domain prediction could not achieve sufficient accuracy for specific ISD. In this paper, a computational model is proposed to identify ISD in HERVs based on genome sequences only. It has a classification accuracy of 97.9% using Jack-knife test. 117 HERVs families were scanned with the model, 1002 new putative ISDs have been predicted and annotated in the human chromosomes. This model is also applicable to search for ISDs in human T-lymphotropic virus (HTLV), simian T-lymphotropic virus (STLV) and murine leukemia virus (MLV) because of the evolutionary relationship between endogenous and exogenous retroviruses. Furthermore, software named ISDTool has been developed to facilitate the application of the model. Datasets and the software involved in the paper are all available at https://sourceforge.net/projects/isdtool/files/ISDTool-1.0.

[1]  J. Blomberg,et al.  Sequence variation of human endogenous retrovirus ERV9-related elements in an env region corresponding to an immunosuppressive peptide: transcription in normal and neoplastic cells , 1993, Journal of virology.

[2]  Mikael Bodén,et al.  MEME Suite: tools for motif discovery and searching , 2009, Nucleic Acids Res..

[3]  Brian T. Naughton,et al.  A graph-based motif detection algorithm models complex nucleotide dependencies in transcription factor binding sites , 2006, Nucleic acids research.

[4]  Michele Magrane,et al.  UniProt Knowledgebase: a hub of integrated protein data , 2011, Database J. Biol. Databases Curation.

[5]  J. Jurka,et al.  Repbase Update, a database of eukaryotic repetitive elements , 2005, Cytogenetic and Genome Research.

[6]  R. Snyderman,et al.  Successful tumour immunotherapy: possible role of antibodies to anti-inflammatory factors produced by neoplasms. , 1985, Clinical and experimental immunology.

[7]  Terri K. Attwood,et al.  FAN: fingerprint analysis of nucleotide sequences , 2004, Nucleic Acids Res..

[8]  Vladimir Vapnik,et al.  An overview of statistical learning theory , 1999, IEEE Trans. Neural Networks.

[9]  D. Butterfield,et al.  Human endogenous retrovirus glycoprotein–mediated induction of redox reactants causes oligodendrocyte death and demyelination , 2004, Nature Neuroscience.

[10]  J. Coffin,et al.  Effects of retroviruses on host genome function. , 2008, Annual review of genetics.

[11]  M. Tristem,et al.  The Evolution, Distribution and Diversity of Endogenous Retroviruses , 2003, Virus Genes.

[12]  Danielle W. Lu,et al.  Detecting the expression of human endogenous retrovirus E envelope transcripts in human prostate adenocarcinoma , 2003, Cancer.

[13]  Zheng Yuan Prediction of protein subcellular locations using Markov chain models , 1999, FEBS letters.

[14]  S. Pizzo,et al.  Anti-Inflammatory and Vasoprotective Activity of a Retroviral-Derived Peptide, Homologous to Human Endogenous Retroviruses: Endothelial Cell Effects , 2012, PloS one.

[15]  H S Kim,et al.  Sequence and phylogeny of HERV-W pol fragments. , 2001, AIDS research and human retroviruses.

[16]  R. Weiss,et al.  Abundance of an endogenous retroviral envelope protein in placental trophoblasts suggests a biological function. , 1995, Virology.

[17]  B. Matthews Comparison of the predicted and observed secondary structure of T4 phage lysozyme. , 1975, Biochimica et biophysica acta.

[18]  Mary Goldman,et al.  The UCSC Genome Browser database: extensions and updates 2013 , 2012, Nucleic Acids Res..

[19]  T. Heidmann,et al.  Tumor cells expressing a retroviral envelope escape immune rejection in vivo. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[20]  Anne-Mieke Vandamme,et al.  A Public HTLV-1 Molecular Epidemiology Database for Sequence Management and Data Mining , 2012, PloS one.

[21]  R. Snyderman,et al.  Inhibitors of monocyte responses to chemotaxins are present in human cancerous effusions and react with monoclonal antibodies to the P15(E) structural protein of retroviruses. , 1981, The Journal of clinical investigation.

[22]  F. Tian,et al.  In silico quantitative prediction of peptides binding affinity to human MHC molecule: an intuitive quantitative structure–activity relationship approach , 2009, Amino Acids.

[23]  J. Jurka Repbase update: a database and an electronic journal of repetitive elements. , 2000, Trends in genetics : TIG.

[24]  Philippe Dessen,et al.  Identification, Phylogeny, and Evolution of Retroviral Elements Based on Their Envelope Genes , 2001, Journal of Virology.

[25]  H. Hashimoto,et al.  Immune Abnormalities Induced by Human Endogenous Retroviral Peptides: With Reference to the Pathogenesis of Systemic Lupus Erythematosus , 2003, Journal of Clinical Immunology.

[26]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[27]  Jiuqiang Han,et al.  A novel computational method for the identification of plant alternative splice sites. , 2013, Biochemical and biophysical research communications.

[28]  S. Manel,et al.  Comparing discriminant analysis, neural networks and logistic regression for predicting species distributions: a case study with a Himalayan river bird , 1999 .

[29]  J. Blomberg,et al.  Evolution of human endogenous retroviral sequences: a conceptual account. , 2008, Cellular and molecular life sciences : CMLS.

[30]  D. Rosen,et al.  Expression of multiple human endogenous retrovirus surface envelope proteins in ovarian cancer , 2007, International journal of cancer.

[31]  Menglong Li,et al.  PPM-Dom: A novel method for domain position prediction , 2013, Comput. Biol. Chem..

[32]  Gregory D. Schuler,et al.  Database resources of the National Center for Biotechnology Information: update , 2004, Nucleic acids research.

[33]  Jonas Blomberg,et al.  Classification and nomenclature of endogenous retroviral sequences (ERVs): problems and recommendations. , 2009, Gene.

[34]  Pierre Baldi,et al.  DOMpro: Protein Domain Prediction Using Profiles, Secondary Structure, Relative Solvent Accessibility, and Recursive Neural Networks , 2006, Data Mining and Knowledge Discovery.

[35]  T. Heidmann,et al.  Characterization of the three HERV-H proviruses with an open envelope reading frame encompassing the immunosuppressive domain and evolutionary history in primates. , 2001, Virology.

[36]  Steven J. M. Jones,et al.  Circos: an information aesthetic for comparative genomics. , 2009, Genome research.

[37]  F. Tian,et al.  A Structure‐based, Quantitative Structure–Activity Relationship Approach for Predicting HLA‐A*0201‐restricted Cytotoxic T Lymphocyte Epitopes , 2007, Chemical biology & drug design.

[38]  Heui-Soo Kim Genomic impact, chromosomal distribution and transcriptional regulation of HERV elements , 2012, Molecules and cells.

[39]  J. Jurka,et al.  A universal classification of eukaryotic transposable elements implemented in Repbase , 2008, Nature Reviews Genetics.

[40]  C. Mendelson,et al.  Mechanisms in tissue-specific regulation of estrogen biosynthesis in humans , 2002, Trends in Endocrinology & Metabolism.

[41]  Feng Wang-Johanning,et al.  Quantitation of HERV-K env gene expression and splicing in human breast cancer , 2003, Oncogene.