An immuno-informatics approach for annotation of hypothetical proteins and multi-epitope vaccine designed against the Mpox virus

A worrying new outbreak of Monkeypox (Mpox) in humans is caused by the Mpox virus (MpoxV). The pathogen has roughly 28 hypothetical proteins of unknown structure, function, and pathogenicity. Using reliable bioinformatics tools, we attempted to analyze the MpoxV genome, identify the role of hypothetical proteins (HPs), and design a potential candidate vaccine. Out of 28, we identified seven hypothetical proteins using multi-server validation with high confidence for the occurrence of conserved domains. Their physical, chemical, and functional characterizations, including molecular weight, theoretical isoelectric point, 3D structures, GRAVY value, subcellular localization, functional motifs, antigenicity, and virulence factors, were performed. We predicted possible cytotoxic T cell (CTL), helper T cell (HTL) and linear and conformational B cell epitopes, which were combined in a 219 amino acid multiepitope vaccine with human β defensin as a linker. This multi-epitopic vaccine was structurally modelled and docked with toll-like receptor-3 (TLR-3). The dynamical stability of the vaccine-TLR-3 docked complexes exhibited stable interactions based on RMSD and RMSF tests. Additionally, the modelled vaccine was cloned in-silico in an E. coli host to check the appropriate expression of the final vaccine built. Our results might conform to an immunogenic and safe vaccine, which would require further experimental validation.Communicated by Ramaswamy H. Sarma.

[1]  B. Chopade,et al.  Insights into the challenging multi-country outbreak of Mpox: a comprehensive review. , 2023, Journal of medical microbiology.

[2]  Abhishek S. Rao,et al.  Translational vaccinomics and structural filtration algorithm to device multiepitope vaccine for catastrophic monkeypox virus , 2022, Comput. Biol. Medicine.

[3]  K. Dhama,et al.  FDA's authorized “JYNNEOS” vaccine for counteracting monkeypox global public health emergency; an update – Correspondence , 2022, International journal of surgery.

[4]  S. Luo,et al.  Immunoinformatic-based design of immune-boosting multiepitope subunit vaccines against monkeypox virus and validation through molecular dynamics and immune simulation , 2022, Frontiers in Immunology.

[5]  Mujahed I. Mustafa,et al.  Novel multi epitope-based vaccine against monkeypox virus: vaccinomic approach , 2022, Scientific Reports.

[6]  P. Alam,et al.  Multi-Epitope-Based Vaccine Candidate for Monkeypox: An In Silico Approach , 2022, Vaccines.

[7]  Mansoor Ahmed,et al.  Monkeypox in 2022: A new threat in developing , 2022, Annals of medicine and surgery.

[8]  Kaviya Parambath Kootery,et al.  Structural and functional characterization of a hypothetical protein in the RD7 region in clinical isolates of Mycobacterium tuberculosis — an in silico approach to candidate vaccines , 2022, Journal of Genetic Engineering and Biotechnology.

[9]  F. Lienert,et al.  The changing epidemiology of human monkeypox—A potential threat? A systematic review , 2021, medRxiv.

[10]  B. Okech,et al.  Screening and characterization of hypothetical proteins of Plasmodium falciparum as novel vaccine candidates in the fight against malaria using reverse vaccinology , 2021, Journal of Genetic Engineering and Biotechnology.

[11]  Wolfgang Nejdl,et al.  Capturing Protein Domain Structure and Function Using Self-Supervision on Domain Architectures , 2021, Algorithms.

[12]  A. Amin,et al.  In Silico Characterization of a Hypothetical Protein from Shigella dysenteriae ATCC 12039 Reveals a Pathogenesis-Related Protein of the Type-VI Secretion System , 2021, Bioinformatics and biology insights.

[13]  Rafael A. Caceres,et al.  Geo-Measures: A PyMOL plugin for protein structure ensembles analysis , 2020, Comput. Biol. Chem..

[14]  N. Verma,et al.  Functional Annotation and Curation of Hypothetical Proteins Present in A Newly Emerged Serotype 1c of Shigella flexneri: Emphasis on Selecting Targets for Virulence and Vaccine Design Studies , 2020, Genes.

[15]  V. B. Rao,et al.  A systematic review of the epidemiology of human monkeypox outbreaks and implications for outbreak strategy , 2019, PLoS neglected tropical diseases.

[16]  Syed Shujait Ali,et al.  Immunoinformatics and structural vaccinology driven prediction of multi-epitope vaccine against Mayaro virus and validation through in-silico expression. , 2019, Infection, genetics and evolution : journal of molecular epidemiology and evolutionary genetics in infectious diseases.

[17]  Syed Shujait Ali,et al.  Immunoinformatic and systems biology approaches to predict and validate peptide vaccines against Epstein–Barr virus (EBV) , 2019, Scientific Reports.

[18]  G. Ippolito,et al.  Monkeypox — Enhancing public health preparedness for an emerging lethal human zoonotic epidemic threat in the wake of the smallpox post-eradication era , 2018, International Journal of Infectious Diseases.

[19]  Torsten Schwede,et al.  SWISS-MODEL: homology modelling of protein structures and complexes , 2018, Nucleic Acids Res..

[20]  V. Prajapati,et al.  Exploring Leishmania secretory proteins to design B and T cell multi-epitope subunit vaccine using immunoinformatics approach , 2017, Scientific Reports.

[21]  Alexandre M. J. J. Bonvin,et al.  PRODIGY: a web server for predicting the binding affinity of protein-protein complexes , 2016, Bioinform..

[22]  E. Mateu,et al.  Immunological Features of the Non-Structural Proteins of Porcine Reproductive and Respiratory Syndrome Virus , 2015, Viruses.

[23]  Han Wool Kim,et al.  A Potential Protein Adjuvant Derived from Mycobacterium tuberculosis Rv0652 Enhances Dendritic Cells-Based Tumor Immunotherapy , 2014, PloS one.

[24]  E. Birney,et al.  Pfam: the protein families database , 2013, Nucleic Acids Res..

[25]  Daniel R Roe,et al.  PTRAJ and CPPTRAJ: Software for Processing and Analysis of Molecular Dynamics Trajectory Data. , 2013, Journal of chemical theory and computation.

[26]  Jian Peng,et al.  Template-based protein structure modeling using the RaptorX web server , 2012, Nature Protocols.

[27]  Chaok Seok,et al.  GalaxyWEB server for protein structure prediction and refinement , 2012, Nucleic Acids Res..

[28]  Umashankar Vetrivel,et al.  A novel in silico approach to identify potential therapeutic targets in human bacterial pathogens , 2011, The HUGO Journal.

[29]  K. Chou,et al.  Cell-PLoc 2.0: an improved package of web-servers for predicting subcellular localization of proteins in various organisms , 2010 .

[30]  Holger Gohlke,et al.  DrugScorePPI webserver: fast and accurate in silico alanine scanning for scoring protein–protein interactions , 2010, Nucleic Acids Res..

[31]  Patricia C. Babbitt,et al.  Annotation Error in Public Databases: Misannotation of Molecular Function in Enzyme Superfamilies , 2009, PLoS Comput. Biol..

[32]  Chih-Chieh Chen,et al.  (PS)2-v2: template-based protein structure prediction server , 2009, BMC Bioinformatics.

[33]  Prashanth Suravajhala,et al.  In Silico screening for functional candidates amongst hypothetical proteins , 2009, BMC Bioinformatics.

[34]  Arturo Casadevall,et al.  Virulence factors and their mechanisms of action: the view from a damage-response framework. , 2009, Journal of water and health.

[35]  C. Orengo,et al.  Protein function annotation by homology-based inference , 2009, Genome Biology.

[36]  Nir Ben-Tal,et al.  Detection of functionally important regions in "hypothetical proteins" of known structure. , 2008, Structure.

[37]  Wei Li,et al.  ElliPro: a new structure-based tool for the prediction of antibody epitopes , 2008, BMC Bioinformatics.

[38]  S. Garcia-Vallvé,et al.  CAIcal: A combined set of tools to assess codon usage adaptation , 2008, Biology Direct.

[39]  Dinesh Gupta,et al.  VirulentPred: a SVM based prediction method for virulent proteins in bacterial pathogens , 2008, BMC Bioinformatics.

[40]  David A. Lee,et al.  Predicting protein function from sequence and structure , 2007, Nature Reviews Molecular Cell Biology.

[41]  Morten Nielsen,et al.  Large-scale validation of methods for cytotoxic T-lymphocyte epitope prediction , 2007, BMC Bioinformatics.

[42]  Ruth Nussinov,et al.  FireDock: Fast interaction refinement in molecular docking , 2007, Proteins.

[43]  A. Clatworthy,et al.  Targeting virulence: a new paradigm for antimicrobial therapy , 2007, Nature Chemical Biology.

[44]  Manfred J. Sippl,et al.  Thirty years of environmental health research--and growing. , 1996, Nucleic Acids Res..

[45]  S. Brunak,et al.  Locating proteins in the cell using TargetP, SignalP and related tools , 2007, Nature Protocols.

[46]  Avner Schlessinger,et al.  Towards a consensus on datasets and evaluation metrics for developing B‐cell epitope prediction tools , 2007, Journal of molecular recognition : JMR.

[47]  Irini A. Doytchinova,et al.  BMC Bioinformatics BioMed Central Methodology article VaxiJen: a server for prediction of protective antigens, tumour , 2007 .

[48]  M. Sabourin,et al.  A flexible protein linker improves the function of epitope‐tagged proteins in Saccharomyces cerevisiae , 2007, Yeast.

[49]  Narmada Thanki,et al.  CDD: a conserved domain database for interactive domain family analysis , 2006, Nucleic Acids Res..

[50]  Jenn-Kang Hwang,et al.  Prediction of protein subcellular localization , 2006, Proteins.

[51]  Gajendra P. S. Raghava,et al.  AlgPred: prediction of allergenic proteins and mapping of IgE epitopes , 2006, Nucleic Acids Res..

[52]  Amos Bairoch,et al.  ScanProsite: detection of PROSITE signature matches and ProRule-associated functional and structural residues in proteins , 2006, Nucleic Acids Res..

[53]  Chih-Chieh Chen,et al.  (PS)2: protein structure prediction server , 2006, Nucleic Acids Res..

[54]  J. Sejvar,et al.  Clinical characteristics of human monkeypox, and risk factors for severe disease. , 2005, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[55]  Gert Lubec,et al.  Searching for hypothetical proteins: Theory and practice based upon original data and literature , 2005, Progress in Neurobiology.

[56]  Benjamin F. Cravatt,et al.  Assignment of protein function in the postgenomic era , 2005 .

[57]  P. Formenty,et al.  Extended interhuman transmission of monkeypox in a hospital community in the Republic of the Congo, 2003. , 2005, The American journal of tropical medicine and hygiene.

[58]  Rolf Apweiler,et al.  InterProScan: protein domains identifier , 2005, Nucleic Acids Res..

[59]  Ruth Nussinov,et al.  PatchDock and SymmDock: servers for rigid and symmetric docking , 2005, Nucleic Acids Res..

[60]  Alessandro Sette,et al.  Generating quantitative models describing the sequence specificity of biological processes with the stabilized matrix method , 2005, BMC Bioinformatics.

[61]  P. Kloetzel,et al.  Modeling the MHC class I pathway by combining predictions of proteasomal cleavage,TAP transport and MHC class I binding , 2005, Cellular and Molecular Life Sciences CMLS.

[62]  Conrad C. Huang,et al.  UCSF Chimera—A visualization system for exploratory research and analysis , 2004, J. Comput. Chem..

[63]  S Brunak,et al.  Sensitive quantitative predictions of peptide-MHC binding by a 'Query by Committee' artificial neural network approach. , 2003, Tissue antigens.

[64]  D. Chaplin Overview of the immune response. , 2003, The Journal of allergy and clinical immunology.

[65]  Markus Reiher,et al.  Quantum chemical calculation of vibrational spectra of large molecules—Raman and IR spectra for Buckminsterfullerene , 2002, J. Comput. Chem..

[66]  István Simon,et al.  The HMMTOP transmembrane topology prediction server , 2001, Bioinform..

[67]  Thomas L. Madden,et al.  Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements. , 2001, Nucleic acids research.

[68]  A. Krogh,et al.  Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. , 2001, Journal of molecular biology.

[69]  D. Higgins,et al.  T-Coffee: A novel method for fast and accurate multiple sequence alignment. , 2000, Journal of molecular biology.

[70]  Liam J. McGuffin,et al.  The PSIPRED protein structure prediction server , 2000, Bioinform..

[71]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[72]  I. Wilson,et al.  Structural evidence for induced fit as a mechanism for antibody-antigen recognition. , 1994, Science.

[73]  T. Yeates,et al.  Verification of protein structures: Patterns of nonbonded atomic interactions , 1993, Protein science : a publication of the Protein Society.

[74]  M. W. Pandit,et al.  Correlation between stability of a protein and its dipeptide composition: a novel approach for predicting in vivo stability of a protein from its primary sequence. , 1990, Protein engineering.

[75]  P. V. von Hippel,et al.  Calculation of protein extinction coefficients from amino acid sequence data. , 1989, Analytical biochemistry.

[76]  R. Hodges,et al.  New hydrophilicity scale derived from high-performance liquid chromatography peptide retention data: correlation of predicted surface residues with antigenicity and X-ray-derived accessible sites. , 1986, Biochemistry.

[77]  E. Emini,et al.  Induction of hepatitis A virus-neutralizing antibody by a virus-specific synthetic peptide , 1985, Journal of virology.

[78]  R. Doolittle,et al.  A simple method for displaying the hydropathic character of a protein. , 1982, Journal of molecular biology.

[79]  Michael Zuker,et al.  Optimal computer folding of large RNA sequences using thermodynamics and auxiliary information , 1981, Nucleic Acids Res..

[80]  F. Chappuis,et al.  Safety of live vaccines on immunosuppressive or immunomodulatory therapy—a retrospective study in three Swiss Travel Clinics , 2018, Journal of travel medicine.

[81]  F. van Kessel,et al.  Contraindication of live vaccines in immunocompromised patients: an estimate of the number of affected people in the USA and the UK. , 2017, Public health.

[82]  A. Marra Targeting Virulence for Antibacterial Chemotherapy , 2006 .

[83]  Cathy H. Wu,et al.  UniProt: the Universal Protein knowledgebase , 2004, Nucleic Acids Res..

[84]  Philip Lijnzaad,et al.  The Ensembl genome database project , 2002, Nucleic Acids Res..

[85]  Claire O'Donovan,et al.  The SWISS-PROT protein sequence data bank and its supplement TrEMBL in 1999 , 1999, Nucleic Acids Res..

[86]  Shigeki Mitaku,et al.  SOSUI: classification and secondary structure prediction system for membrane proteins , 1998, Bioinform..

[87]  G. Rose,et al.  Antigenic determinants in proteins coincide with surface regions accessible to large probes (antibody domains). , 1986, Proceedings of the National Academy of Sciences of the United States of America.