Computers and viral diseases. Preliminary bioinformatics studies on the design of a synthetic vaccine and a preventative peptidomimetic antagonist against the SARS-CoV-2 (2019-nCoV, COVID-19) coronavirus

Abstract This paper concerns study of the genome of the Wuhan Seafood Market isolate believed to represent the causative agent of the disease COVID-19. This is to find a short section or sections of viral protein sequence suitable for preliminary design proposal for a peptide synthetic vaccine and a peptidomimetic therapeutic, and to explore some design possibilities. The project was originally directed towards a use case for the Q-UEL language and its implementation in a knowledge management and automated inference system for medicine called the BioIngine, but focus here remains mostly on the virus itself. However, using Q-UEL systems to access relevant and emerging literature, and to interact with standard publically available bioinformatics tools on the Internet, did help quickly identify sequences of amino acids that are well conserved across many coronaviruses including 2019-nCoV. KRSFIEDLLFNKV was found to be particularly well conserved in this study and corresponds to the region around one of the known cleavage sites of the SARS virus that are believed to be required for virus activation for cell entry. This sequence motif and surrounding variations formed the basis for proposing a specific synthetic vaccine epitope and peptidomimetic agent. The work can, nonetheless, be described in traditional bioinformatics terms, and readily reproduced by others, albeit with the caveat that new data and research into 2019-nCoV is emerging and evolving at an explosive pace. Preliminary studies using molecular modeling and docking, and in that context the potential value of certain known herbal extracts, are also described.

[1]  Vimal Kumar,et al.  Mechanism & inhibition kinetics of bioassay-guided fractions of Indian medicinal plants and foods as ACE inhibitors , 2018, Journal of traditional and complementary medicine.

[2]  Erik Verner,et al.  Dissecting and designing inhibitor selectivity determinants at the S1 site using an artificial Ala190 protease (Ala190 uPA). , 2004, Journal of Molecular Biology.

[3]  Dongqing Wei,et al.  Prediction and validation of potent peptides against herpes simplex virus type 1 via immunoinformatic and systems biology approach , 2019, Chemical biology & drug design.

[4]  M. Joshi,et al.  Peptide Vaccine: Progress and Challenges , 2014, Vaccines.

[5]  Sergio Rosales-Mendoza,et al.  An overview of bioinformatics tools for epitope prediction: Implications on vaccine development , 2015, J. Biomed. Informatics.

[6]  Barry Robson,et al.  Studies in the assessment of folding quality for protein modeling and structure prediction. , 2002, Journal of proteome research.

[7]  Syed Shujait Ali,et al.  Immunoinformatic and systems biology approaches to predict and validate peptide vaccines against Epstein–Barr virus (EBV) , 2019, Scientific Reports.

[8]  Barry Robson,et al.  Split-complex numbers and Dirac bra-kets , 2014, Commun. Inf. Syst..

[9]  I. Soares,et al.  Editorial: Epitope Discovery and Synthetic Vaccine Design , 2018, Front. Immunol..

[10]  Dong-Qing Wei,et al.  A-CaMP: a tool for anti-cancer and antimicrobial peptide generation , 2019, Journal of biomolecular structure & dynamics.

[11]  Barry Robson Towards New Tools for Pharmacoepidemiology , 2012 .

[12]  Barry Robson,et al.  Suggestions for a web based universal exchange and inference language for medicine. Continuity of patient care with PCAST disaggregation , 2015, Comput. Biol. Medicine.

[13]  Barry Robson,et al.  Extension of the Quantum Universal Exchange Language to precision medicine and drug lead discovery. Preliminary example studies using the mitochondrial genome , 2020, Comput. Biol. Medicine.

[14]  B. Robson The Concept of Novel Compositions of Matter: A Theoretical Analysis , 2014 .

[15]  B. Bosch,et al.  Coronavirus Escape from Heptad Repeat 2 (HR2)-Derived Peptide Entry Inhibition as a Result of Mutations in the HR1 Domain of the Spike Fusion Protein , 2007, Journal of Virology.

[16]  P. Masters,et al.  The Molecular Biology of Coronaviruses , 2006, Advances in Virus Research.

[17]  G. Whittaker,et al.  Activation of the SARS coronavirus spike protein via sequential proteolytic cleavage at two distinct sites , 2009, Proceedings of the National Academy of Sciences.

[18]  Barry Robson,et al.  Drug discovery using very large numbers of patents. General strategy with extensive use of match and edit operations , 2011, J. Comput. Aided Mol. Des..

[19]  S. Sarafianos,et al.  Novel Inhibitors of Severe Acute Respiratory Syndrome Coronavirus Entry That Act by Three Distinct Mechanisms , 2013, Journal of Virology.

[20]  Haixia Zhou,et al.  Cryo-electron microscopy structures of the SARS-CoV spike glycoprotein reveal a prerequisite conformational state for receptor binding , 2016, Cell Research.

[21]  Shakti Sahi,et al.  CytoMegaloVirus Infection Database: A Public Omics Database for Systematic and Comparable Information of CMV , 2019, Interdisciplinary Sciences: Computational Life Sciences.

[22]  M. H. Regenmortel Synthetic Peptide Vaccines and the Search for Neutralization B Cell Epitopes , 2009, HIV/AIDS: Immunochemistry, Reductionism and Vaccine Design.

[23]  J Garnier,et al.  Studies on rationales for an expert system approach to the interpretation of protein sequence data Preliminary analysis of the human epidermal growth factor receptor , 1987, FEBS letters.

[24]  Jagdish Rai Peptide and protein mimetics by retro and retroinverso analogs , 2019, Chemical biology & drug design.

[25]  Olivier Barré,et al.  Cleavage Specificity Analysis of Six Type II Transmembrane Serine Proteases (TTSPs) Using PICS with Proteome-Derived Peptide Libraries , 2014, PloS one.

[26]  B. Robson,et al.  Expert system for protein engineering: its application in the study of choramphenicol acetyltransferase and avian pancreatic polypeptide , 1987 .

[27]  Barry Robson,et al.  Studies in the extensively automatic construction of large odds-based inference networks from structured data. Examples from medical, bioinformatics, and health insurance claims data , 2018, Comput. Biol. Medicine.

[28]  ingwei Liu,et al.  Peptides Corresponding to the Predicted Heptad Repeat 2 Domain of the Feline Coronavirus Spike Protein Are Potent Inhibitors of Viral Infection , 2013, PloS one.

[29]  Barry Robson,et al.  POPPER, a simple programming language for probabilistic semantic inference in medicine , 2015, Comput. Biol. Medicine.

[30]  Barry Robson,et al.  Hyperbolic Dirac Nets for medical decision support. Theory, methods, and comparison with Bayes Nets , 2014, Comput. Biol. Medicine.

[31]  Barry Robson,et al.  Studies in using a universal exchange and inference language for evidence based medicine. Semi-automated learning and reasoning for PICO methodology, systematic review, and environmental epidemiology , 2016, Comput. Biol. Medicine.

[32]  Dong-Qing Wei,et al.  Exploring the Papillomaviral Proteome to Identify Potential Candidates for a Chimeric Vaccine against Cervix Papilloma Using Immunomics and Computational Structural Vaccinology , 2019, Viruses.

[33]  G. Fasman Prediction of Protein Structure and the Principles of Protein Conformation , 2012, Springer US.

[34]  Yi Xiong,et al.  DTI-CDF: a cascade deep forest model towards the prediction of drug-target interactions based on hybrid features , 2019, Briefings Bioinform..

[35]  B. Robson,et al.  Chapter 7 – The role of information, bioinformatics and genomics , 2013 .

[36]  R. Bruzzone,et al.  Cleavage of the SARS Coronavirus Spike Glycoprotein by Airway Proteases Enhances Virus Entry into Human Bronchial Epithelial Cells In Vitro , 2009, PloS one.

[37]  Barry Robson,et al.  Bidirectional General Graphs for inference. Principles and implications for medicine , 2019, Comput. Biol. Medicine.

[38]  Ralph S. Baric,et al.  Receptor Recognition by the Novel Coronavirus from Wuhan: an Analysis Based on Decade-Long Structural Studies of SARS Coronavirus , 2020, Journal of Virology.

[39]  B. Robson,et al.  Prediction of HIV vaccine , 1987, Nature.

[40]  Tin-Yun Ho,et al.  Emodin blocks the SARS coronavirus spike protein and angiotensin-converting enzyme 2 interaction , 2006, Antiviral Research.

[41]  Barry Robson,et al.  Protein folding revisited. , 2008, Progress in molecular biology and translational science.

[42]  B. Robson,et al.  Studies of the role of a smart web for precision medicine supported by biobanking. , 2016, Personalized medicine.

[43]  Jianhua Shen,et al.  Emodin, a natural product, selectively inhibits 11β‐hydroxysteroid dehydrogenase type 1 and ameliorates metabolic disorder in diet‐induced obese mice , 2010, British journal of pharmacology.

[44]  Barry Robson,et al.  Implementation of a web based universal exchange and inference language for medicine: Sparse data, probabilities and inference in data mining of clinical data repositories , 2015, Comput. Biol. Medicine.

[45]  Barry Robson,et al.  Suggestions for a Web based universal exchange and inference language for medicine , 2013, Comput. Biol. Medicine.

[46]  D. Osguthorpe,et al.  Monte Carlo simulation of water behavior around the dipeptide N-acetylalanyl-N-methylamide. , 1980, Science.

[47]  Barry Robson,et al.  Data-mining to build a knowledge representation store for clinical decision support. Studies on curation and validation based on machine performance in multiple choice medical licensing examinations , 2016, Computers in Biology and Medicine.

[48]  Fang Li,et al.  Structure, Function, and Evolution of Coronavirus Spike Proteins. , 2016, Annual review of virology.

[49]  B. Robson Doppelgänger proteins as drug leads , 1996, Nature Biotechnology.

[50]  M. Dahan,et al.  The role of information , 2006 .

[51]  M. Clerici,et al.  The heptad repeat region is a major selection target in MERS-CoV and related coronaviruses , 2015, Scientific Reports.

[52]  N. Mabbott,et al.  Progress in Molecular Biology and Translational Science , 2017 .

[53]  Jin Li,et al.  BIOINFORMATICS AND COMPUTATIONAL CHEMISTRY IN MOLECULAR DESIGN : RECENT ADVANCES AND THEIR ADDLICATIONS , 2000 .

[54]  Sakshi Sachdeva Peptides as ‘Drugs’: The Journey so Far , 2016, International Journal of Peptide Research and Therapeutics.

[55]  Barry Robson,et al.  Towards Automated Reasoning for Drug Discovery and Pharmaceutical Business Intelligence , 2012 .

[56]  Barry Robson,et al.  Studies in the use of data mining, prediction algorithms, and a universal exchange and inference language in the analysis of socioeconomic health data , 2019, Comput. Biol. Medicine.

[57]  B Robson Computer aided peptide and protein engineering. , 1989, Progress in clinical and biological research.

[58]  Barry Robson,et al.  Interesting things for computer systems to do: Keeping and data mining millions of patient records, guiding patients and physicians, and passing medical licensing exams , 2015, 2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[59]  Barry Robson,et al.  Introduction to proteins and protein engineering , 1986 .

[60]  R. Hodges,et al.  Advantages of a Synthetic Peptide Immunogen Over a Protein Immunogen in the Development of an Anti‐Pilus Vaccine for Pseudomonas aeruginosa , 2009, Chemical biology & drug design.

[61]  Barry Robson,et al.  Considerations for a Universal Exchange Language for healthcare , 2011, 2011 IEEE 13th International Conference on e-Health Networking, Applications and Services.

[62]  Wenjing Yu,et al.  Emodin inhibits current through SARS-associated coronavirus 3a protein , 2011, Antiviral Research.

[63]  Lennart M. Reinke,et al.  Different residues in the SARS-CoV spike protein determine cleavage and activation by the host cell protease TMPRSS2 , 2017, PloS one.

[64]  J. Garnier,et al.  The GOR Method for Predicting Secondary Structures in Proteins , 1989 .