Three minimal sequences found in Ebola virus genomes and absent from human DNA

Motivation: Ebola virus causes high mortality hemorrhagic fevers, with more than 25 000 cases and 10 000 deaths in the current outbreak. Only experimental therapies are available, thus, novel diagnosis tools and druggable targets are needed. Results: Analysis of Ebola virus genomes from the current outbreak reveals the presence of short DNA sequences that appear nowhere in the human genome. We identify the shortest such sequences with lengths between 12 and 14. Only three absent sequences of length 12 exist and they consistently appear at the same location on two of the Ebola virus proteins, in all Ebola virus genomes, but nowhere in the human genome. The alignment-free method used is able to identify pathogen-specific signatures for quick and precise action against infectious agents, of which the current Ebola virus outbreak provides a compelling example. Availability and Implementation: EAGLE is freely available for non-commercial purposes at http://bioinformatics.ua.pt/software/eagle. Contact: raquelsilva@ua.pt; pratas@ua.pt Supplementary Information: Supplementary data are available at Bioinformatics online.

[1]  Timothy B. Stockwell,et al.  Deep Sequencing Identifies Noncanonical Editing of Ebola and Marburg Virus RNAs in Infected Cells , 2014, mBio.

[2]  Yoshihiro Kawaoka,et al.  Functional Mapping of the Nucleoprotein of Ebola Virus , 2006, Journal of Virology.

[3]  Joshua C. Johnson,et al.  Postexposure protection of non-human primates against a lethal Ebola virus challenge with RNA interference: a proof-of-concept study , 2010, The Lancet.

[4]  Tao Jiang,et al.  Efficient computation of shortest absent words in a genomic sequence , 2010, Inf. Process. Lett..

[5]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[6]  Paolo Fontana,et al.  keeSeek: searching distant non-existing words in genomes for PCR-based applications , 2014, Bioinform..

[7]  Stephan Günther,et al.  Emergence of Zaire Ebola virus disease in Guinea. , 2014, The New England journal of medicine.

[8]  Armando J. Pinho,et al.  Minimal Absent Words in Prokaryotic and Eukaryotic Genomes , 2011, PloS one.

[9]  Robert Giegerich,et al.  BMC Bioinformatics BioMed Central Methodology article Efficient computation of absent words in genomic sequences , 2008 .

[10]  Rachel S. G. Sealfon,et al.  Genomic surveillance elucidates Ebola virus origin and transmission during the 2014 outbreak , 2014, Science.

[11]  M. Rossmann,et al.  The structure of the RNA-dependent RNA polymerase from bovine viral diarrhea virus establishes the role of GTP in de novo initiation. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Ben M. Webb,et al.  Comparative Protein Structure Modeling Using Modeller , 2006, Current protocols in bioinformatics.

[13]  M. Blackledge,et al.  Structure of Nipah virus unassembled nucleoprotein in complex with its viral chaperone , 2014, Nature Structural &Molecular Biology.

[14]  Maxime Crochemore,et al.  Using minimal absent words to build phylogeny , 2012, Theor. Comput. Sci..

[15]  Ashok K Tiwari,et al.  Rapid label-free visual assay for the detection and quantification of viral RNA using peptide nucleic acid (PNA) and gold nanoparticles (AuNPs). , 2013, Analytica chimica acta.

[16]  Manfred J. Sippl,et al.  Thirty years of environmental health research--and growing. , 1996, Nucleic Acids Res..

[17]  J. Conde,et al.  Gold-nanobeacons for gene therapy: evaluation of genotoxicity, cell toxicity and proteome profiling analysis , 2014, Nanotoxicology.

[18]  A. Gulland Clinical trials of Ebola therapies to begin in December , 2014, BMJ : British Medical Journal.

[19]  Daniel G. Anderson,et al.  Non-viral vectors for gene-based therapy , 2014, Nature Reviews Genetics.

[20]  F. Chen,et al.  Nanoparticle-Mediated Systemic Delivery of siRNA for Treatment of Cancers and Viral Infections , 2014, Theranostics.

[21]  I. Lukashevich Advanced Vaccine Candidates for Lassa Fever , 2012, Viruses.

[22]  T. Blundell,et al.  Comparative protein modelling by satisfaction of spatial restraints. , 1993, Journal of molecular biology.

[23]  M. Sippl Recognition of errors in three‐dimensional structures of proteins , 1993, Proteins.

[24]  Michel G Bergeron,et al.  Rapid molecular theranostics in infectious diseases. , 2002, Drug discovery today.

[25]  Jeffrey E. Lee,et al.  The Secret Life of Viral Entry Glycoproteins: Moonlighting in Immune Evasion , 2013, PLoS pathogens.

[26]  Luísa Azevedo,et al.  NAMPT and NAPRT1: novel polymorphisms and distribution of variants between normal tissues and tumor samples , 2014, Scientific Reports.

[27]  W. Delano The PyMOL Molecular Graphics System , 2002 .

[28]  L. Hutwagner,et al.  Effective Vaccine for Lassa Fever , 2000, Journal of Virology.

[29]  J. Mascola,et al.  Safety and Immunogenicity of DNA Vaccines Encoding Ebolavirus and Marburgvirus Wild-Type Glycoproteins in a Phase I Clinical Trial , 2014, The Journal of infectious diseases.

[30]  R. Wilson,et al.  Modernizing Reference Genome Assemblies , 2011, PLoS biology.

[31]  Armando J. Pinho,et al.  On finding minimal absent words , 2009, BMC Bioinformatics.

[32]  Ben M. Webb,et al.  Comparative Protein Structure Modeling Using MODELLER , 2016, Current protocols in bioinformatics.

[33]  R. Compans,et al.  Antigenic Subversion: A Novel Mechanism of Host Immune Evasion by Ebola Virus , 2012, PLoS pathogens.

[34]  B. Friedrich,et al.  Potential Vaccines and Post-Exposure Treatments for Filovirus Infections , 2012, Viruses.

[35]  E. Hayden RNA interference rebooted , 2014, Nature.

[36]  Advantages of peptide nucleic acids as diagnostic platforms for detection of nucleic acids in resource-limited settings. , 2010, The Journal of infectious diseases.

[37]  B Veigas,et al.  A low cost, safe, disposable, rapid and self-sustainable paper-based platform for diagnostic testing: lab-on-paper , 2014, Nanotechnology.

[38]  Heinz Feldmann,et al.  Live attenuated recombinant vaccine protects nonhuman primates against Ebola and Marburg viruses , 2005, Nature Medicine.

[39]  S. Bavari,et al.  Therapeutics for filovirus infection: traditional approaches and progress towards in silico drug design , 2012, Expert opinion on drug discovery.