A new approach to determining whole viral genomic sequences including termini using a single deep sequencing run.

Next-generation sequencing is now commonly used for a variety of applications in virology including virus discovery, investigation of quasispecies, viral evolution, metagenomics, and analyses of antiviral resistance. However, there are limitations with the current sample preparation methods used for deep sequencing of viral genomes, especially during de novo sequencing. For example, current methods are unable to capture the terminal sequences of viral genomes in an efficient and effective manner; data representing the 3' and 5' ends are typically insufficient. Methods such as Rapid Amplification of cDNA Ends address this issue but these methods can be time consuming, may require some prior knowledge of the viral sequence, and require multiple independent procedures. The current study outlines a sample preparation technique that overcomes some of these shortcomings. The method relied on random fragmentation with divalent cations and subsequent adapter ligation directly to RNA, rather than cDNA, to maximize the quality and quantity of terminal reads. The technique was tested on RNA samples from two different RNA viruses, Ebola virus and hepatitis C virus. This method permits rapid preparation of samples for deep sequencing while eliminating the use of sequence specific primers and captures the entire genome sequence, including the 5' and 3' ends. This could improve the efficiency of virus discovery projects where the terminal ends are unknown.

[1]  A. Djikeng,et al.  Viral genome sequencing by random priming methods , 2008 .

[2]  A. Sanchez,et al.  Sequence analysis of the Ebola virus genome: organization, genetic elements, and comparison with the genome of Marburg virus. , 1993, Virus research.

[3]  M. Gerstein,et al.  RNA-Seq: a revolutionary tool for transcriptomics , 2009, Nature Reviews Genetics.

[4]  P. Silver,et al.  Retrovirus-delivered siRNA , 2002, BMC biotechnology.

[5]  Hiroyuki Miyoshi,et al.  Optimization of an siRNA‐expression system with an improved hairpin and its significant suppressive effects in mammalian cells , 2004, The journal of gene medicine.

[6]  L. H. Taylor,et al.  Diseases of humans and their domestic mammals: pathogen characteristics, host range and the risk of emergence. , 2001, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[7]  Fatih Ozsolak,et al.  RNA sequencing: advances, challenges and opportunities , 2011, Nature Reviews Genetics.

[8]  Filoviruses: A Compendium of 40 Years of Epidemiological, Clinical, and Laboratory Studies , 2009, Emerging Infectious Diseases.

[9]  B. Williams,et al.  Mapping and quantifying mammalian transcriptomes by RNA-Seq , 2008, Nature Methods.

[10]  P. Stadler,et al.  Conserved RNA secondary structures in Flaviviridae genomes. , 2004, The Journal of general virology.

[11]  Mark E.J. Woolhouse,et al.  Host Range and Emerging and Reemerging Pathogens , 2005, Emerging infectious diseases.

[12]  E. Eichler,et al.  Limitations of next-generation genome sequence assembly , 2011, Nature Methods.

[13]  E. Mardis Next-generation DNA sequencing methods. , 2008, Annual review of genomics and human genetics.

[14]  E. Lavezzo,et al.  Applications of Next-Generation Sequencing Technologies to Diagnostic Virology , 2011, International Journal of Molecular Sciences.

[15]  M. Gerstein,et al.  The Transcriptional Landscape of the Yeast Genome Defined by RNA Sequencing , 2008, Science.

[16]  P. Plagemann Hepatitis C virus , 2005, Archives of Virology.