VirusSeq: software to identify viruses and their integration sites using next-generation sequencing of human cancer tissue

SUMMARY We developed a new algorithmic method, VirusSeq, for detecting known viruses and their integration sites in the human genome using next-generation sequencing data. We evaluated VirusSeq on whole-transcriptome sequencing (RNA-Seq) data of 256 human cancer samples from The Cancer Genome Atlas. Using these data, we showed that VirusSeq accurately detects the known viruses and their integration sites with high sensitivity and specificity. VirusSeq can also perform this function using whole-genome sequencing data of human tissue. AVAILABILITY VirusSeq has been implemented in PERL and is available at http://odin.mdacc.tmc.edu/∼xsu1/VirusSeq.html. CONTACT xsu1@mdanderson.org SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

[1]  Thomas D. Wu,et al.  The effects of hepatitis B virus integration into the genomes of hepatocellular carcinoma patients. , 2012, Genome research.

[2]  H. Hausen,et al.  The search for infectious causes of human cancers: where and why. , 2009, Virology.

[3]  Angela M. Liu,et al.  Genome-wide survey of recurrent HBV integration in hepatocellular carcinoma , 2012, Nature Genetics.

[4]  Yoshiki Murakami,et al.  Hepatitis B virus-related insertional mutagenesis occurs frequently in human liver cancers and recurrently targets human telomerase gene , 2003, Oncogene.

[5]  Gabor T. Marth,et al.  Whole-genome sequencing and variant discovery in C. elegans , 2008, Nature Methods.

[6]  G. Fourel,et al.  The win locus involved in activation of the distal N-myc2 gene upon WHV integration in woodchuck liver tumors harbors S/MAR elements. , 2004, Virology.

[7]  G. Getz,et al.  PathSeq: software to identify or discover microbes by deep sequencing of human tissue , 2011, Nature Biotechnology.

[8]  Ofer Isakov,et al.  Pathogen detection using short-RNA deep sequencing subtraction and assembly , 2011, Bioinform..

[9]  B. Rosner Percentage Points for a Generalized ESD Many-Outlier Procedure , 1983 .

[10]  B. McMahon,et al.  Integrations of the hepatitis B virus (HBV) and human papillomavirus (HPV) into the human telomerase reverse transcriptase (hTERT) gene in liver and cervical cancers , 2003, Oncogene.

[11]  P. Moore,et al.  Clonal Integration of a Polyomavirus in Human Merkel Cell Carcinoma , 2008, Science.

[12]  Takehide Asano,et al.  Integration of hepatitis B virus DNA into the myeloid/lymphoid or mixed‐lineage leukemia (MLL4) gene and rearrangements of MLL4 in human hepatocellular carcinoma , 2008, Human mutation.