Anaconda: AN automated pipeline for somatic COpy Number variation Detection and Annotation from tumor exome sequencing data

BackgroundCopy number variations (CNVs) are the main genetic structural variations in cancer genome. Detecting CNVs in genetic exome region is efficient and cost-effective in identifying cancer associated genes. Many tools had been developed accordingly and yet these tools lack of reliability because of high false negative rate, which is intrinsically caused by genome exonic bias.ResultsTo provide an alternative option, here, we report Anaconda, a comprehensive pipeline that allows flexible integration of multiple CNV-calling methods and systematic annotation of CNVs in analyzing WES data. Just by one command, Anaconda can generate CNV detection result by up to four CNV detecting tools. Associated with comprehensive annotation analysis of genes involved in shared CNV regions, Anaconda is able to deliver a more reliable and useful report in assistance with CNV-associate cancer researches.ConclusionAnaconda package and manual can be freely accessed at http://mcg.ustc.edu.cn/bsc/ANACONDA/.

[1]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[2]  B. Giusti,et al.  EXCAVATOR: detecting copy number variants from whole-exome sequencing data , 2013, Genome Biology.

[3]  Yadong Wang,et al.  ERDS-pe: A paired hidden Markov model for copy number variant detection from whole-exome sequencing data , 2016, 2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[4]  Tatiana Popova,et al.  Supplementary Methods , 2012, Acta Neuropsychiatrica.

[5]  Uri Tabori,et al.  Excessive genomic DNA copy number variation in the Li–Fraumeni cancer predisposition syndrome , 2008, Proceedings of the National Academy of Sciences.

[6]  Sampsa Hautaniemi,et al.  Comparative analysis of methods for identifying somatic copy number alterations from deep sequencing data , 2015, Briefings Bioinform..

[7]  D. Conrad,et al.  Global variation in copy number in the human genome , 2006, Nature.

[8]  Matthew S. Lebo,et al.  Detecting Copy Number Variation via Next Generation Technology , 2016, Current Genetic Medicine Reports.

[9]  D. Hanahan,et al.  Hallmarks of Cancer: The Next Generation , 2011, Cell.

[10]  Derek Y. Chiang,et al.  The landscape of somatic copy-number alteration across human cancers , 2010, Nature.

[11]  Tomas W. Fitzgerald,et al.  Origins and functional impact of copy number variation in the human genome , 2010, Nature.

[12]  David Liu,et al.  DAVID Knowledgebase: a gene-centered database integrating heterogeneous gene annotation resources to facilitate high-throughput gene functional analysis , 2007, BMC Bioinformatics.

[13]  Huan Zhang,et al.  DeAnnCNV: a tool for online detection and annotation of copy number variations from whole-exome sequencing data , 2015, Nucleic Acids Res..

[14]  X. Xie,et al.  Reproducible copy number variation patterns among single circulating tumor cells of lung cancer patients , 2013, Proceedings of the National Academy of Sciences.

[15]  John Quackenbush,et al.  Exome sequencing-based copy-number variation and loss of heterozygosity detection: ExomeCNV , 2011, Bioinform..

[16]  Christian A. Rees,et al.  Microarray analysis reveals a major direct role of DNA copy number alteration in the transcriptional program of human breast tumors , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[17]  Peter J. Park,et al.  Evaluation of somatic copy number estimation tools for whole-exome sequencing data , 2016, Briefings Bioinform..