Genome Fusion Detection: a novel method to detect fusion genes from SNP-array data

Motivation: Fusion genes result from genomic rearrangements, such as deletions, amplifications and translocations. Such rearrangements can also frequently be observed in cancer and have been postulated as driving event in cancer development. to detect them, one needs to analyze the transition region of two segments with different copy number, the location where fusions are known to occur. Finding fusion genes is essential to understanding cancer development and may lead to new therapeutic approaches. Results: Here we present a novel method, the Genomic Fusion Detection algorithm, to predict fusion genes on a genomic level based on SNP-array data. This algorithm detects genes at the transition region of segments with copy number variation. With the application of defined constraints, certain properties of the detected genes are evaluated to predict whether they may be fused. We evaluated our prediction by calculating the observed frequency of known fusions in both primary cancers and cell lines. We tested a set of cell lines positive for the BCR-ABL1 fusion and prostate cancers positive for the TMPRSS2-ERG fusion. We could detect the fusions in all positive cell lines, but not in the negative controls. Availability: The algorithm is available from the supplement. Contact: philip.groth@bayer.com Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  Tomas W. Fitzgerald,et al.  Origins and functional impact of copy number variation in the human genome , 2010, Nature.

[2]  J. Melo,et al.  Selection and characterization of BCR-ABL positive cell lines with differential sensitivity to the tyrosine kinase inhibitor STI571: diverse mechanisms of resistance. , 2000, Blood.

[3]  L. Kearney,et al.  The paired box domain gene PAX5 is fused to ETV6/TEL in an acute lymphoblastic leukemia case. , 2001, Cancer research.

[4]  C. Wiuf,et al.  A review of software for microarray genotyping , 2011, Human Genomics.

[5]  Ryan Mills,et al.  Comprehensive assessment of array-based platforms and calling algorithms for detection of copy number variants , 2011, Nature Biotechnology.

[6]  P. Edwards Fusion genes and chromosome translocations in the common epithelial cancers , 2009, The Journal of pathology.

[7]  Lee T. Sam,et al.  Transcriptome Sequencing to Detect Gene Fusions in Cancer , 2009, Nature.

[8]  S. Suzuki,et al.  Chromosome 17 copy numbers and incidence of p 53 gene deletion in gastric cancer cells. Dual color fluorescence in situ hybridization analysis. , 1997, Nihon Ika Daigaku zasshi.

[9]  Joshua M. Korn,et al.  Integrated genotype calling and association analysis of SNPs, common copy number polymorphisms and rare CNVs , 2008, Nature Genetics.

[10]  P. Nowell,et al.  Chromosome studies on normal and leukemic human leukocytes. , 1960, Journal of the National Cancer Institute.

[11]  Meena Kishore Sakharkar,et al.  Distributions of exons and introns in the human genome , 2004, Silico Biol..

[12]  J. Tchinda,et al.  Molecular characterization of TMPRSS2-ERG gene fusion in the NCI-H660 prostate cancer cell line: a new perspective for an old model. , 2007, Neoplasia.

[13]  D. Berney,et al.  Distinct genomic alterations in prostate cancers in Chinese and Western populations suggest alternative pathways of prostate carcinogenesis. , 2010, Cancer research.

[14]  A. Ferrando,et al.  Fusion of NUP214 to ABL1 on amplified episomes in T-cell acute lymphoblastic leukemia , 2004, Nature Genetics.

[15]  Joseph T. Glessner,et al.  PennCNV: an integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data. , 2007, Genome research.

[16]  Süleyman Cenk Sahinalp,et al.  deFuse: An Algorithm for Gene Fusion Discovery in Tumor RNA-Seq Data , 2011, PLoS Comput. Biol..

[17]  M. Hofer,et al.  Die TMPRSS2-ETS-Genfusion beim Prostatakarzinom , 2007, Der Urologe.

[18]  Human Cell Culture , 2002 .

[19]  Tieliu Shi,et al.  Overview of available methods for diverse RNA-Seq data analyses , 2011, Science China Life Sciences.

[20]  Y. Matsuo,et al.  ABL-BCR expression in BCR-ABL-positive human leukemia cell lines. , 1999, Leukemia research.

[21]  B. Johansson,et al.  Primary vs. secondary neoplasia‐associated chromosomal abnormalities—balanced rearrangements vs. genomic imbalances? , 1996, Genes, chromosomes & cancer.

[22]  Ira M. Hall,et al.  Detection and interpretation of genomic structural variation in mammals. , 2012, Methods in molecular biology.

[23]  E. Liu,et al.  Next-generation DNA sequencing of paired-end tags (PET) for transcriptome and genome analyses. , 2009, Genome research.

[24]  F. Mitelman Cancer cytogenetics update 2005 , 2011 .

[25]  Cheng Li,et al.  Lessons from a decade of integrating cancer copy number alterations with gene expression profiles , 2012, Briefings Bioinform..

[26]  M. Long,et al.  A new function evolved from gene fusion. , 2000, Genome research.

[27]  A. Tsalenko,et al.  The fine-scale and complex architecture of human copy-number variation. , 2008, American journal of human genetics.

[28]  Elisa Rossi,et al.  Epidermal growth factor receptor gene and protein and gefitinib sensitivity in non-small-cell lung cancer. , 2005, Journal of the National Cancer Institute.

[29]  J. Tchinda,et al.  Recurrent fusion of TMPRSS2 and ETS transcription factor genes in prostate cancer. , 2006, Science.

[30]  Osamu Miura,et al.  Sorafenib induces apoptosis specifically in cells expressing BCR/ABL by inhibiting its kinase activity to activate the intrinsic mitochondrial pathway. , 2009, Cancer research.

[31]  G. Barbany,et al.  Early landmark analysis of imatinib treatment in CML chronic phase: Less than 10% BCR‐ABL by FISH at 3 months associated with improved long‐term clinical outcome , 2012, American journal of hematology.

[32]  Nicholas A. Stover,et al.  Detection of Fused Genes in Eukaryotic Genomes using Gene deFuser: Analysis of the Tetrahymena thermophila genome , 2011, BMC Bioinformatics.

[33]  Yoshiyuki Shibata,et al.  Detection of DNA fusion junctions for BCR-ABL translocations by Anchored ChromPET , 2010, Genome Medicine.

[34]  Shigeru Chiba,et al.  A robust algorithm for copy number detection using high-density oligonucleotide single nucleotide polymorphism genotyping arrays. , 2005, Cancer research.

[35]  H. Aburatani,et al.  Identification of the transforming EML4–ALK fusion gene in non-small-cell lung cancer , 2007, Nature.

[36]  Joseph B Hiatt,et al.  Evidence for compensatory upregulation of expressed X-linked genes in mammals, Caenorhabditis elegans and Drosophila melanogaster , 2011, Nature Genetics.

[37]  S. Dhanasekaran,et al.  Distinct classes of chromosomal rearrangements create oncogenic ETS gene fusions in prostate cancer , 2007, Nature.

[38]  H. Greisman,et al.  Rapid high-resolution mapping of balanced chromosomal rearrangements on tiling CGH arrays. , 2011, The Journal of molecular diagnostics : JMD.

[39]  L. Kearney,et al.  Molecular cytogenetics in haematological malignancy: current technology and future prospects , 2005, Chromosoma.

[40]  Benjamin J. Raphael,et al.  Detection of recurrent rearrangement breakpoints from copy number data , 2011, BMC Bioinformatics.

[41]  Xiaobo Zhou,et al.  Conditional random pattern model for copy number aberration detection , 2009, BMC Bioinformatics.

[42]  S. Sen,et al.  Aneuploidy and cancer , 2000, Current opinion in oncology.

[43]  B. Johansson,et al.  The impact of translocations and gene fusions on cancer causation , 2007, Nature Reviews Cancer.

[44]  J. Melo,et al.  A novel BCR-ABL fusion gene (e6a2) in a patient with Philadelphia chromosome-negative chronic myelogenous leukemia. , 1996, Blood.

[45]  B J Williams,et al.  Comparative genomic hybridization. , 1996, Methods in molecular medicine.

[46]  S. Swamy,et al.  PICNIC: an algorithm to predict absolute allelic copy number variation with microarray cancer data , 2009, Biostatistics.

[47]  Y. Bang,et al.  The potential for crizotinib in non-small cell lung cancer: a perspective review , 2011, Therapeutic advances in medical oncology.

[48]  E. Koonin,et al.  Evolution of gene fusions: horizontal transfer versus independent events , 2002, Genome Biology.

[49]  Luc Girard,et al.  An integrated view of copy number and allelic alterations in the cancer genome using single nucleotide polymorphism arrays. , 2004, Cancer research.

[50]  D. Mount Bioinformatics: Sequence and Genome Analysis , 2001 .