Since the discovery of the "Philadelphia chromosome" in chronic myelogenous leukemia in 1960, there is an ongoing intensive research of chromosomal aberrations in cancer. These aberrations, which result in abnormally structured genomes, became a hallmark of cancer. Many studies give evidence to the connection between chromosomal alterations and aberrant genes involved in the carcinogenesis process. An important problem in the analysis of cancer genomes, is inferring the history of events leading to the observed aberrations. Cancer genomes are usually described in form of karyotypes, which present the global changes in the genomes' structure. In this study, we propose a mathematical framework for analyzing chromosomal aberrations in cancer karyotypes. We introduce the problem of sorting karyotypes by elementary operations, which seeks for a shortest sequence of elementary chromosomal events transforming a normal karyotype into a given (abnormal) cancerous karyotype. Under certain assumptions, we prove a lower bound for the elementary distance, and present a polynomial-time 3-approximation algorithm. We applied our algorithm to karyotypes from the Mitelman database, which records cancer karyotypes reported in the scientific literature. Approximately 94% of the karyotypes in the database, totalling 57,252 karyotypes, supported our assumptions, and each of them was subjected to our algorithm. Remarkably, even though the algorithm is only guaranteed to generate a 3-approximation, it produced a sequence whose length matches the lower bound (and hence optimal) in 99.9% of the tested karyotypes.
[1]
Louxin Zhang,et al.
Models and Methods in Comparative Genomics
,
2006,
Adv. Comput..
[2]
H. Kuhn.
The Hungarian method for the assignment problem
,
1955
.
[3]
Benjamin J. Raphael,et al.
Reconstructing tumor genome architectures
,
2003,
ECCB.
[4]
Ajay N. Jain,et al.
Assembly of microarrays for genome-wide measurement of DNA copy number
,
2001,
Nature Genetics.
[5]
E. Birney,et al.
Patterns of somatic mutation in human cancer genomes
,
2007,
Nature.
[6]
F. Alt,et al.
DNA double strand break repair and chromosomal translocation: Lessons from animal models
,
2001,
Oncogene.
[7]
A. J. Radcliffe,et al.
Reversals and Transpositions Over Finite Alphabets
,
2005
.
[8]
J. Munkres.
ALGORITHMS FOR THE ASSIGNMENT AND TRANSIORTATION tROBLEMS*
,
1957
.
[9]
D. Albertson,et al.
Chromosome aberrations in solid tumors
,
2003,
Nature Genetics.
[10]
Bernhard Hiller,et al.
CyDAS: a cytogenetic data analysis system
,
2005,
Bioinform..
[11]
Jens Vygen,et al.
The Book Review Column1
,
2020,
SIGACT News.
[12]
Mattias Höglund,et al.
Statistical behavior of complex cancer karyotypes
,
2005,
Genes, chromosomes & cancer.