A genome-wide analysis of common fragile sites: What features determine chromosomal instability in the human genome?

Chromosomal common fragile sites (CFSs) are unstable genomic regions that break under replication stress and are involved in structural variation. They frequently are sites of chromosomal rearrangements in cancer and of viral integration. However, CFSs are undercharacterized at the molecular level and thus difficult to predict computationally. Newly available genome-wide profiling studies provide us with an unprecedented opportunity to associate CFSs with features of their local genomic contexts. Here, we contrasted the genomic landscape of cytogenetically defined aphidicolin-induced CFSs (aCFSs) to that of nonfragile sites, using multiple logistic regression. We also analyzed aCFS breakage frequencies as a function of their genomic landscape, using standard multiple regression. We show that local genomic features are effective predictors both of regions harboring aCFSs (explaining ∼77% of the deviance in logistic regression models) and of aCFS breakage frequencies (explaining ∼45% of the variance in standard regression models). In our optimal models (having highest explanatory power), aCFSs are predominantly located in G-negative chromosomal bands and away from centromeres, are enriched in Alu repeats, and have high DNA flexibility. In alternative models, CpG island density, transcription start site density, H3K4me1 coverage, and mononucleotide microsatellite coverage are significant predictors. Also, aCFSs have high fragility when colocated with evolutionarily conserved chromosomal breakpoints. Our models are predictive of the fragility of aCFSs mapped at a higher resolution. Importantly, the genomic features we identified here as significant predictors of fragility allow us to draw valuable inferences on the molecular mechanisms underlying aCFSs.

[1]  A. Bhargava,et al.  Mutational Dynamics of Microsatellites , 2010, Molecular biotechnology.

[2]  F. Pelliccia,et al.  Replication timing of two human common fragile sites: FRA1H and FRA2G , 2008, Cytogenetic and Genome Research.

[3]  C. Croce,et al.  Fragile site orthologs FHIT/FRA3B and Fhit/Fra14A2: Evolutionarily conserved but highly recombinogenic , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[4]  D. Stone,et al.  Animal model: Chromosomal fragile site expression in dogs: II. Expression in boxer dogs with mast cell tumors , 1991 .

[5]  M. Schwab,et al.  Common fragile site FRA11G and rare fragile site FRA11B at 11q23.3 encompass distinct genomic regions , 2007, Genes, chromosomes & cancer.

[6]  T. Robinson,et al.  Multiple common fragile sites are expressed in the genome of the laboratory rat , 2004, Chromosoma.

[7]  F. Pelliccia,et al.  Breakages at common fragile sites set boundaries of amplified regions in two leukemia cell lines K562 - Molecular characterization of FRA2H and localization of a new CFS FRA2S. , 2010, Cancer letters.

[8]  Jose Castresana,et al.  Is mammalian chromosomal evolution driven by regions of genome fragility? , 2006, Genome Biology.

[9]  A. Papavassiliou,et al.  Oncogene-induced replication stress preferentially targets common fragile sites in preneoplastic lesions. A genome-wide study , 2008, Oncogene.

[10]  F. López-Giráldez,et al.  Assessing the Role of Tandem Repeats in Shaping the Genomic Architecture of Great Apes , 2011, PloS one.

[11]  F. Ruddle,et al.  Chromosomal variation in man: catalog of chromosomal variants and anomalies. , 1975, Birth defects original article series.

[12]  Owen T McCann,et al.  Replication timing of the human genome. , 2004, Human molecular genetics.

[13]  W. Bickmore,et al.  Genes and genomes: chromosome bands – flavours to savour , 1993 .

[14]  David M. Gilbert,et al.  Evaluating genome-scale approaches to eukaryotic DNA replication , 2010, Nature Reviews Genetics.

[15]  Miriam K. Konkel,et al.  A mobile threat to genome stability: The impact of non-LTR retrotransposons upon the human genome. , 2010, Seminars in cancer biology.

[16]  Stijn van Dongen,et al.  miRBase: tools for microRNA genomics , 2007, Nucleic Acids Res..

[17]  A. Kuwano,et al.  Common fragile sites induced by folate deprivation, BrdU and aphidicolin: Their frequency and distribution in Japanese individuals , 1988, The Japanese Journal of Human Genetics.

[18]  R. Scott Hansen,et al.  Cell-type-specific replication initiation programs set fragility of the FRA3B fragile site , 2011, Nature.

[19]  Kateryna D. Makova,et al.  A Macaque's-Eye View of Human Insertions and Deletions: Differences in Mechanisms , 2007, PLoS Comput. Biol..

[20]  David Haussler,et al.  The UCSC Genome Browser database: update 2010 , 2009, Nucleic Acids Res..

[21]  Ming Yi,et al.  Non-B DB: a database of predicted non-B DNA-forming motifs in mammalian genomes , 2010, Nucleic Acids Res..

[22]  Mary Goldman,et al.  The UCSC Genome Browser database: update 2011 , 2010, Nucleic Acids Res..

[23]  D. Comings Mechanisms of chromosome banding and implications for chromosome structure. , 1978, Annual review of genetics.

[24]  F. Apiou,et al.  Characterization of a conserved aphidicolin-sensitive common fragile site at human 4q22 and mouse 6C1: possible association with an inherited disease and cancer , 2004, Oncogene.

[25]  B. Kerem,et al.  Fragile sites are preferential targets for integrations of MLV vectors in gene therapy , 2006, Gene Therapy.

[26]  A. Ruiz-Herrera,et al.  Conservation of aphidicolin-induced fragile sites in Papionini (Primates) species and humans , 2004, Chromosome Research.

[27]  J. Fox Nonparametric Regression Appendix to An R and S-PLUS Companion to Applied Regression , 2002 .

[28]  S. Dalton,et al.  Evolutionarily conserved replication timing profiles predict long-range chromatin interactions and distinguish closely related cell types. , 2010, Genome research.

[29]  T. Glover,et al.  Induction of sister chromatid exchanges at common fragile sites. , 1987, American journal of human genetics.

[30]  Neerja Karnani,et al.  Genomic Study of Replication Initiation in Human Chromosomes Reveals the Influence of Transcription Regulation and Chromatin Structure on Origin Selection , 2010, Molecular biology of the cell.

[31]  N. Draper,et al.  Applied Regression Analysis , 1966 .

[32]  S. Scherer,et al.  Molecular characterization of a common fragile site (FRA7H) on human chromosome 7 by the cloning of a simian virus 40 integration site. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[33]  J. de Grouchy,et al.  A cytogenetic survey of 110 baboons (Papio cynocephalus). , 1981, American journal of physical anthropology.

[34]  D. Beer,et al.  The murine Fhit gene is highly similar to its human orthologue and maps to a common fragile site region. , 1998, Cancer research.

[35]  David Haussler,et al.  Patterns of insertions and their covariation with substitutions in the rat, mouse, and human genomes. , 2004, Genome research.

[36]  C. Freudenreich,et al.  An AT-rich sequence in human common fragile site FRA16D causes fork stalling and chromosome breakage in S. cerevisiae. , 2007, Molecular cell.

[37]  Nathaniel D. Heintzman,et al.  Histone modifications at human enhancers reflect global cell-type-specific gene expression , 2009, Nature.

[38]  Terrence S. Furey,et al.  The UCSC Genome Browser Database , 2003, Nucleic Acids Res..

[39]  Giorgio Bernardi,et al.  An isochore map of human chromosomes. , 2006, Genome research.

[40]  David Haussler,et al.  Integration of the cytogenetic map with the draft human genome sequence. , 2003, Human molecular genetics.

[41]  David L. Steffen,et al.  The DNA sequence of the human X chromosome , 2005, Nature.

[42]  R. Espinosa,et al.  Replication of a common fragile site, FRA3B, occurs late in S phase and is delayed further upon induction: implications for the mechanism of fragile site induction. , 1998, Human molecular genetics.

[43]  J. Fryns,et al.  Human chromosome fragility. , 2008, Biochimica et biophysica acta.

[44]  T. Glover,et al.  Common fragile sites as targets for chromosome rearrangements. , 2006, DNA repair.

[45]  A. Helmrich,et al.  Identification of the human/mouse syntenic common fragile site FRA7K/Fra12C1—Relation of FRA7K and other human common fragile sites on chromosome 7 to evolutionary breakpoints , 2007, International journal of cancer.

[46]  F. Hecht Fragile sites, cancer chromosome breakpoints, and oncogenes all cluster in light G bands. , 1988, Cancer genetics and cytogenetics.

[47]  M. W. Glynn,et al.  Stably transfected common fragile site sequences exhibit instability at ectopic sites , 2008, Genes, chromosomes & cancer.

[48]  Riitta Lahesmaa,et al.  Copy number variation and selection during reprogramming to pluripotency , 2011, Nature.

[49]  R Nussinov,et al.  Sequence dependence of DNA conformational flexibility. , 1989, Biochemistry.

[50]  T. Robinson,et al.  Rodent common fragile sites: Are they conserved? Evidence from mouse and rat , 1989, Chromosoma.

[51]  David Haussler,et al.  Covariation in frequencies of substitution, deletion, transposition, and recombination during eutherian evolution. , 2003, Genome research.

[52]  C. Freudenreich Chromosome fragility: molecular mechanisms and cellular consequences. , 2007, Frontiers in bioscience : a journal and virtual library.

[53]  M. Batzer,et al.  The impact of retrotransposons on human genome evolution , 2009, Nature Reviews Genetics.

[54]  D. Smeets,et al.  Common fragile sites in man and three closely related primate species. , 1990, Cytogenetics and cell genetics.

[55]  Telomere biology and DNA repair: enemies with benefits. , 2010 .

[56]  David M. Gilbert,et al.  Domain-wide regulation of DNA replication timing during mammalian development , 2009, Chromosome Research.

[57]  A. Helmrich,et al.  Common fragile sites are conserved features of human and mouse chromosomes and relate to large active genes. , 2006, Genome research.

[58]  W. Miller,et al.  Sequence conservation at human and mouse orthologous common fragile regions, FRA3B/FHIT and Fra14A2/Fhit , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[59]  E. Calhoun,et al.  The common fragile site FRA16D and its associated gene WWOX are highly conserved in the mouse at Fra8E1 , 2002, Genes, chromosomes & cancer.

[60]  Alain Arneodo,et al.  Replication-associated mutational asymmetry in the human genome. , 2011, Molecular biology and evolution.

[61]  Ichiro Hiratani,et al.  ReplicationDomain: a visualization tool and comparative database for genome-wide replication timing data , 2008, BMC Bioinformatics.

[62]  A. Bernheim,et al.  Initiation of the breakage-fusion-bridge mechanism through common fragile site activation in human breast cancer cells: the model of PIP gene duplication from a break at FRA7I. , 2002, Human molecular genetics.

[63]  F. Pelliccia,et al.  Characterization of FRA7B, a human common fragile site mapped at the 7p chromosome terminal region. , 2010, Cancer genetics and cytogenetics.

[64]  B. Kerem,et al.  Common fragile sites: G-band characteristics within an R-band. , 1999, American journal of human genetics.

[65]  Anton Nekrutenko,et al.  Integrating diverse databases into an unified analysis framework: a Galaxy approach , 2011, Database J. Biol. Databases Curation.

[66]  K. Eckert,et al.  DNA structure and the Werner protein modulate human DNA polymerase delta-dependent replication dynamics within the common fragile site FRA16D , 2009, Nucleic acids research.

[67]  K. Makova,et al.  A genome-wide view of mutation rate co-variation using multivariate analyses , 2011, Genome Biology.

[68]  G. Holmquist,et al.  Characterization of Giemsa dark- and light-band DNA , 1982, Cell.

[69]  R. Richards,et al.  Common chromosomal fragile site FRA16D sequence: identification of the FOR gene spanning FRA16D and homozygous deletions and translocation breakpoints in cancer cells. , 2000, Human molecular genetics.

[70]  I. Hickson,et al.  Replication stress induces sister-chromatid bridging at fragile site loci in mitosis , 2009, Nature Cell Biology.

[71]  Loretta Auvil,et al.  Breakpoint regions and homologous synteny blocks in chromosomes have different evolutionary histories. , 2009, Genome research.

[72]  P. Hanawalt,et al.  Preferential DNA repair of an active gene in human cells. , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[73]  L. Wessels,et al.  Domain organization of human chromosomes revealed by mapping of nuclear lamina interactions , 2008, Nature.

[74]  G Vergnaud,et al.  Minisatellites: mutability and genome architecture. , 2000, Genome research.

[75]  T. Graves,et al.  The male-specific region of the human Y chromosome is a mosaic of discrete sequence classes , 2003, Nature.

[76]  Laurent Duret,et al.  Genome-wide studies highlight indirect links between human replication origins and gene regulation , 2008, Proceedings of the National Academy of Sciences.

[77]  Levi C. T. Pierce,et al.  Over half of breakpoints in gene pairs involved in cancer-specific recurrent translocations are mapped to human chromosomal fragile sites , 2009, BMC Genomics.

[78]  M. L. Le Beau,et al.  Common fragile sites are characterized by histone hypoacetylation , 2009, Human molecular genetics.

[79]  M. Stanley,et al.  Characterization of naturally occurring HPV16 integration sites isolated from cervical keratinocytes under noncompetitive conditions. , 2008, Cancer research.

[80]  Stephen W. Scherer,et al.  Replication Delay along FRA7H, a Common Fragile Site on Human Chromosome 7, Leads to Chromosomal Instability , 2000, Molecular and Cellular Biology.

[81]  S. Warren,et al.  Replication stress induces tumor-like microdeletions in FHIT/FRA3B , 2008, Proceedings of the National Academy of Sciences.

[82]  K. Makova,et al.  A matter of life or death: how microsatellites emerge in and vanish from the human genome. , 2011, Genome research.

[83]  S. Gollin,et al.  Relationship between FRA11F and 11q13 gene amplification in oral cancer , 2007, Genes, chromosomes & cancer.

[84]  J. Yunis,et al.  Constitutive fragile sites and cancer. , 1984, Science.

[85]  M. Schwab,et al.  The FRA2C common fragile site maps to the borders of MYCN amplicons in neuroblastoma and is associated with gross chromosomal rearrangements in different cancers. , 2011, Human molecular genetics.

[86]  B. Kerem,et al.  Failure of origin activation in response to fork stalling leads to chromosomal instability at fragile sites. , 2011, Molecular cell.

[87]  J. Weber,et al.  Alu repeats: a source for the genesis of primate microsatellites. , 1995, Genomics.

[88]  T. Glover,et al.  Chromosome fragile sites. , 2007, Annual review of genetics.

[89]  J. Doles,et al.  Folate-sensitive and aphidicolin-inducible fragile sites are expressed in the genome of the domestic cat. , 1993, Cancer genetics and cytogenetics.

[90]  Michael O Dorschner,et al.  Sequencing newly replicated DNA reveals widespread plasticity in human replication timing , 2009, Proceedings of the National Academy of Sciences.

[91]  S. Scherer,et al.  Molecular Basis for Expression of Common and Rare Fragile Sites , 2003, Molecular and Cellular Biology.

[92]  P. Donnelly,et al.  A Fine-Scale Map of Recombination Rates and Hotspots Across the Human Genome , 2005, Science.

[93]  A. Travers,et al.  The structural basis of DNA flexibility , 2004, Philosophical Transactions of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[94]  U. Ligges Review of An R and S-PLUS companion to applied regression by J. Fox, Sage Publications, Thousand Oaks, California 2002 , 2003 .

[95]  D. Hancock,et al.  Animal model: Chromosomal fragile site expression in dogs: I. Breed specific differences , 1991 .

[96]  M. L. Le Beau,et al.  The role of late/slow replication of the FRA16D in common fragile site induction , 2004, Genes, chromosomes & cancer.

[97]  A. Bensimon,et al.  Replication dynamics at common fragile site FRA6E , 2010, Chromosoma.

[98]  L. Dillon,et al.  DNA Instability at Chromosomal Fragile Sites in Cancer , 2010, Current genomics.

[99]  M. L. Le Beau,et al.  Impaired replication dynamics at the FRA3B common fragile site. , 2010, Human molecular genetics.

[100]  Uwe Claussen,et al.  Global screening and extended nomenclature for 230 aphidicolin-inducible fragile sites, including 61 yet unreported ones. , 2010, International journal of oncology.

[101]  G. Holmquist,et al.  Chromosome bands, their chromatin flavors, and their functional features. , 1992, American journal of human genetics.