GeSICA: Genome segmentation from intra-chromosomal associations

BackgroundVarious aspects of genome organization have been explored based on data from distinct technologies, including histone modification ChIP-Seq, 3C, and its derivatives. Recently developed Hi-C techniques enable the genome wide mapping of DNA interactomes, thereby providing the opportunity to study genome organization in detail, but these methods also pose challenges in methodology development.ResultsWe developed Genome Segmentation from Intra Chromosomal Associations, or GeSICA, to explore genome organization and applied the method to Hi-C data in human GM06990 and K562 cells. GeSICA calculates a simple logged ratio to efficiently segment the human genome into regions with two distinct states that correspond to rich and poor functional element states. Inside the rich regions, Markov Clustering was subsequently applied to segregate the regions into more detailed clusters. The binding sites of the insulator, cohesion, and transcription complexes are enriched in the boundaries between neighboring clusters, indicating that inferred clusters may have fine organizational features.ConclusionsOur study presents a novel analysis method, known as GeSICA, which gives insight into genome organization based on Hi-C data. GeSICA is open source and freely available at: http://web.tongji.edu.cn/~zhanglab/GeSICA/

[1]  Gordon K Smyth,et al.  Statistical Applications in Genetics and Molecular Biology Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments , 2011 .

[2]  Thomas Cremer,et al.  Chromosome territories--a functional nuclear landscape. , 2006, Current opinion in cell biology.

[3]  K. Pfeifer,et al.  Shifting insulator boundaries , 2004, Nature Genetics.

[4]  L. Schermelleh,et al.  Functional nuclear organization of transcription and DNA replication: a topographical marriage between chromatin domains and the interchromatin compartment. , 2010, Cold Spring Harbor symposia on quantitative biology.

[5]  Bing Li,et al.  The Role of Chromatin during Transcription , 2007, Cell.

[6]  E. Levanon,et al.  Human housekeeping genes are compact. , 2003, Trends in genetics : TIG.

[7]  Matteo Pellegrini,et al.  Expression profile of CREB knockdown in myeloid leukemia cells , 2008, BMC Cancer.

[8]  J. Burger,et al.  The CXCR4 chemokine receptor in acute and chronic leukaemia: a marrow homing receptor and potential therapeutic target , 2007, British journal of haematology.

[9]  A. Tanay,et al.  Probabilistic modeling of Hi-C contact maps eliminates systematic biases to characterize global chromosomal architecture , 2011, Nature Genetics.

[10]  A. West,et al.  Conserved CTCF Insulator Elements Flank the Mouse and Human β-Globin Loci , 2002, Molecular and Cellular Biology.

[11]  Anton J. Enright,et al.  An efficient algorithm for large-scale detection of protein families. , 2002, Nucleic acids research.

[12]  Guillaume J. Filion,et al.  Systematic Protein Location Mapping Reveals Five Principal Chromatin Types in Drosophila Cells , 2010, Cell.

[13]  S. Dongen Graph clustering by flow simulation , 2000 .

[14]  Hongyu Zhao,et al.  Protein–DNA interaction mapping using genomic tiling path microarrays in Drosophila , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[15]  David J. Reiss,et al.  CTCF physically links cohesin to chromatin , 2008, Proceedings of the National Academy of Sciences.

[16]  David Haussler,et al.  The UCSC genome browser database: update 2007 , 2006, Nucleic Acids Res..

[17]  Wouter de Laat,et al.  The β-globin nuclear compartment in development and erythroid differentiation , 2003, Nature Genetics.

[18]  Nathaniel D. Heintzman,et al.  Histone modifications at human enhancers reflect global cell-type-specific gene expression , 2009, Nature.

[19]  V. Corces,et al.  CTCF: Master Weaver of the Genome , 2009, Cell.

[20]  I. Amit,et al.  Comprehensive mapping of long range interactions reveals folding principles of the human genome , 2011 .

[21]  Timothy J. Durham,et al.  "Systematic" , 1966, Comput. J..

[22]  Petra C. Schwalie,et al.  A CTCF-independent role for cohesin in tissue-specific transcription. , 2010, Genome research.

[23]  Amos Tanay,et al.  Comparative Analysis of DNA Replication Timing Reveals Conserved Large-Scale Chromosomal Architecture , 2010, PLoS genetics.

[24]  Mary Goldman,et al.  The UCSC Genome Browser database: update 2011 , 2010, Nucleic Acids Res..

[25]  P. Fraser Transcriptional control thrown for a loop. , 2006, Current opinion in genetics & development.

[26]  E. J. Kuhn,et al.  Genomic insulators: connecting properties to mechanism. , 2003, Current opinion in cell biology.

[27]  David Haussler,et al.  ENCODE whole-genome data in the UCSC genome browser (2011 update) , 2010, Nucleic Acids Res..

[28]  P. Cook The organization of replication and transcription. , 1999, Science.

[29]  Marjorie Fournier,et al.  The ATAC acetyl transferase complex controls mitotic progression by targeting non‐histone substrates , 2010, The EMBO journal.

[30]  Dustin E. Schones,et al.  Global analysis of the insulator binding protein CTCF in chromatin barrier regions reveals demarcation of active and repressive domains. , 2008, Genome research.

[31]  T. Cheutin,et al.  Chromatin folding: from linear chromosomes to the 4D nucleus. , 2010, Cold Spring Harbor symposia on quantitative biology.

[32]  Jessica Cande,et al.  Stalled Hox promoters as chromosomal boundaries. , 2009, Genes & development.

[33]  H. Aburatani,et al.  Cohesin mediates transcriptional insulation by CCCTC-binding factor , 2008, Nature.

[34]  David A. Orlando,et al.  Mediator and Cohesin Connect Gene Expression and Chromatin Architecture , 2010, Nature.

[35]  S. Razin,et al.  DNA interactions with the nuclear matrix and spatial organization of replication and transcription , 1987, BioEssays : news and reviews in molecular, cellular and developmental biology.

[36]  J. Dekker,et al.  Capturing Chromosome Conformation , 2002, Science.

[37]  R. van Driel,et al.  The eukaryotic genome: a system regulated at different hierarchical levels , 2003, Journal of Cell Science.

[38]  J. Juarez,et al.  Effects of inhibitors of the chemokine receptor CXCR4 on acute lymphoblastic leukemia cells in vitro , 2003, Leukemia.

[39]  A. West,et al.  Conserved CTCF insulator elements flank the mouse and human beta-globin loci. , 2002, Molecular and cellular biology.

[40]  Leighton J. Core,et al.  Paused Pol II captures enhancer activity and acts as a potent insulator. , 2009, Genes & development.

[41]  F. Grosveld,et al.  The beta-globin nuclear compartment in development and erythroid differentiation. , 2003, Nature genetics.

[42]  Y. Nakazawa,et al.  Significance of chemokine receptor expression in aggressive NK cell leukemia , 2005, Leukemia.

[43]  Ian X. Y. Leung,et al.  Intra- and inter-chromosomal interactions correlate with CTCF binding genome wide , 2010, Molecular systems biology.

[44]  Bing Ren,et al.  Discovery and Annotation of Functional Chromatin Signatures in the Human Genome , 2009, PLoS Comput. Biol..

[45]  Nicholas J. Schork,et al.  CCCTC-binding factor (CTCF) and cohesin influence the genomic architecture of the Igh locus and antisense transcription in pro-B cells , 2011, Proceedings of the National Academy of Sciences.

[46]  T. Misteli Beyond the Sequence: Cellular Organization of Genome Function , 2011 .