The Properties of Genome Conformation and Spatial Gene Interaction and Regulation Networks of Normal and Malignant Human Cell Types

The spatial conformation of a genome plays an important role in the long-range regulation of genome-wide gene expression and methylation, but has not been extensively studied due to lack of genome conformation data. The recently developed chromosome conformation capturing techniques such as the Hi-C method empowered by next generation sequencing can generate unbiased, large-scale, high-resolution chromosomal interaction (contact) data, providing an unprecedented opportunity to investigate the spatial structure of a genome and its applications in gene regulation, genomics, epigenetics, and cell biology. In this work, we conducted a comprehensive, large-scale computational analysis of this new stream of genome conformation data generated for three different human leukemia cells or cell lines by the Hi-C technique. We developed and applied a set of bioinformatics methods to reliably generate spatial chromosomal contacts from high-throughput sequencing data and to effectively use them to study the properties of the genome structures in one-dimension (1D) and two-dimension (2D). Our analysis demonstrates that Hi-C data can be effectively applied to study tissue-specific genome conformation, chromosome-chromosome interaction, chromosomal translocations, and spatial gene-gene interaction and regulation in a three-dimensional genome of primary tumor cells. Particularly, for the first time, we constructed genome-scale spatial gene-gene interaction network, transcription factor binding site (TFBS) – TFBS interaction network, and TFBS-gene interaction network from chromosomal contact information. Remarkably, all these networks possess the properties of scale-free modular networks.

[1]  Gary Stacey,et al.  A Protein Domain Co-Occurrence Network Approach for Predicting Protein Function and Inferring Species Phylogeny , 2011, PloS one.

[2]  William Stafford Noble,et al.  Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project , 2007, Nature.

[3]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[4]  W. D. Laat,et al.  An evaluation of 3C-based methods to capture DNA interactions , 2007, Nature Methods.

[5]  Mathieu Blanchette,et al.  The three-dimensional architecture of Hox cluster silencing , 2010, Nucleic acids research.

[6]  David Haussler,et al.  The UCSC Genome Browser database: update 2010 , 2009, Nucleic Acids Res..

[7]  C. Nusbaum,et al.  Chromosome Conformation Capture Carbon Copy (5C): a massively parallel solution for mapping interactions between genomic elements. , 2006, Genome research.

[8]  Craig L. Peterson,et al.  Chromatin Higher Order Folding--Wrapping up Transcription , 2002, Science.

[9]  Yunqian Ma,et al.  Practical selection of SVM parameters and noise estimation for SVM regression , 2004, Neural Networks.

[10]  Tom Misteli,et al.  Cell biology: Chromosome territories , 2007, Nature.

[11]  J. Dekker,et al.  Capturing Chromosome Conformation , 2002, Science.

[12]  S. Lovell,et al.  Protein-protein interaction networks and biology—what's the connection? , 2008, Nature Biotechnology.

[13]  Albert-László Barabási,et al.  Error and attack tolerance of complex networks , 2000, Nature.

[14]  J. Gall,et al.  Formation and detection of RNA-DNA hybrid molecules in cytological preparations. , 1969, Proceedings of the National Academy of Sciences of the United States of America.

[15]  William Stafford Noble,et al.  A Three-Dimensional Model of the Yeast Genome , 2010, Nature.

[16]  Mario Cannataro,et al.  Semantic similarity analysis of protein data: assessment with biological features and issues , 2012, Briefings Bioinform..

[17]  Thomas A. Hopf,et al.  Protein 3D Structure Computed from Evolutionary Sequence Variation , 2011, PloS one.

[18]  Lukas N. Mueller,et al.  An integrated mass spectrometric and computational framework for the analysis of protein interaction networks , 2007, Nature Biotechnology.

[19]  B. Schwikowski,et al.  A network of protein–protein interactions in yeast , 2000, Nature Biotechnology.

[20]  A. Tanay,et al.  Probabilistic modeling of Hi-C contact maps eliminates systematic biases to characterize global chromosomal architecture , 2011, Nature Genetics.

[21]  Ting Wang,et al.  The UCSC Genome Browser Database: update 2009 , 2008, Nucleic Acids Res..

[22]  I. Amit,et al.  Comprehensive mapping of long range interactions reveals folding principles of the human genome , 2011 .

[23]  Ana Pombo,et al.  Chromosome organization: new facts, new models. , 2007, Trends in cell biology.

[24]  K. Sandhu,et al.  Circular chromosome conformation capture (4C) uncovers extensive networks of epigenetically regulated intra- and interchromosomal interactions , 2006, Nature Genetics.

[25]  T. Cremer,et al.  Chromosome territories, nuclear architecture and gene regulation in mammalian cells , 2001, Nature Reviews Genetics.

[26]  J. Hansen,et al.  Conformational dynamics of the chromatin fiber in solution: determinants, mechanisms, and functions. , 2002, Annual review of biophysics and biomolecular structure.

[27]  M. Martí-Renom,et al.  Chromatin globules: a common motif of higher order chromosome structure? , 2011, Current opinion in cell biology.

[28]  K. Welte,et al.  Absence of G‐CSF receptors and absent response to G‐CSF in childhood Burkitt's lymphoma and B‐ALL cells , 1995, British journal of haematology.

[29]  Romain Koszul,et al.  Normalization of a chromosomal contact map , 2012, BMC Genomics.

[30]  Yijun Ruan,et al.  Mapping of transcription factor binding regions in mammalian cells by ChIP: comparison of array- and sequencing-based technologies. , 2007, Genome research.

[31]  Thomas A. Hopf,et al.  Three-Dimensional Structures of Membrane Proteins from Genomic Sequencing , 2012, Cell.

[32]  Mary Goldman,et al.  The UCSC Genome Browser database: update 2011 , 2010, Nucleic Acids Res..

[33]  D. Heermann,et al.  Spatially confined folding of chromatin in the interphase nucleus , 2009, Proceedings of the National Academy of Sciences.

[34]  Richard Bonneau Learning biological networks: from modules to dynamics. , 2008, Nature chemical biology.