Genome-wide identification of DNaseI hypersensitive sites using active chromatin sequence libraries.

Comprehensive identification of sequences that regulate transcription is one of the major goals of genome biology. Focal alteration in chromatin structure in vivo, detectable through hypersensitivity to DNaseI and other nucleases, is the sine qua non of a diverse cast of transcriptional regulatory elements including enhancers, promoters, insulators, and locus control regions. We developed an approach for genome-scale identification of DNaseI hypersensitive sites (HSs) via isolation and cloning of in vivo DNaseI cleavage sites to create libraries of active chromatin sequences (ACSs). Here, we describe analysis of >61,000 ACSs derived from erythroid cells. We observed peaks in the density of ACSs at the transcriptional start sites of known genes at non-gene-associated CpG islands, and, to a lesser degree, at evolutionarily conserved noncoding sequences. Peaks in ACS density paralleled the distribution of DNaseI HSs. ACSs and DNaseI HSs were distributed between both expressed and nonexpressed genes, suggesting that a large proportion of genes reside within open chromatin domains. The results permit a quantitative approximation of the distribution of HSs and classical cis-regulatory sequences in the human genome.

[1]  Carl Wu The 5′ ends of Drosophila heat shock genes in chromatin are hypersensitive to DNase I , 1980, Nature.

[2]  M. Groudine,et al.  Human fetal to adult hemoglobin switching: changes in chromatin structure of the beta-globin gene locus. , 1983, Proceedings of the National Academy of Sciences of the United States of America.

[3]  S. Elgin Molecular biology: Anatomy of hypersensitive sites , 1984, Nature.

[4]  D. Tuan,et al.  The "beta-like-globin" gene domain in human erythroid cells. , 1985, Proceedings of the National Academy of Sciences of the United States of America.

[5]  V. Rotter,et al.  Human p53 oncogene contains one promoter upstream of exon 1 and a second, stronger promoter within intron 1. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[6]  D. S. Gross,et al.  Nuclease hypersensitive sites in chromatin. , 1988, Annual review of biochemistry.

[7]  B. Aronow,et al.  Evidence for a complex regulatory array in the first intron of the human adenosine deaminase gene. , 1989, Genes & development.

[8]  P. Rousseeuw,et al.  Unmasking Multivariate Outliers and Leverage Points , 1990 .

[9]  A. Jarman,et al.  A major positive regulatory region located far upstream of the human alpha-globin gene locus. , 1990, Genes & development.

[10]  AC Tose Cell , 1993, Cell.

[11]  M. Reitman,et al.  An enhancer/locus control region is not sufficient to open chromatin , 1993, Molecular and cellular biology.

[12]  J. Baron,et al.  The Molecular Basis of Blood Diseases , 1994 .

[13]  L. Wodicka,et al.  Genome-wide expression monitoring in Saccharomyces cerevisiae , 1997, Nature Biotechnology.

[14]  H. Hurst,et al.  An intron 1 enhancer element mediates oestrogen-induced suppression of ERBB2 expression , 1997, Oncogene.

[15]  F. Grosveld,et al.  Locus control regions, chromatin activation and transcription. , 1998, Current opinion in cell biology.

[16]  D. Tenen,et al.  Multiple control elements are required for expression of the human CD34 gene , 1998 .

[17]  Y. Tone,et al.  High Level Transcription of the Complement Regulatory Protein CD59 Requires an Enhancer Located in Intron 1* , 1999, The Journal of Biological Chemistry.

[18]  Michael Q. Zhang,et al.  Computational identification of promoters and first exons in the human genome , 2001, Nature Genetics.

[19]  G. Stamatoyannopoulos,et al.  Quantification of DNaseI-sensitivity by real-time PCR: quantitative analysis of DNaseI-hypersensitivity of the mouse beta-globin LCR. , 2001, Journal of molecular biology.

[20]  R. Rowntree,et al.  An element in intron 1 of the CFTR gene augments intestinal expression in vivo. , 2001, Human molecular genetics.

[21]  Peter W. Markstein,et al.  Genome-wide analysis of clustered Dorsal binding sites identifies putative target genes in the Drosophila embryo , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[22]  Timothy B. Stockwell,et al.  The Sequence of the Human Genome , 2001, Science.

[23]  C. Denton,et al.  Characterization of an Evolutionarily Conserved Far-upstream Enhancer in the Human α2(I) Collagen (COL1A2) Gene* , 2001, The Journal of Biological Chemistry.

[24]  L. Pennacchio,et al.  Genomic strategies to identify mammalian regulatory sequences , 2001, Nature Reviews Genetics.

[25]  C. Epstein,et al.  Numerical methods for handling uncertainty in microarray data: an example analyzing perturbed mitochondrial function in yeast. , 2001, Methods in cell biology.

[26]  J. V. Moran,et al.  Initial sequencing and analysis of the human genome. , 2001, Nature.

[27]  G. Rubin,et al.  Exploiting transcription factor binding site clustering to identify cis-regulatory modules involved in pattern formation in the Drosophila genome , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[28]  Michael Levine,et al.  Whole-Genome Analysis of Dorsal-Ventral Patterning in the Drosophila Embryo , 2002, Cell.

[29]  Michael Litt,et al.  The insulation of genes from external enhancers and silencing chromatin , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[30]  M. Adams,et al.  Recent Segmental Duplications in the Human Genome , 2002, Science.

[31]  R. van Driel,et al.  The eukaryotic genome: a system regulated at different hierarchical levels , 2003, Journal of Cell Science.

[32]  M. Groudine,et al.  Controlling the double helix , 2003, Nature.

[33]  L. Pachter,et al.  SLAM: cross-species gene finding and alignment with a generalized pair hidden Markov model. , 2003, Genome research.

[34]  E. Birney,et al.  Comparative genomics: genome-wide analysis in metazoan eukaryotes , 2003, Nature Reviews Genetics.

[35]  J. Kawai,et al.  Cap analysis gene expression for high-throughput analysis of transcriptional starting point and identification of promoter usage , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[36]  Denis Duboule,et al.  A Global Control Region Defines a Chromosomal Regulatory Landscape Containing the HoxD Cluster , 2003, Cell.

[37]  C. Glover,et al.  Gene expression profiling for hematopoietic cell culture , 2006 .