Distinct and predictive chromatin signatures of transcriptional promoters and enhancers in the human genome

Eukaryotic gene transcription is accompanied by acetylation and methylation of nucleosomes near promoters, but the locations and roles of histone modifications elsewhere in the genome remain unclear. We determined the chromatin modification states in high resolution along 30 Mb of the human genome and found that active promoters are marked by trimethylation of Lys4 of histone H3 (H3K4), whereas enhancers are marked by monomethylation, but not trimethylation, of H3K4. We developed computational algorithms using these distinct chromatin signatures to identify new regulatory elements, predicting over 200 promoters and 400 enhancers within the 30-Mb region. This approach accurately predicted the location and function of independently identified regulatory elements with high sensitivity and specificity and uncovered a novel functional enhancer for the carnitine transporter SLC22A5 (OCTN2). Our results give insight into the connections between chromatin modifications and transcriptional regulatory activity and provide a new tool for the functional annotation of the human genome.

[1]  G. Felsenfeld,et al.  Chromatin Unfolds , 1996, Cell.

[2]  C. Glass,et al.  Nuclear integration of JAK/STAT and Ras/AP-1 signaling by CBP and p300. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[3]  D. Gründemann,et al.  Molecular cloning and characterization of two novel transport proteins from rat kidney , 1998, FEBS letters.

[4]  J. T. Kadonaga,et al.  Going the distance: a current view of enhancer action. , 1998, Science.

[5]  V. Ganapathy,et al.  cDNA sequence, transport function, and genomic organization of human OCTN2, a new member of the organic cation transporter family. , 1998, Biochemical and biophysical research communications.

[6]  K. Harada,et al.  Evidence for linkage of human primary systemic carnitine deficiency with D5S436: a novel gene locus on chromosome 5q. , 1998, American journal of human genetics.

[7]  J. Nezu,et al.  Molecular and Functional Identification of Sodium Ion-dependent, High Affinity Human Carnitine Transporter OCTN2* , 1998, The Journal of Biological Chemistry.

[8]  H. Kusuhara,et al.  Molecular cloning and characterization of high-affinity carnitine transporter from rat intestine. , 1998, Biochemical and Biophysical Research Communications - BBRC.

[9]  N. Longo,et al.  Mutations in the organic cation/carnitine transporter OCTN2 in primary carnitine deficiency. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[10]  G. Tsujimoto,et al.  Primary systemic carnitine deficiency is caused by mutations in a gene encoding sodium ion-dependent carnitine transporter , 1999, Nature Genetics.

[11]  M. Groudine,et al.  Looping versus linking: toward a model for long-distance gene activation. , 1999, Genes & development.

[12]  R. Tjian,et al.  Orchestrated response: a symphony of transcription factors for gene control. , 2000, Genes & development.

[13]  C. Allis,et al.  The language of covalent histone modifications , 2000, Nature.

[14]  Xiangdong Fang,et al.  Locus control regions. , 2002, Blood.

[15]  I. Talianidis,et al.  Dynamics of enhancer-promoter communication during differentiation-induced gene activation. , 2002, Molecular cell.

[16]  G. Orphanides,et al.  A Unified Theory of Gene Expression , 2002, Cell.

[17]  J. T. Kadonaga,et al.  The RNA polymerase II core promoter. , 2003, Annual review of biochemistry.

[18]  Paul T. Groth,et al.  The ENCODE (ENCyclopedia Of DNA Elements) Project , 2004, Science.

[19]  C. Stanley Carnitine Deficiency Disorders in Children , 2004, Annals of the New York Academy of Sciences.

[20]  I. Talianidis,et al.  Histone modifications defining active genes persist after transcriptional and mitotic inactivation , 2005, The EMBO journal.

[21]  Eric S. Lander,et al.  Genomic Maps and Comparative Analysis of Histone Modifications in Human and Mouse , 2005, Cell.

[22]  D. Reinberg,et al.  The key to development: interpreting the histone code? , 2005, Current opinion in genetics & development.

[23]  Leah Barrera,et al.  A high-resolution map of active promoters in the human genome , 2005, Nature.

[24]  Keji Zhao,et al.  Active chromatin domains are defined by acetylation islands revealed by genome-wide mapping. , 2005, Genes & development.

[25]  N. Friedman,et al.  Single-Nucleosome Mapping of Histone Modifications in S. cerevisiae , 2005, PLoS biology.

[26]  Tatiana A. Tatusova,et al.  NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins , 2004, Nucleic Acids Res..

[27]  Megan F. Cole,et al.  Genome-wide Map of Nucleosome Acetylation and Methylation in Yeast , 2005, Cell.

[28]  Myles A Brown,et al.  Spatial and temporal recruitment of androgen receptor and its coactivators involves chromosomal looping and polymerase tracking. , 2005, Molecular cell.

[29]  Karl P Nightingale,et al.  Histone modifications: signalling receptors and potential elements of a heritable epigenetic code. , 2006, Current opinion in genetics & development.

[30]  Leah Barrera,et al.  The transcriptional regulatory code of eukaryotic cells--insights from genome-wide analysis of chromatin organization and transcription factor binding. , 2006, Current opinion in cell biology.

[31]  T. Wolfsberg,et al.  DNase-chip: a high-resolution method to identify DNase I hypersensitive sites using tiled microarrays , 2006, Nature Methods.

[32]  F. Robert,et al.  Genome-wide computational prediction of transcriptional regulatory modules reveals new insights into human gene expression , 2006 .

[33]  J. Harrow,et al.  GENCODE: producing a reference annotation for ENCODE , 2006, Genome Biology.

[34]  Martin S. Taylor,et al.  Genome-wide analysis of mammalian promoter architecture and evolution , 2006, Nature Genetics.

[35]  Tae Hoon Kim,et al.  Genome-wide analysis of protein-DNA interactions. , 2006, Annual review of genomics and human genetics.

[36]  Tatiana Tatusova,et al.  NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins , 2004, Nucleic Acids Res..