Genomics of long-range regulatory elements.

Transcriptional regulation of gene expression plays a significant role in establishing the diversity of human cell types and biological functions from a common set of genes. The components of regulatory control in the human genome include cis-acting elements that act across immense genomic distances to influence the spatial and temporal distribution of gene expression. Here we review the established categories of distant-acting regulatory elements, discussing the classical and contemporary evidence of their regulatory potential and clinical importance. Current efforts to identify regulatory sequences throughout the genome and elucidate their biological significance depend heavily on advances in sequence conservation-based analyses and on increasingly large-scale efforts applying transgenic technologies in model organisms. We discuss the advantages and limitations of sequence conservation as a predictor of regulatory function and present complementary emerging technologies now being applied to annotate regulatory elements in vertebrate genomes.
