Massively parallel cis-regulatory analysis in the mammalian central nervous system

Cis-regulatory elements (CREs, e.g., promoters and enhancers) regulate gene expression, and variants within CREs can modulate disease risk. Next-generation sequencing has enabled the rapid generation of genomic data that predict the locations of CREs, but a bottleneck lies in functionally interpreting these data. To address this issue, massively parallel reporter assays (MPRAs) have emerged, in which barcoded reporter libraries are introduced into cells, and the resulting barcoded transcripts are quantified by next-generation sequencing. Thus far, MPRAs have been largely restricted to assaying short CREs in a limited repertoire of cultured cell types. Here, we present two advances that extend the biological relevance and applicability of MPRAs. First, we adapt exome capture technology to instead capture candidate CREs, thereby tiling across the targeted regions and markedly increasing the length of CREs that can be readily assayed. Second, we package the library into adeno-associated virus (AAV), thereby allowing delivery to target organs in vivo. As a proof of concept, we introduce a capture library of about 46,000 constructs, corresponding to roughly 3500 DNase I hypersensitive (DHS) sites, into the mouse retina by ex vivo plasmid electroporation and into the mouse cerebral cortex by in vivo AAV injection. We demonstrate tissue-specific cis-regulatory activity of DHSs and provide examples of high-resolution truncation mutation analysis for multiplex parsing of CREs. Our approach should enable massively parallel functional analysis of a wide range of CREs in any organ or species that can be infected by AAV, such as nonhuman primates and human stem cell-derived organoids.

[1]  Xiaochen Bo,et al.  Genome-wide analysis of the relationships between DNaseI HS, histone modifications and gene expression reveals distinct modes of chromatin domains , 2011, Nucleic acids research.

[2]  F. J. Livesey,et al.  Vertebrate neural cell-fate determination: Lessons from the retina , 2001, Nature Reviews Neuroscience.

[3]  J. Corbo,et al.  Quantifying the activity of cis-regulatory elements in the mouse retina by explant electroporation. , 2013, Methods in molecular biology.

[4]  G. Lyons,et al.  Mef2 gene expression marks the cardiac and skeletal muscle lineages during mouse embryogenesis. , 1994, Development.

[5]  Shane J. Neph,et al.  A comparative encyclopedia of DNA elements in the mouse genome , 2014, Nature.

[6]  C. von Kalle,et al.  Adeno-Associated Virus Vector Genomes Persist as Episomal Chromatin in Primate Muscle , 2008, Journal of Virology.

[7]  G. Wray The evolutionary significance of cis-regulatory mutations , 2007, Nature Reviews Genetics.

[8]  C. Glass,et al.  Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. , 2010, Molecular cell.

[9]  David Haussler,et al.  The UCSC Genome Browser database: 2014 update , 2013, Nucleic Acids Res..

[10]  J. Corbo,et al.  The Cis-regulatory Logic of the Mammalian Photoreceptor Transcriptional Network , 2007, PloS one.

[11]  Hani Z. Girgis,et al.  A High-Resolution Enhancer Atlas of the Developing Telencephalon , 2013, Cell.

[12]  Jay Shendure,et al.  High-resolution analysis of DNA regulatory elements by synthetic saturation mutagenesis , 2009, Nature Biotechnology.

[13]  Martina Rath,et al.  Enhancer–core-promoter specificity separates developmental and housekeeping gene regulation , 2014, Nature.

[14]  J. Engelhardt,et al.  Inverted Terminal Repeat Sequences Are Important for Intermolecular Recombination and Circularization of Adeno-Associated Virus Genomes , 2005, Journal of Virology.

[15]  S. Sealfon,et al.  Validated Genomic Approach to Study Differentially Expressed Genes in Complex Tissues , 2002, Neurochemical Research.

[16]  B. Hendrich,et al.  Transcriptional repressors: multifaceted regulators of gene expression , 2013, Development.

[17]  Kenneth I. Berns,et al.  Gene Therapy Using Adeno-Associated Virus Vectors , 2008, Clinical Microbiology Reviews.

[18]  Aaron R. Quinlan,et al.  BIOINFORMATICS APPLICATIONS NOTE , 2022 .

[19]  A. Baiker,et al.  Universal Real-Time PCR for the Detection and Quantification of Adeno-Associated Virus Serotype 2-Derived Inverted Terminal Repeat Sequences , 2011 .

[20]  P. Courtoy,et al.  Control of liver cell fate decision by a gradient of TGF beta signaling modulated by Onecut transcription factors. , 2005, Genes & development.

[21]  C. Mussolino,et al.  Novel Adeno-Associated Virus Serotypes Efficiently Transduce Murine Photoreceptors , 2007, Journal of Virology.

[22]  S. Nelson,et al.  Improving the efficiency of genomic loci capture using oligonucleotide arrays for high throughput resequencing , 2009, BMC Genomics.

[23]  Thomas Lengauer,et al.  ROCR: visualizing classifier performance in R , 2005, Bioinform..

[24]  S. Nelson,et al.  Cell Type-Specific Transcriptomics in the Brain , 2011, The Journal of Neuroscience.

[25]  R. J. Mullen,et al.  NeuN, a neuronal specific nuclear protein in vertebrates. , 1992, Development.

[26]  D. Zack,et al.  Ret 4, a Positive Acting Rhodopsin Regulatory Element Identified Using a Bovine Retina in Vitro Transcription System* , 1996, The Journal of Biological Chemistry.

[27]  Hiroaki Kitano,et al.  The PANTHER database of protein families, subfamilies, functions and pathways , 2004, Nucleic Acids Res..

[28]  I. Maclachlan,et al.  Cationic lipid-mediated transfection of cells in culture requires mitotic activity , 1999, Gene Therapy.

[29]  B. L,et al.  The accessible chromatin landscape of the human genome , 2016 .

[30]  J. Rabinowitz,et al.  Analysis of AAV serotypes 1-9 mediated gene expression and tropism in mice after systemic injection. , 2008, Molecular therapy : the journal of the American Society of Gene Therapy.

[31]  Morgan C. Giddings,et al.  Defining functional DNA elements in the human genome , 2014, Proceedings of the National Academy of Sciences.

[32]  G. Natoli,et al.  Latent Enhancers Activated by Stimulation in Differentiated Cells , 2013, Cell.

[33]  ENCODEConsortium,et al.  An Integrated Encyclopedia of DNA Elements in the Human Genome , 2012, Nature.

[34]  Mineo Kondo,et al.  Nrl is required for rod photoreceptor development , 2001, Nature Genetics.

[35]  Nathan C. Sheffield,et al.  The accessible chromatin landscape of the human genome , 2012, Nature.

[36]  V. Beneš,et al.  CRX ChIP-seq reveals the cis-regulatory architecture of mouse photoreceptors. , 2010, Genome research.

[37]  A. Swaroop,et al.  Combinatorial Regulation of Photoreceptor Differentiation Factor, Neural Retina Leucine Zipper Gene Nrl, Revealed by in Vivo Promoter Analysis* , 2011, The Journal of Biological Chemistry.

[38]  J. Flannery,et al.  Enhanced gene delivery to the neonatal retina through systemic administration of tyrosine-mutated AAV9 , 2011, Gene Therapy.

[39]  Steven L Salzberg,et al.  Fast gapped-read alignment with Bowtie 2 , 2012, Nature Methods.

[40]  Donald J Zack,et al.  Crx, a Novel Otx-like Paired-Homeodomain Protein, Binds to and Transactivates Photoreceptor Cell-Specific Genes , 1997, Neuron.

[41]  DNase I hypersensitivity analysis of the mouse brain and retina identifies region-specific regulatory elements , 2015, Epigenetics & Chromatin.

[42]  K. High,et al.  Therapeutic in vivo gene transfer for genetic disease using AAV: progress and challenges , 2011, Nature Reviews Genetics.

[43]  H. Herweijer,et al.  Gene therapy progress and prospects: Hydrodynamic gene delivery , 2007, Gene Therapy.

[44]  Joseph B Hiatt,et al.  Massively parallel functional dissection of mammalian enhancers in vivo , 2012, Nature Biotechnology.

[45]  J. Maguire,et al.  Solution Hybrid Selection with Ultra-long Oligonucleotides for Massively Parallel Targeted Sequencing , 2009, Nature Biotechnology.

[46]  Berthold Göttgens,et al.  Function-based Identification of Mammalian Enhancers Using Site-Specific Integration , 2014, Nature Methods.

[47]  D. Mccarty Self-complementary AAV vectors; advances and applications. , 2008, Molecular therapy : the journal of the American Society of Gene Therapy.

[48]  R. B. Azevedo,et al.  On the Immortality of Television Sets: “Function” in the Human Genome According to the Evolution-Free Gospel of ENCODE , 2013, Genome biology and evolution.

[49]  Clifford A. Meyer,et al.  Model-based Analysis of ChIP-Seq (MACS) , 2008, Genome Biology.

[50]  R. Samulski,et al.  Integration of adeno-associated virus (AAV) and recombinant AAV vectors. , 2004, Annual review of genetics.

[51]  Stephen W Scherer,et al.  Cone-Rod Dystrophy Due to Mutations in a Novel Photoreceptor-Specific Homeobox Gene ( CRX ) Essential for Maintenance of the Photoreceptor , 1997, Cell.

[52]  Timothy R. Hughes,et al.  G+C content dominates intrinsic nucleosome occupancy , 2009, BMC Bioinformatics.

[53]  Charles Blatti,et al.  Integrating motif, DNA accessibility and gene expression data to build regulatory maps in an organism , 2015, Nucleic acids research.

[54]  A. Stark,et al.  Transcriptional enhancers: from properties to genome-wide predictions , 2014, Nature Reviews Genetics.

[55]  Howard Y. Chang,et al.  Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position , 2013, Nature Methods.

[56]  Patrick J. Paddison,et al.  Production of complex nucleic acid libraries using highly parallel in situ oligonucleotide synthesis , 2004, Nature Methods.

[57]  Chris Mungall,et al.  AmiGO: online access to ontology and annotation data , 2008, Bioinform..

[58]  A. Visel,et al.  Rapid and Pervasive Changes in Genome-wide Enhancer Usage during Mammalian Development , 2013, Cell.

[59]  M. Becker‐André,et al.  A novel isoform of the orphan nuclear receptor RORbeta is specifically expressed in pineal gland and retina. , 1998, Gene.

[60]  B. Arenkiel,et al.  A rapid approach to high-resolution fluorescence imaging in semi-thick brain slices. , 2011, Journal of visualized experiments : JoVE.

[61]  William Stafford Noble,et al.  Sequence features and chromatin structure around the genomic regions bound by 119 human transcription factors , 2012, Genome research.

[62]  E. Segal,et al.  In pursuit of design principles of regulatory sequences , 2014, Nature Reviews Genetics.

[63]  D. Haussler,et al.  Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. , 2005, Genome research.

[64]  J. Lee,et al.  Quantitative fine-tuning of photoreceptor cis-regulatory elements through affinity modulation of transcription factor binding sites , 2010, Gene Therapy.

[65]  H. Bussemaker,et al.  In search of the determinants of enhancer-promoter interaction specificity. , 2014, Trends in cell biology.

[66]  P. J. van der Zaag,et al.  Targeted enrichment of genomic DNA regions for next-generation sequencing , 2011, Briefings in functional genomics.

[67]  Cory Y. McLean,et al.  GREAT improves functional interpretation of cis-regulatory regions , 2010, Nature Biotechnology.

[68]  Łukasz M. Boryń,et al.  Genome-Wide Quantitative Enhancer Activity Maps Identified by STARR-seq , 2013, Science.

[69]  Chris M Rands,et al.  8.2% of the Human Genome Is Constrained: Variation in Rates of Turnover across Functional Element Classes in the Human Lineage , 2014, PLoS genetics.

[70]  Madeline A. Lancaster,et al.  Cerebral organoids model human brain development and microcephaly , 2013, Nature.

[71]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[72]  J. Corbo,et al.  Transcriptional Regulation of Neural Retina Leucine Zipper (Nrl), a Photoreceptor Cell Fate Determinant* , 2011, The Journal of Biological Chemistry.

[73]  William Stafford Noble,et al.  Sequence and chromatin determinants of cell-type–specific transcription factor binding , 2012, Genome research.

[74]  R. Rohs,et al.  A widespread role of the motif environment in transcription factor binding across diverse protein families , 2015, Genome research.

[75]  A. Clark,et al.  Evolution of transcription factor binding sites in Mammalian gene regulatory regions: conservation and turnover. , 2002, Molecular biology and evolution.

[76]  N. Muzyczka,et al.  Next generation of adeno-associated virus 2 vectors: Point mutations in tyrosines lead to high-efficiency transduction at lower doses , 2008, Proceedings of the National Academy of Sciences.

[77]  C. Glass,et al.  The selection and function of cell type-specific enhancers , 2015, Nature Reviews Molecular Cell Biology.

[78]  Zhijian Wu,et al.  Effect of genome size on AAV vector packaging. , 2010, Molecular therapy : the journal of the American Society of Gene Therapy.

[79]  Euiseok J. Kim,et al.  Ascl1 (Mash1) lineage cells contribute to discrete cell populations in CNS architecture , 2008, Molecular and Cellular Neuroscience.

[80]  Lloyd H. Michael,et al.  The Guide for the Care and Use of Laboratory Animals. , 2016, ILAR journal.

[81]  M. Schwartz,et al.  The retina as a window to the brain—from eye research to CNS disorders , 2013, Nature Reviews Neurology.

[82]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[83]  C. Walsh,et al.  Genetic Changes Shaping the Human Brain , 2015 .

[84]  A. Visel,et al.  ChIP-seq accurately predicts tissue-specific activity of enhancers , 2009, Nature.

[85]  R. Masland,et al.  The Major Cell Populations of the Mouse Retina , 1998, The Journal of Neuroscience.

[86]  Shomi S. Bhattacharya,et al.  Photoreceptor degeneration: genetic and mechanistic dissection of a complex trait , 2010, Nature Reviews Genetics.

[87]  P. Franken,et al.  Sleep Loss Reduces the DNA-Binding of BMAL1, CLOCK, and NPAS2 to Specific Clock Genes in the Mouse Cerebral Cortex , 2011, PloS one.

[88]  R. Samulski,et al.  Adeno-associated virus serotypes: vector toolkit for human gene therapy. , 2006, Molecular therapy : the journal of the American Society of Gene Therapy.

[89]  J. Grieger,et al.  Production and characterization of adeno-associated viral vectors , 2006, Nature Protocols.

[90]  Nathan C. Sheffield,et al.  Predicting cell-type–specific gene expression from regions of open chromatin , 2012, Genome research.

[91]  A. Baiker,et al.  Universal real-time PCR for the detection and quantification of adeno-associated virus serotype 2-derived inverted terminal repeat sequences. , 2012, Human gene therapy methods.

[92]  W. Hauswirth,et al.  Production and purification of serotype 1, 2, and 5 recombinant adeno-associated viral vectors. , 2002, Methods.

[93]  Lan T M Dao,et al.  High-throughput and quantitative assessment of enhancer activity in mammals by CapStarr-seq , 2015, Nature Communications.

[94]  C. Murre,et al.  Helix-Loop-Helix Proteins: Regulators of Transcription in Eucaryotic Organisms , 2000, Molecular and Cellular Biology.

[95]  Data production leads,et al.  An integrated encyclopedia of DNA elements in the human genome , 2012 .

[96]  Irene K. Moore,et al.  High Nucleosome Occupancy Is Encoded at Human Regulatory Sequences , 2010, PloS one.

[97]  G. Lyons,et al.  Mef 2 gene expression marks the cardiac and skeletal muscle lineages during mouse embryogenesis , 1994 .

[98]  M. Kumar,et al.  Systematic determination of the packaging limit of lentiviral vectors. , 2001, Human gene therapy.

[99]  Barak A. Cohen,et al.  Complex effects of nucleotide variants in a mammalian cis-regulatory element , 2012, Proceedings of the National Academy of Sciences.

[100]  O. Rando,et al.  Mechanisms underlying nucleosome positioning in vivo. , 2014, Annual review of biophysics.

[101]  R. Dahm,et al.  Transfection Techniques for Neuronal Cells , 2010, The Journal of Neuroscience.

[102]  Fidencio J. Neri,et al.  Mouse regulatory DNA landscapes reveal global principles of cis-regulatory evolution , 2014, Science.

[103]  D. S. Gross,et al.  Nuclease hypersensitive sites in chromatin. , 1988, Annual review of biochemistry.

[104]  A. Behrens,et al.  Role of the AP-1 transcription factor c-Jun in developing, adult and injured brain , 2006, Progress in Neurobiology.

[105]  A. Swaroop,et al.  Transcriptional regulation of photoreceptor development and homeostasis in the mammalian retina , 2010, Nature Reviews Neuroscience.

[106]  T. Meehan,et al.  An atlas of active enhancers across human cell types and tissues , 2014, Nature.

[107]  S. Rumpel,et al.  Analysis of Transduction Efficiency, Tropism and Axonal Transport of AAV Serotypes 1, 2, 5, 6, 8 and 9 in the Mouse Brain , 2013, PloS one.

[108]  B. Cohen,et al.  Massively parallel in vivo enhancer assay reveals that highly local features determine the cis-regulatory function of ChIP-seq peaks , 2013, Proceedings of the National Academy of Sciences.

[109]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[110]  B. Cohen,et al.  High-throughput functional testing of ENCODE segmentation predictions , 2014, Genome research.

[111]  J. Flannery,et al.  Advances in AAV vector development for gene therapy in the retina. , 2014, Advances in experimental medicine and biology.

[112]  Lee E. Edsall,et al.  A map of the cis-regulatory sequences in the mouse genome , 2012, Nature.

[113]  Manolis Kellis,et al.  Interpreting non-coding variation in complex disease genetics , 2012, Nature Biotechnology.

[114]  Axel Visel,et al.  Genomic Perspectives of Transcriptional Regulation in Forebrain Development , 2015, Neuron.

[115]  A. C. Meyer,et al.  Functional Inactivation of a Fraction of Excitatory Synapses in Mice Deficient for the Active Zone Protein Bassoon , 2003, Neuron.

[116]  L. Cauller,et al.  Reduction of background autofluorescence in brain sections following immersion in sodium borohydride , 1998, Journal of Neuroscience Methods.

[117]  Martha L. Bulyk,et al.  Highly parallel assays of tissue-specific enhancers in whole Drosophila embryos , 2013, Nature Methods.

[118]  Leszek Rychlewski,et al.  FFAS03: a server for profile–profile sequence alignments , 2005, Nucleic Acids Res..

[119]  C. Glass,et al.  Epigenomics: Roadmap for regulation , 2015, Nature.

[120]  E. Davidson Genomic Regulatory Systems: Development and Evolution , 2005 .

[121]  Boris Lenhard,et al.  Mammalian RNA polymerase II core promoters: insights from genome-wide studies , 2007, Nature Reviews Genetics.

[122]  C. Whitelaw,et al.  Chromosomal position effects and the modulation of transgene expression. , 1994, Reproduction, fertility, and development.

[123]  D. Corcoran,et al.  Human promoters are intrinsically directional. , 2015, Molecular cell.

[124]  S. Kain,et al.  An enhanced green fluorescent protein allows sensitive detection of gene transfer in mammalian cells. , 1996, Biochemical and biophysical research communications.

[125]  M. Gut,et al.  Supplemental information for : “ CpG islands and GC content dictate nucleosome depletion in a transcription independent manner at mammalian promoters ” , 2012 .

[126]  Udo Albus,et al.  Book Review: Guide for the Care and use of Laboratory Animals , 1998 .

[127]  Pavel Osten,et al.  Stereotaxic gene delivery in the rodent brain , 2007, Nature Protocols.

[128]  L. Kruglyak,et al.  The role of regulatory variation in complex traits and disease , 2015, Nature Reviews Genetics.

[129]  Richard Bonneau,et al.  FIREWACh: High-throughput Functional Detection of Transcriptional Regulatory Modules in Mammalian Cells , 2014, Nature Methods.

[130]  J. T. Kadonaga,et al.  The RNA polymerase II core promoter: a key component in the regulation of gene expression. , 2002, Genes & development.