Exploring the DNA-recognition potential of homeodomains

The recognition potential of most families of DNA-binding domains (DBDs) remains relatively unexplored. Homeodomains (HDs), like many other families of DBDs, display limited diversity in their preferred recognition sequences. To explore the recognition potential of HDs, we utilized a bacterial selection system to isolate HD variants, from a randomized library, that are compatible with each of the 64 possible 3' triplet sites (i.e., TAANNN). The majority of these selections yielded sets of HDs with overrepresented residues at specific recognition positions, implying the selection of specific binders. The DNA-binding specificity of 151 representative HD variants was subsequently characterized, identifying HDs that preferentially recognize 44 of these target sites. Many of these variants contain novel combinations of specificity determinants that are uncommon or absent in extant HDs. These novel determinants, when grafted into different HD backbones, produce a corresponding alteration in specificity. This information was used to create more explicit HD recognition models, which can inform the prediction of transcriptional regulatory networks for extant HDs or the engineering of HDs with novel DNA-recognition potential. The diversity of recovered HD recognition sequences raises important questions about the fitness barrier that restricts the evolution of alternate recognition modalities in natural systems.

[1]  R. Mann,et al.  Hox specificity unique roles for cofactors and collaborators. , 2009, Current topics in developmental biology.

[2]  Michael A. Crickmore,et al.  Functional Specificity of a Hox Protein Mediated by the Recognition of Minor Groove Structure , 2007, Cell.

[3]  P. Sharp,et al.  Homeodomain determinants of major groove recognition. , 1994, Biochemistry.

[4]  E. Fraenkel,et al.  Engrailed homeodomain-DNA complex at 2.2 A resolution: a detailed view of the interface and comparison with other engrailed structures. , 1998, Journal of molecular biology.

[5]  Pierre Gönczy,et al.  A single amino acid can determine the DNA binding specificity of homeodomain proteins , 1989, Cell.

[6]  M. Noyes,et al.  A systematic characterization of factors that regulate Drosophila segmentation via a bacterial one-hybrid system , 2008, Nucleic acids research.

[7]  P. Donnelly,et al.  Drive Against Hotspot Motifs in Primates Implicates the PRDM9 Gene in Meiotic Recombination , 2010, Science.

[8]  Raymond C Stevens,et al.  Crystal structure and DNA binding of the homeodomain of the stem cell transcription factor Nanog. , 2008, Journal of molecular biology.

[9]  E. Rebar,et al.  Genome editing with engineered zinc finger nucleases , 2010, Nature Reviews Genetics.

[10]  Gary D. Stormo,et al.  Recognition models to predict DNA-binding specificities of homeodomain proteins , 2012, Bioinform..

[11]  Lihua Julie Zhu,et al.  Zinc finger protein-dependent and -independent contributions to the in vivo off-target activity of zinc finger nucleases , 2010, Nucleic Acids Res..

[12]  Nir Friedman,et al.  Ab Initio Prediction of Transcription Factor Targets Using Structural Knowledge , 2005, PLoS Comput. Biol..

[13]  J. Williamson,et al.  Quantitative analysis of protein-RNA interactions by gel mobility shift. , 2008, Methods in molecular biology.

[14]  Antonio J Giraldez,et al.  Evaluation and application of modularly assembled zinc-finger nucleases in zebrafish , 2011, Development.

[15]  R. Emerson,et al.  Adaptive Evolution in Zinc Finger Transcription Factors , 2009, PLoS genetics.

[16]  Charles Elkan,et al.  Fitting a Mixture Model By Expectation Maximization To Discover Motifs In Biopolymer , 1994, ISMB.

[17]  P. Bradley,et al.  Extensive protein and DNA backbone sampling improves structure-based specificity prediction for C2H2 zinc fingers , 2011, Nucleic acids research.

[18]  Ross Ihaka,et al.  Gentleman R: R: A language for data analysis and graphics , 1996 .

[19]  W. Gehring,et al.  Homeodomain proteins. , 1994, Annual review of biochemistry.

[20]  Mona Singh,et al.  An expanded binding model for Cys2His2 zinc finger protein–DNA interfaces , 2011, Physical biology.

[21]  C. Deppmann,et al.  Cross-species annotation of basic leucine zipper factor interactions: Insight into the evolution of closed interaction networks. , 2006, Molecular biology and evolution.

[22]  G. Tell,et al.  A molecular code dictates sequence‐specific DNA recognition by homeodomains. , 1996, The EMBO journal.

[23]  Anthony A. Philippakis,et al.  Predicting the binding preference of transcription factors to individual DNA k-mers , 2009, Bioinform..

[24]  B. Sun,et al.  The degree of variation in DNA sequence recognition among four Drosophila homeotic proteins. , 1994, The EMBO journal.

[25]  Andrew R. Gehrke,et al.  Genome-wide analysis of ETS-family DNA-binding in vitro and in vivo , 2010, The EMBO journal.

[26]  Saurabh Sinha,et al.  FlyFactorSurvey: a database of Drosophila transcription factor binding specificities determined using the bacterial one-hybrid system , 2010, Nucleic Acids Res..

[27]  G. Coop,et al.  PRDM9 Is a Major Determinant of Meiotic Recombination Hotspots in Humans and Mice , 2010, Science.

[28]  R. Mann,et al.  Origins of specificity in protein-DNA recognition. , 2010, Annual review of biochemistry.

[29]  W. Gehring,et al.  The interaction with DNA of wild‐type and mutant fushi tarazu homeodomains. , 1990, The EMBO journal.

[30]  Cynthia Wolberger,et al.  Crystal structure of a MAT alpha 2 homeodomain-operator complex suggests a general model for homeodomain-DNA interactions. , 1991, Cell.

[31]  Panayiotis V. Benos,et al.  Inferring protein-DNA dependencies using motif alignments and mutual information , 2007, ISMB/ECCB.

[32]  Daniel E. Newburger,et al.  Variation in Homeodomain DNA Binding Revealed by High-Resolution Analysis of Sequence Preferences , 2008, Cell.

[33]  A. Riggs,et al.  Lac repressor binding to non-operator DNA: detailed studies and a comparison of eequilibrium and rate competition methods. , 1972, Journal of molecular biology.

[34]  M. Cleary,et al.  Structure of a HoxB1–Pbx1 Heterodimer Bound to DNA Role of the Hexapeptide and a Fourth Homeodomain Helix in Complex Formation , 1999, Cell.

[35]  Juan M. Vaquerizas,et al.  A census of human transcription factors: function, expression and evolution , 2009, Nature Reviews Genetics.

[36]  G. Stormo,et al.  Sequence analysis Context-dependent DNA recognition code for C 2 H 2 zinc-finger transcription factors , 2008 .

[37]  Toni Cathomen,et al.  Unexpected failure rates for modular assembly of engineered zinc fingers , 2008, Nature Methods.

[38]  Sarah E. Ades,et al.  Differential DNA-binding specificity of the engrailed homeodomain: the role of residue 50. , 1994, Biochemistry.

[39]  Panayiotis V Benos,et al.  Probabilistic code for DNA recognition by proteins of the EGR family. , 2002, Journal of molecular biology.

[40]  Yaoqi Zhou,et al.  Structure-based prediction of DNA-binding proteins by structural alignment and a volume-fraction corrected DFIRE-based energy function , 2010, Bioinform..

[41]  R. Mann,et al.  Cofactor Binding Evokes Latent Differences in DNA Binding Specificity between Hox Proteins , 2011, Cell.

[42]  T. Bürglin,et al.  Homeodomain subtypes and functional diversity. , 2011, Sub-cellular biochemistry.

[43]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[44]  Sarah E. Ades,et al.  Engrailed (Gln50-->Lys) homeodomain-DNA complex at 1.9 A resolution: structural basis for enhanced affinity and altered specificity. , 1997, Structure.

[45]  R. Sauer,et al.  Specificity of minor-groove and major-groove interactions in a homeodomain-DNA complex. , 1995, Biochemistry.

[46]  Martha L. Bulyk,et al.  Using a structural and logics systems approach to infer bHLH–DNA binding specificity determinants , 2011, Nucleic acids research.

[47]  C. Pabo,et al.  Exploring the role of glutamine 50 in the homeodomain-DNA interface: crystal structure of engrailed (Gln50 --> ala) complex at 2.0 A. , 2000, Biochemistry.

[48]  P. Sharp,et al.  Structure-based design of transcription factors. , 1995, Science.

[49]  Shannon R. Magari,et al.  A humanized system for pharmacologic control of gene expression , 1996, Nature Medicine.

[50]  Mona Singh,et al.  Predicting DNA recognition by Cys2His2 zinc finger proteins , 2009, Bioinform..

[51]  Daniel E. Newburger,et al.  Diversity and Complexity in DNA Recognition by Transcription Factors , 2009, Science.

[52]  Aneel K. Aggarwal,et al.  Structure of a DNA-bound Ultrabithorax–Extradenticle homeodomain complex , 1999, Nature.

[53]  T. Kornberg,et al.  Understanding the homeodomain. , 1993, The Journal of biological chemistry.

[54]  C. Francklyn,et al.  Mutational analysis of the engrailed homeodomain recognition helix by phage display. , 1999, Nucleic acids research.

[55]  R. Mann,et al.  The role of DNA shape in protein-DNA recognition , 2009, Nature.

[56]  R. Brent,et al.  A genetic model for interaction of the homeodomain recognition helix with DNA. , 1991, Science.

[57]  J. Geiger,et al.  Crystal structure of the Msx-1 homeodomain/DNA complex. , 2001, Biochemistry.

[58]  G. Stormo,et al.  Analysis of Homeodomain Specificities Allows the Family-wide Prediction of Preferred Recognition Sites , 2008, Cell.

[59]  G. Stormo,et al.  Quantitative analysis demonstrates most transcription factors require only simple models of specificity , 2011, Nature Biotechnology.

[60]  Carl O. Pabo,et al.  Crystal structure of an engrailed homeodomain-DNA complex at 2.8 Å resolution: A framework for understanding homeodomain-DNA interactions , 1990, Cell.

[61]  E. Gelmann,et al.  DNA-binding sequence of the human prostate-specific homeodomain protein NKX3.1. , 2000, Nucleic acids research.

[62]  Gary D. Stormo,et al.  Context-dependent DNA recognition code for C2H2 zinc-finger transcription factors , 2008, Bioinform..