Multiparameter functional diversity of human C2H2 zinc finger proteins

C2H2 zinc finger proteins represent the largest and most enigmatic class of human transcription factors. Their C2H2-ZF arrays are highly variable, indicating that most will have unique DNA binding motifs. However, most of the binding motifs have not been directly determined. In addition, little is known about whether or how these proteins regulate transcription. Most of the ∼700 human C2H2-ZF proteins also contain at least one KRAB, SCAN, BTB, or SET domain, suggesting that they may have common interacting partners and/or effector functions. Here, we report a multifaceted functional analysis of 131 human C2H2-ZF proteins, encompassing DNA binding sites, interacting proteins, and transcriptional response to genetic perturbation. We confirm the expected diversity in DNA binding motifs and genomic binding sites, and provide motif models for 78 previously uncharacterized C2H2-ZF proteins, most of which are unique. Surprisingly, the diversity in protein-protein interactions is nearly as high as diversity in DNA binding motifs: Most C2H2-ZF proteins interact with a unique spectrum of co-activators and co-repressors. Thus, multiparameter diversification likely underlies the evolutionary success of this large class of human proteins.

[1]  Cole Trapnell,et al.  TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions , 2013, Genome Biology.

[2]  K. Yamamoto,et al.  DNA Binding Site Sequence Directs Glucocorticoid Receptor Structure and Activity , 2009, Science.

[3]  J. Mackay,et al.  Two-timing zinc finger transcription factors liaising with RNA. , 2012, Trends in biochemical sciences.

[4]  T. Bailey,et al.  Inferring direct DNA binding from ChIP-seq , 2012, Nucleic acids research.

[5]  Sudhir Kumar,et al.  Mutation rates in mammalian genomes , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[6]  Trisha R. Stankiewicz,et al.  C-terminal binding proteins: central players in development and disease , 2014, Biomolecular concepts.

[7]  M. Campbell,et al.  PANTHER: a library of protein families and subfamilies indexed by function. , 2003, Genome research.

[8]  P. Stogios,et al.  Sequence and structural analysis of BTB domain proteins , 2005, Genome Biology.

[9]  Christian Hennig,et al.  Recovering the number of clusters in data sets with noise features using feature rescaling factors , 2015, Inf. Sci..

[10]  A. Green,et al.  RFP represses transcriptional activation by bHLH transcription factors , 2005, Oncogene.

[11]  H. Yasuda,et al.  Regulation of the osteoblast-specific transcription factor Osterix by NO66, a Jumonji family histone demethylase , 2009, The EMBO journal.

[12]  Andrea Corsinotti,et al.  Global and Stage Specific Patterns of Krüppel-Associated-Box Zinc Finger Protein Gene Expression in Murine Early Embryonic Cells , 2013, PloS one.

[13]  Helen M. Rowe,et al.  Dynamic control of endogenous retroviruses during development. , 2011, Virology.

[14]  G. Maul,et al.  SETDB1: a novel KAP-1-associated histone H3, lysine 9-specific methyltransferase that contributes to HP1-mediated silencing of euchromatic genes by KRAB zinc-finger proteins. , 2002, Genes & development.

[15]  S. Ojeda,et al.  A Novel Transcription Complex That Selectively Modulates Apoptosis of Breast Cancer Cells through Regulation of FASTKD2 , 2011, Molecular and Cellular Biology.

[16]  Steven L Salzberg,et al.  Fast gapped-read alignment with Bowtie 2 , 2012, Nature Methods.

[17]  Paul Theodor Pyl,et al.  HTSeq—a Python framework to work with high-throughput sequencing data , 2014, bioRxiv.

[18]  Juan M. Vaquerizas,et al.  DNA-Binding Specificities of Human Transcription Factors , 2013, Cell.

[19]  Bronwen L. Aken,et al.  GENCODE: The reference human genome annotation for The ENCODE Project , 2012, Genome research.

[20]  D. Higgins,et al.  Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega , 2011, Molecular systems biology.

[21]  W. Huber,et al.  which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. MAnorm: a robust model for quantitative comparison of ChIP-Seq data sets , 2011 .

[22]  Daniel E. Newburger,et al.  A Multiparameter Network Reveals Extensive Divergence between C. elegans bHLH Transcription Factors , 2009, Cell.

[23]  D. Ballinger,et al.  The SCAN Domain Mediates Selective Oligomerization* , 2000, The Journal of Biological Chemistry.

[24]  Clifford A. Meyer,et al.  Model-based Analysis of ChIP-Seq (MACS) , 2008, Genome Biology.

[25]  K. Struhl,et al.  The Cyc8-Tup1 complex inhibits transcription primarily by masking the activation domain of the recruiting protein. , 2011, Genes & development.

[26]  Thomas Whitington,et al.  Transcription Factor Binding in Human Cells Occurs in Dense Clusters Formed around Cohesin Anchor Sites , 2013, Cell.

[27]  E. Kalkhoven,et al.  CBP and p300: HATs for different occasions. , 2004, Biochemical pharmacology.

[28]  Mihai Albu,et al.  C2H2 zinc finger proteins greatly expand the human regulatory lexicon , 2015, Nature Biotechnology.

[29]  P. Defossez,et al.  Born to bind: the BTB protein–protein interaction domain , 2006, BioEssays : news and reviews in molecular, cellular and developmental biology.

[30]  S. d'Auria,et al.  Structure/function of KRAB repression domains: Structural properties of KRAB modules inferred from hydrodynamic, circular dichroism, and FTIR spectroscopic analyses , 2005, Proteins.

[31]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[32]  Brendan J. Frey,et al.  A compendium of RNA-binding motifs for decoding gene regulation , 2013, Nature.

[33]  David J. Segal,et al.  The Protein-Binding Potential of C2H2 Zinc Finger Domains , 2008, Cell Biochemistry and Biophysics.

[34]  L. Stubbs,et al.  Function and Evolution of C2H2 Zinc Finger Arrays. , 2011, Sub-cellular biochemistry.

[35]  M. Washburn,et al.  YY1 functions with INO80 to activate transcription , 2007, Nature Structural &Molecular Biology.

[36]  Laurie Gordon,et al.  A comprehensive catalog of human KRAB-associated zinc finger genes: insights into the evolutionary history of a large family of transcriptional repressors. , 2006, Genome research.

[37]  J. Bieker Krüppel-like Factors: Three Fingers in Many Pies* , 2001, The Journal of Biological Chemistry.

[38]  Amyj . Williams,et al.  All in the Family: the BTB/POZ, KRAB, and SCAN Domains , 2001, Molecular and Cellular Biology.

[39]  Mikael Bodén,et al.  MEME Suite: tools for motif discovery and searching , 2009, Nucleic Acids Res..

[40]  Jan Baumbach,et al.  Comparing the performance of biomedical clustering methods , 2015, Nature Methods.

[41]  H. Dyson,et al.  Intrinsically unstructured proteins and their functions , 2005, Nature Reviews Molecular Cell Biology.

[42]  R. Emerson,et al.  Adaptive Evolution in Zinc Finger Transcription Factors , 2009, PLoS genetics.

[43]  C. Pabo,et al.  DNA recognition by Cys2His2 zinc finger proteins. , 2000, Annual review of biophysics and biomolecular structure.

[44]  M. Ladomery,et al.  Multifunctional zinc finger proteins in development and disease , 2002 .

[45]  A. Jeffrey,et al.  How the λ repressor and cro work , 1980, Cell.

[46]  Charles L. Lawson,et al.  Solving least squares problems , 1976, Classics in applied mathematics.

[47]  S. Goff,et al.  Embryonic stem cells use ZFP809 to silence retroviral DNAs , 2009, Nature.

[48]  D. Aswad,et al.  Regulation of transcription by a protein methyltransferase. , 1999, Science.

[49]  Johannes Söding,et al.  The HHpred interactive server for protein homology detection and structure prediction , 2005, Nucleic Acids Res..

[50]  H. Kimura,et al.  Proviral silencing in embryonic stem cells requires the histone methyltransferase ESET , 2010, Nature.

[51]  T R Hughes,et al.  A catalogue of eukaryotic transcription factor types, their evolutionary origin, and species distribution. , 2011, Sub-cellular biochemistry.

[52]  Andrei L. Turinsky,et al.  Human-chromatin-related protein interactions identify a demethylase complex required for chromosome segregation. , 2014, Cell reports.

[53]  D S Latchman,et al.  Eukaryotic transcription factors. , 1990, The Biochemical journal.

[54]  Juancarlos Chan,et al.  Gene Ontology Consortium: going forward , 2014, Nucleic Acids Res..

[55]  Guomin Liu,et al.  SAINTexpress: improvements and additional features in Significance Analysis of INTeractome software. , 2014, Journal of proteomics.

[56]  Jussi Taipale,et al.  Conservation of transcription factor binding specificities across 600 million years of bilateria evolution , 2015, eLife.

[57]  Vincent J. Lynch,et al.  Resurrecting the Role of Transcription Factor Change in Developmental Evolution , 2008, Evolution; international journal of organic evolution.

[58]  A Keith Dunker,et al.  Intrinsically disordered proteins and intrinsically disordered protein regions. , 2014, Annual review of biochemistry.

[59]  Mihai Albu,et al.  Identification of C2H2-ZF binding preferences from ChIP-seq data using RCADE , 2015, Bioinform..