Structural facets of POU2F1 in light of the functional annotations and sequence-structure patterns

POU domain class 2 homeobox 1 (POU2F1) is widely known as an important transcription factor. Because of its association with several types of malignancy, POU2F1 has become one of the key factors in pan-cancer analysis. However, although this protein is considered a potential drug target, no drug targeting POU2F1 has yet been designed, owing to the protein's extreme structural flexibility. In this article, we propose a comprehensive three-level framework for understanding the structural conservation and co-variation of POU2F1. First, a gene regulatory network based on the normal and pathological functions of POU2F1 was constructed to better understand the strong association between POU2F1 deregulation and cancers. Next, based on evolutionary sequence space analysis, the comparative sequence dynamics of the members of the POU domain family were studied, chiefly between human and non-human species. Subsequently, the reciprocity effect of residue co-variation was identified through direct coupling analysis. In addition, the structure of POU2F1 was analyzed by means of quality assessment and a normal mode-based structure network. By comparing the sequence space and structure space information, the most significant set of residues, viz. 3, 9, 13, 17, 20, 21, 28, 35, and 36, was identified as the structural facet for function. This study demonstrates that the structural malleability of POU2F1 is one of the prime reasons behind its functional multiplicity in terms of protein moonlighting.
