microProtein Prediction Program (miP3): A Software for Predicting microProteins and Their Target Transcription Factors

An emerging concept in transcriptional regulation is that a class of truncated transcription factors (TFs), called microProteins (miPs), engages in protein-protein interactions with TF complexes and provides feedback control. Only a handful of miPs have been described in the literature, and how prevalent they are remains unclear. Here we present the miP prediction program (miP3), an algorithm implemented in Python that predicts miPs and their target TFs from a sequenced genome. The software will help shed light on the prevalence, biological roles, and evolution of miPs. Moreover, miP3 can be used to predict other types of miP-like proteins that may have evolved from other functional classes, such as kinases and receptors. The program is freely available and can be applied to any sequenced genome.
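The abstract does not spell out the steps miP3 takes, so the following is only a minimal sketch of the idea behind the definition above, not the program's actual procedure. It assumes that each protein already carries a length and a set of domain labels, and it treats a protein as a miP candidate if it is short, lacks a DNA-binding domain, and shares a non-DNA-binding domain with a known TF. The `Protein` class, the `predict_mips` function, the length cutoff, the domain labels, and the toy protein names are all hypothetical and chosen for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Protein:
    name: str
    length: int          # length in amino acids
    domains: frozenset   # domain labels, e.g. taken from a domain annotation tool


# Hypothetical settings: neither value comes from the miP3 paper.
MAX_MIP_LENGTH = 200
DNA_BINDING = frozenset({"homeodomain"})


def predict_mips(proteins, tf_names, max_len=MAX_MIP_LENGTH, dna_binding=DNA_BINDING):
    """Map each candidate miP name to the TFs it might target.

    A candidate is a short non-TF protein with no DNA-binding domain that
    shares at least one other domain with a known TF.
    """
    tfs = [p for p in proteins if p.name in tf_names]
    predictions = {}
    for p in proteins:
        if p.name in tf_names or p.length > max_len:
            continue  # known TFs and long proteins are not candidates
        if p.domains & dna_binding:
            continue  # a miP is expected to lack the DNA-binding domain
        targets = sorted(
            tf.name for tf in tfs if (tf.domains - dna_binding) & p.domains
        )
        if targets:
            predictions[p.name] = targets
    return predictions


# Toy input with made-up proteins.
proteins = [
    Protein("TF_A", 840, frozenset({"leucine_zipper", "homeodomain"})),
    Protein("TF_B", 400, frozenset({"homeodomain", "knox"})),
    Protein("small_1", 90, frozenset({"leucine_zipper"})),
    Protein("small_2", 120, frozenset({"homeodomain"})),
    Protein("small_3", 150, frozenset({"kinase"})),
]
result = predict_mips(proteins, tf_names={"TF_A", "TF_B"})
print(result)
```

In this toy input only `small_1` qualifies: `small_2` still carries the DNA-binding domain and `small_3` shares no domain with either TF. Real use would need sequence similarity searches and domain annotation to produce the inputs, which this sketch takes as given.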
