Expression Patterns of Protein Kinases Correlate with Gene Architecture and Evolutionary Rates

Background Protein kinase (PK) genes comprise the third largest superfamily that occupy ∼2% of the human genome. They encode regulatory enzymes that control a vast variety of cellular processes through phosphorylation of their protein substrates. Expression of PK genes is subject to complex transcriptional regulation which is not fully understood. Principal Findings Our comparative analysis demonstrates that genomic organization of regulatory PK genes differs from organization of other protein coding genes. PK genes occupy larger genomic loci, have longer introns, spacer regions, and encode larger proteins. The primary transcript length of PK genes, similar to other protein coding genes, inversely correlates with gene expression level and expression breadth, which is likely due to the necessity to reduce metabolic costs of transcription for abundant messages. On average, PK genes evolve slower than other protein coding genes. Breadth of PK expression negatively correlates with rate of non-synonymous substitutions in protein coding regions. This rate is lower for high expression and ubiquitous PKs, relative to low expression PKs, and correlates with divergence in untranslated regions. Conversely, rate of silent mutations is uniform in different PK groups, indicating that differing rates of non-synonymous substitutions reflect variations in selective pressure. Brain and testis employ a considerable number of tissue-specific PKs, indicating high complexity of phosphorylation-dependent regulatory network in these organs. There are considerable differences in genomic organization between PKs up-regulated in the testis and brain. PK genes up-regulated in the highly proliferative testicular tissue are fast evolving and small, with short introns and transcribed regions. In contrast, genes up-regulated in the minimally proliferative nervous tissue carry long introns, extended transcribed regions, and evolve slowly. Conclusions/Significance PK genomic architecture, the size of gene functional domains and evolutionary rates correlate with the pattern of gene expression. Structure and evolutionary divergence of tissue-specific PK genes is related to the proliferative activity of the tissue where these genes are predominantly expressed. Our data provide evidence that physiological requirements for transcription intensity, ubiquitous expression, and tissue-specific regulation shape gene structure and affect rates of evolution.

[1]  C. Paweletz,et al.  Identification and Characterization of SSTK, a Serine/Threonine Protein Kinase Essential for Male Fertility , 2022 .

[2]  International Human Genome Sequencing Consortium Initial sequencing and analysis of the human genome , 2001, Nature.

[3]  A. Sgourou,et al.  Thalassaemia mutations within the 5′UTR of the human β‐globin gene disrupt transcription , 2004, British journal of haematology.

[4]  Cristian I. Castillo-Davis,et al.  Selection for short introns in highly expressed genes , 2002, Nature Genetics.

[5]  K. Lindblad-Toh,et al.  Systematic discovery of regulatory motifs in human promoters and 3′ UTRs by comparison of several mammals , 2005, Nature.

[6]  P. Mitchell,et al.  mRNA turnover. , 2001, Current opinion in cell biology.

[7]  J. V. Moran,et al.  Initial sequencing and analysis of the human genome. , 2001, Nature.

[8]  D. Landsman,et al.  Statistical analysis of over-represented words in human promoter sequences. , 2004, Nucleic acids research.

[9]  M. Rajadhyaksha,et al.  An Unusual Member of the Cdk Family: Cdk5 , 2008, Cellular and Molecular Neurobiology.

[10]  S. Shabalina,et al.  The mammalian transcriptome and the function of non-coding DNA sequences , 2004, Genome Biology.

[11]  E. Koonin,et al.  Origins and evolution of eukaryotic RNA interference. , 2008, Trends in ecology & evolution.

[12]  Mouse Genome Sequencing Consortium Initial sequencing and comparative analysis of the mouse genome , 2002, Nature.

[13]  Michael Q. Zhang,et al.  Identifying tissue-selective transcription factor binding sites in vertebrate promoters. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[14]  N A Kolchanov,et al.  Eukaryotic mRNAs encoding abundant and scarce proteins are statistically dissimilar in many structural features , 1998, FEBS letters.

[15]  D. Lipman,et al.  Patterns in interspecies similarity correlate with nucleotide composition in mammalian 3'UTRs. , 2003, Nucleic acids research.

[16]  M. Kimura A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences , 1980, Journal of Molecular Evolution.

[17]  A. Willis,et al.  The implications of structured 5' untranslated regions on translation and disease. , 2005, Seminars in cell & developmental biology.

[18]  J. Sutcliffe,et al.  1G5: a calmodulin-binding, vesicle-associated, protein kinase-like protein enriched in forebrain neurites , 1994, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[19]  Hardie Dg An emerging role for protein kinases: the response to nutritional and environmental stress. , 1994 .

[20]  Yan Zhang,et al.  GEPIS - quantitative gene expression profiling in normal and cancer tissues , 2004, Bioinform..

[21]  V. Mauro,et al.  An mRNA-rRNA base-pairing mechanism for translation initiation in eukaryotes , 2006, Nature Structural &Molecular Biology.

[22]  Z. Weng,et al.  Detection of functional DNA motifs via statistical over-representation. , 2004, Nucleic acids research.

[23]  T. Ogihara,et al.  Mutation of the Follicle-Stimulating Hormone Receptor Gene 5′-Untranslated Region Associated With Female Hypertension , 2006, Hypertension.

[24]  T. Hunter,et al.  The eukaryotic protein kinase superfamily: kinase (catalytic) domain structure and classification 1 , 1995, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[25]  A. Flenniken,et al.  Segmental expression of the EphA4 (Sek-1) receptor tyrosine kinase in the hindbrain is under direct transcriptional control of Krox-20. , 1998, Development.

[26]  Ziheng Yang,et al.  PAML: a program package for phylogenetic analysis by maximum likelihood , 1997, Comput. Appl. Biosci..

[27]  A. Andres,et al.  A Novel Family of Serine/Threonine Kinases Participating in Spermiogenesis , 1997, The Journal of cell biology.

[28]  F. Amaldi,et al.  A somatic mutation in the 5′UTR of BRCA1 gene in sporadic breast cancer causes down-modulation of translation efficiency , 2001, Oncogene.

[29]  Alexander E. Kel,et al.  TRANSFAC® and its module TRANSCompel®: transcriptional gene regulation in eukaryotes , 2005, Nucleic Acids Res..

[30]  A. Thomson,et al.  Identification of a Novel AU-Rich Element in the 3′ Untranslated Region of Epidermal Growth Factor Receptor mRNA That Is the Target for Regulated RNA-Binding Proteins , 2001, Molecular and Cellular Biology.

[31]  S. Shabalina,et al.  Pattern of selective constraint in C. elegans and C. briggsae genomes. , 1999, Genetical research.

[32]  Aleksey Y. Ogurtsov,et al.  OWEN: aligning long collinear regions of genomes , 2002, Bioinform..

[33]  Yan Zhang,et al.  GeneHub-GEPIS: digital expression profiling for normal and cancer tissues based on an integrated gene database , 2007, Nucleic Acids Res..

[34]  M. Kozak An analysis of 5'-noncoding sequences from 699 vertebrate messenger RNAs. , 1987, Nucleic acids research.

[35]  M. Kozak,et al.  An analysis of vertebrate mRNA sequences: intimations of translational control , 1991, The Journal of cell biology.

[36]  S. Narumiya,et al.  Molecular Cloning and Characterization of CLICK-III/CaMKIγ, a Novel Membrane-anchored Neuronal Ca2+/Calmodulin-dependent Protein Kinase (CaMK)* , 2003, The Journal of Biological Chemistry.

[37]  C. Lawrence,et al.  Human-mouse genome comparisons to locate regulatory sites , 2000, Nature Genetics.

[38]  A. W. van der Velden,et al.  The role of the 5' untranslated region of an mRNA in translation regulation during development. , 1999, The international journal of biochemistry & cell biology.

[39]  Nafisa N. Nazipova,et al.  SAMSON: a software package for the biopolymer primary structure analysis , 1995, Comput. Appl. Biosci..

[40]  P. Sassone-Corsi,et al.  Testis-specific transcription mechanisms promoting male germ-cell differentiation. , 2004, Reproduction.

[41]  N. Iguchi,et al.  Cloning and characterization of human haspin gene encoding haploid germ cell-specific nuclear protein kinase. , 2001, Molecular human reproduction.

[42]  Samuel H. Wilson,et al.  Identification of a nuclear protein binding element within the rat brain protein kinase C γ promoter that is related to the developmental control of this gene , 1993, FEBS letters.

[43]  L. Duret,et al.  Determinants of substitution rates in mammalian genes: expression pattern affects selection intensity but not mutation rate. , 2000, Molecular biology and evolution.

[44]  Aleksey Y Ogurtsov,et al.  Distant conserved sequences flanking endothelial-specific promoters contain tissue-specific DNase-hypersensitive sites and over-represented motifs. , 2006, Human molecular genetics.

[45]  R. Tjian,et al.  Transcription regulation and animal diversity , 2003, Nature.

[46]  Sandra Orchard,et al.  The Annotation of Both Human and Mouse Kinomes in UniProtKB/Swiss-Prot , 2008, Molecular & Cellular Proteomics.

[47]  Michael Q. Zhang,et al.  A clustering property of highly-degenerate transcription factor binding sites in the mammalian genome , 2006, Nucleic acids research.

[48]  T. Hunter,et al.  The Protein Kinase Complement of the Human Genome , 2002, Science.

[49]  R. Russell,et al.  Animal MicroRNAs Confer Robustness to Gene Expression and Have a Significant Impact on 3′UTR Evolution , 2005, Cell.

[50]  E. Levanon,et al.  Human housekeeping genes are compact. , 2003, Trends in genetics : TIG.

[51]  M. Boguski,et al.  Evolutionary parameters of the transcribed mammalian genome: an analysis of 2,820 orthologous rodent and human sequences. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[52]  E. Koonin,et al.  Conservation and coevolution in the scale-free human gene coexpression network. , 2004, Molecular biology and evolution.

[53]  A. Means,et al.  Spermiogenesis and exchange of basic nuclear proteins are impaired in male germ cells lacking Camk4 , 2000, Nature Genetics.

[54]  Alexey S Kondrashov,et al.  Classification of common conserved sequences in mammalian intergenic regions. , 2002, Human molecular genetics.

[55]  Jun Yu,et al.  How many human genes can be defined as housekeeping with current expression data? , 2008, BMC Genomics.

[56]  G. Coetzee,et al.  Identification of two germline point mutations in the 5'UTR of the androgen receptor gene in men with prostate cancer. , 1997, The Journal of urology.

[57]  Thomas Huber,et al.  Phosphoregulators: protein kinases and protein phosphatases of mouse. , 2003, Genome research.

[58]  Ivan Ovcharenko,et al.  Predicting tissue-specific enhancers in the human genome. , 2006, Genome research.

[59]  F. Robert,et al.  Genome-wide computational prediction of transcriptional regulatory modules reveals new insights into human gene expression , 2006 .

[60]  Vasudevan Seshadri,et al.  Translational control by the 3'-UTR: the ends specify the means. , 2003, Trends in biochemical sciences.

[61]  O. Matveeva,et al.  Intermolecular mRNA-rRNA hybridization and the distribution of potential interaction regions in murine 18S rRNA. , 1993, Nucleic acids research.

[62]  Aleksey Y. Ogurtsov,et al.  A periodic pattern of mRNA secondary structure created by the genetic code , 2006, Nucleic acids research.

[63]  N. Nath,et al.  PAK5, a New Brain-Specific Kinase, Promotes Neurite Outgrowth in N1E-115 Cells , 2002, Molecular and Cellular Biology.

[64]  Jean L. Chang,et al.  Initial sequence of the chimpanzee genome and comparison with the human genome , 2005, Nature.

[65]  T. Hunter,et al.  The mouse kinome: discovery and comparative genomics of all mouse protein kinases. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[66]  P. Hagerman,et al.  The (CGG)n repeat element within the 5' untranslated region of the FMR1 message provides both positive and negative cis effects on in vivo translation of a downstream reporter. , 2003, Human molecular genetics.

[67]  D. Hardie Metabolic control: A new solution to an old problem , 2000, Current Biology.

[68]  A. Ogurtsov,et al.  Selective constraint in intergenic regions of human and mouse genomes. , 2001, Trends in genetics : TIG.

[69]  T. Sunyer,et al.  Sequence analysis and DNA-protein interactions within the 5' flanking region of the Ca2+/calmodulin-dependent protein kinase II alpha-subunit gene. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[70]  T. Hunter,et al.  The eukaryotic protein kinase superfamily: kinase (catalytic) domain structure and classification 1 , 1995, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[71]  A. Munnich,et al.  Father‐to‐daughter transmission of Cornelia de Lange syndrome caused by a mutation in the 5′ untranslated region of the NIPBL Gene , 2006, Human mutation.

[72]  T. Hunter,et al.  Signaling—2000 and Beyond , 2000, Cell.

[73]  G. Edelman,et al.  The ribosome filter hypothesis , 2002, Proceedings of the National Academy of Sciences of the United States of America.