Inferring Hypotheses on Functional Relationships of Genes: Analysis of the Arabidopsis thaliana Subtilase Gene Family

The gene family of subtilisin-like serine proteases (subtilases) in Arabidopsis thaliana comprises 56 members, divided into six distinct subfamilies. Whereas the members of five subfamilies are similar to pyrolysins, two genes share stronger similarity to animal kexins. Mutant screens confirmed 144 T-DNA insertion lines with knockouts for 55 out of the 56 subtilases. Apart from SDD1, none of the confirmed homozygous mutants revealed any obvious visible phenotypic alteration during growth under standard conditions. Apart from this specific case, forward genetics gave us no hints about the function of the individual 54 non-characterized subtilase genes. Therefore, the main objective of our work was to overcome the shortcomings of the forward genetic approach and to infer alternative experimental approaches by using an integrative bioinformatics and biological approach. Computational analyses based on transcriptional co-expression and co-response pattern revealed at least two expression networks, suggesting that functional redundancy may exist among subtilases with limited similarity. Furthermore, two hubs were identified, which may be involved in signalling or may represent higher-order regulatory factors involved in responses to environmental cues. A particular enrichment of co-regulated genes with metabolic functions was observed for four subtilases possibly representing late responsive elements of environmental stress. The kexin homologs show stronger associations with genes of transcriptional regulation context. Based on the analyses presented here and in accordance with previously characterized subtilases, we propose three main functions of subtilases: involvement in (i) control of development, (ii) protein turnover, and (iii) action as downstream components of signalling cascades. Supplemental material is available in the Plant Subtilase Database (PSDB) (http://csbdb.mpimp-golm.mpg.de/psdb.html) , as well as from the CSB.DB (http://csbdb.mpimp-golm.mpg.de).

[1]  B. N. Golovkin,et al.  New subtilisin-like collagenase from leaves of common plantain. , 2001, Biochimie.

[2]  I. Bancroft,et al.  Insights Into the Structural and Functional Evolution of Plant Genomes Afforded by the Nucleotide Sequences of Chromosomes 2 and 4 of Arabidopsis Thaliana , 2000, Yeast.

[3]  J. Görlach,et al.  Growth Stage–Based Phenotypic Analysis of Arabidopsis , 2001, The Plant Cell Online.

[4]  B. N. Golovkin,et al.  Macluralisin — a serine proteinase from fruits of Maclura pomifera (Raf.) Schneid. , 2004, Planta.

[5]  Charlie Hodgman,et al.  A historical perspective on gene/protein functional assignment , 2000, Bioinform..

[6]  Neel G. Barnaby,et al.  Cleavage specificity of the subtilisin-like protease C1 from soybean. , 2002, Biochimica et biophysica acta.

[7]  Stefan R. Henz,et al.  A gene expression map of Arabidopsis thaliana development , 2005, Nature Genetics.

[8]  V. Puizdar,et al.  A novel subtilase from common bean leaves , 2002, FEBS letters.

[9]  Dirk Eick,et al.  The last CTD repeat of the mammalian RNA polymerase II large subunit is important for its stability. , 2004, Nucleic acids research.

[10]  N. Seidah,et al.  Proprotein and prohormone convertases: a family of subtilases generating diverse bioactive polypeptides 1 Published on the World Wide Web on 17 August 1999. 1 , 1999, Brain Research.

[11]  L. Baringhaus,et al.  On a new multivariate two-sample test , 2004 .

[12]  P. Espenshade,et al.  Molecular identification of the sterol-regulated luminal protease that cleaves SREBPs and controls lipid composition of animal cells. , 1998, Molecular cell.

[13]  Björn Usadel,et al.  CSB.DB: a comprehensive systems-biology database , 2004, Bioinform..

[14]  G.N. Rudenskaya,et al.  Taraxalisin – a serine proteinase from dandelion Taraxacum officinale Webb s.l , 1998, FEBS letters.

[15]  Y. Nagaoka,et al.  Cucumisin, a serine protease from melon fruits, shares structural homology with subtilisin and is generated from a large precursor. , 1994, The Journal of biological chemistry.

[16]  B. Clark,et al.  Inhibition of translation initiation complex formation by MS1 , 1972, FEBS letters.

[17]  T. Altmann,et al.  The Subtilisin-Like Serine Protease SDD1 Mediates Cell-to-Cell Signaling during Arabidopsis Stomatal Development Article, publication date, and citation information can be found at www.plantcell.org/cgi/doi/10.1105/tpc.001016. , 2002, The Plant Cell Online.

[18]  N. Seidah,et al.  Mammalian subtilisin/kexin isozyme SKI-1: A widely expressed proprotein convertase with a unique cleavage specificity and cellular localization. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[19]  Robert Tibshirani,et al.  An Introduction to the Bootstrap , 1994 .

[20]  S. Brunak,et al.  Predicting subcellular localization of proteins based on their N-terminal amino acid sequence. , 2000, Journal of molecular biology.

[21]  N. Amrhein,et al.  Characterization of the subtilase gene family in tomato (Lycopersicon esculentum Mill.) , 1999, Plant Molecular Biology.

[22]  Alan M. Jones,et al.  The S8 serine, C1A cysteine and A1 aspartic protease families in Arabidopsis. , 2004, Phytochemistry.

[23]  B. Ndimba,et al.  Ara12 subtilisin-like protease from Arabidopsis thaliana: purification, substrate specificity and tissue localization. , 2003, The Biochemical journal.

[24]  P. Barr,et al.  Mammalian subtilisins: The long-sought dibasic processing endoproteases , 1991, Cell.

[25]  J. Gower,et al.  Metric and Euclidean properties of dissimilarity coefficients , 1986 .

[26]  A Wlodawer,et al.  Catalytic triads and their relatives. , 1998, Trends in biochemical sciences.

[27]  Roderic D. M. Page,et al.  TreeView: an application to display phylogenetic trees on personal computers , 1996, Comput. Appl. Biosci..

[28]  Gregory D. Schuler,et al.  Database resources of the National Center for Biotechnology Information: update , 2004, Nucleic acids research.

[29]  Boris Mirkin,et al.  Mathematical Classification and Clustering , 1996 .

[30]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[31]  Nick James,et al.  NASCArrays: a repository for microarray data generated by NASC's transcriptomics service , 2004, Nucleic Acids Res..

[32]  J. Thorner,et al.  Yeast prohormone processing enzyme (KEX2 gene product) is a Ca2+-dependent serine protease. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[33]  K. K. Thomsen,et al.  Purification and Characterization of Hordolisin, a Subtilisin-like Serine Endoprotease from Barley , 2000 .

[34]  Yong Li,et al.  An Arabidopsis thaliana T-DNA mutagenized population (GABI-Kat) for flanking sequence tag-based reverse genetics , 2003, Plant Molecular Biology.

[35]  P. Vera,et al.  Characterization of LRP, a leucine-rich repeat (LRR) protein from tomato plants that is processed during pathogenesis. , 1996, The Plant journal : for cell and molecular biology.

[36]  Jungwon Yoon,et al.  The Arabidopsis Information Resource (TAIR): a model organism database providing a centralized, curated gateway to Arabidopsis biology, research materials and community , 2003, Nucleic Acids Res..

[37]  M. Kaneda,et al.  Isolation and characterization of a proteinase from the sarcocarp of melon fruit. , 1975, Journal of biochemistry.

[38]  J. Leunissen,et al.  Subtilases: The superfamily of subtilisin‐like serine proteases , 1997, Protein science : a publication of the Protein Society.

[39]  J. Bull,et al.  An Empirical Test of Bootstrapping as a Method for Assessing Confidence in Phylogenetic Analysis , 1993 .

[40]  P. Vera,et al.  A Genomic Cluster Containing Four Differentially Regulated Subtilisin-like Processing Protease Genes Is in Tomato Plants* , 1999, The Journal of Biological Chemistry.

[41]  F. James Rohlf,et al.  Biometry: The Principles and Practice of Statistics in Biological Research , 1969 .

[42]  P. Vera,et al.  Pathogenesis-related proteins of tomato : p-69 as an alkaline endoproteinase. , 1988, Plant physiology.

[43]  J. Dyer,et al.  Identification of a Subtilisin-Like Protease in Seeds of Developing Tung Fruits , 1999 .

[44]  The Arabidopsis Genome Initiative Analysis of the genome sequence of the flowering plant Arabidopsis thaliana , 2000, Nature.

[45]  S. Shiu,et al.  Receptor-like kinases from Arabidopsis form a monophyletic gene family related to animal receptor kinases , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[46]  A J Davison,et al.  Alphaherpesviruses possess a gene homologous to the protein kinase gene family of eukaryotes and retroviruses. , 1986, Nucleic acids research.

[47]  J. Thompson,et al.  The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. , 1997, Nucleic acids research.

[48]  T. Altmann,et al.  A subtilisin-like serine protease involved in the regulation of stomatal density and distribution in Arabidopsis thaliana. , 2000, Genes & development.

[49]  P. Vera,et al.  Primary structure and expression of a pathogen-induced protease (PR-P69) in tomato plants: Similarity of functional domains to subtilisin-like endoproteases. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[50]  J. O. Berry,et al.  Processing and secretion of a virally encoded antifungal toxin in transgenic tobacco plants: evidence for a Kex2p pathway in plants. , 1995, The Plant cell.

[51]  Y. Machida,et al.  A subtilisin-like serine protease is required for epidermal surface formation in Arabidopsis embryos and juvenile plants. , 2001, Development.

[52]  A. Bateman,et al.  The PA domain: A protease‐associated domain , 2000, Protein science : a publication of the Protein Society.

[53]  T. Gibson,et al.  Applying motif and profile searches. , 1996, Methods in enzymology.

[54]  R. Jung,et al.  Two subtilisin-like proteases from soybean. , 2002, Physiologia plantarum.

[55]  Vladimir Batagelj,et al.  Pajek - Analysis and Visualization of Large Networks , 2001, Graph Drawing Software.

[56]  Joachim Selbig,et al.  Hypothesis-driven approach to predict transcriptional units from gene expression data , 2004, Bioinform..

[57]  A. M. Bogacheva Plant subtilisins. , 1999, Biochemistry. Biokhimiia.

[58]  M Caboche,et al.  Improved PCR-walking for large-scale isolation of plant T-DNA borders. , 2001, BioTechniques.

[59]  J. McDowell,et al.  Strong, constitutive expression of the Arabidopsis ACT2/ACT8 actin subclass in vegetative tissues. , 1996, The Plant journal : for cell and molecular biology.

[60]  C. Bonferroni Il calcolo delle assicurazioni su gruppi di teste , 1935 .

[61]  K. Hofmann,et al.  The protease-associated domain: a homology domain associated with multiple classes of proteases. , 2001, Trends in biochemical sciences.

[62]  B. Jones,et al.  SEP-1 – a subtilisin-like serine endopeptidase from germinated seeds of Hordeum vulgare L. cv. Morex , 2002, Planta.

[63]  M. Schmid,et al.  Genome-Wide Insertional Mutagenesis of Arabidopsis thaliana , 2003, Science.

[64]  M. Hauser,et al.  Identification and Characterization of the ARIADNEGene Family in Arabidopsis. A Group of Putative E3 Ligases1 , 2003, Plant Physiology.

[65]  Hans-Werner Mewes,et al.  MIPS Arabidopsis thaliana Database (MAtDB): an integrated biological knowledge resource for plant genomics , 2004, Nucleic Acids Res..

[66]  Y. Yamazaki,et al.  Identification of amino acid residues important in the cyclization reactions of chalcone and stilbene synthases. , 2000, The Biochemical journal.