SUBA: the Arabidopsis Subcellular Database

Knowledge of protein localisation contributes towards our understanding of protein function and of biological inter-relationships. A variety of experimental methods are currently being used to produce localisation data that need to be made accessible in an integrated manner. Chimeric fluorescent fusion proteins have been used to define subcellular localisations, with at least 1100 related experiments completed in Arabidopsis. More recently, many studies have employed mass spectrometry to undertake proteomic surveys of subcellular components in Arabidopsis, yielding localisation information for approximately 2600 proteins. Further protein localisation information may be obtained from other literature references to analysis of locations (AmiGO: approximately 900 proteins), from location information in Swiss-Prot annotations (approximately 2000 proteins), and from locations inferred from gene descriptions (approximately 2700 proteins). Additionally, a growing body of software predicts protein location from amino acid sequence. We have brought these various data sources together to build SUBA, a SUBcellular location database for Arabidopsis proteins. The localisation data in SUBA encompass 10 distinct subcellular locations and >6743 non-redundant proteins, and represent the proteins encoded in the transcripts responsible for 51% of Arabidopsis expressed sequence tags. SUBA provides a powerful means by which to assess protein subcellular localisation in Arabidopsis (http://www.suba.bcs.uwa.edu.au).
