Comparison of different versions of SignalP and TargetP for diatom plastid protein predictions with ASAFind

Plastid targeted proteins of diatoms and related algae can be predicted with high sensitivity and specificity using the ASAFind method published in 2015. ASAFind predictions rely on SignalP predictions of endoplasmic reticulum (ER) targeting signal peptides. Recently (in 2019), a new version of SignalP was released, SignalP 5.0. We tested the ability of SignalP 5.0 to recognize signal peptides of nucleus-encoded, plastid-targeted diatom pre-proteins, and to identify the signal peptide cleavage site. The results were compared to manual predictions of the characteristic cleavage site motif, and to previous versions of SignalP. SignalP 5.0 is less sensitive than the previous versions of SignalP in this specific task, and also in the detection of signal peptides of non-plastid proteins in diatoms. However, in combination with ASAFind, the resulting prediction performance for plastid proteins is high. In addition, we tested the multi-location prediction tool TargetP for its suitability to provide signal peptide information to ASAFind. The newest version, TargetP 2.0, had the highest prediction performances for diatom signal peptides and mitochondrial transit peptides compared to other versions of SignalP and TargetP, thus it provides a good basis for ASAFind predictions.

[1]  A. Falciatore,et al.  Diatom Molecular Research Comes of Age: Model Species for Studying Phytoplankton Biology and Diversity[OPEN]. , 2020, The Plant cell.

[2]  S. Kiontke,et al.  Optimized mRuby3 is a Suitable Fluorescent Protein for in vivo Co-localization Studies with GFP in the Diatom Phaeodactylum tricornutum. , 2020, Protist.

[3]  O. Winther,et al.  Detecting sequence signals in targeting peptides using deep learning , 2019, Life Science Alliance.

[4]  Ansgar Gruber,et al.  Organelle Studies and Proteome Analyses of Mitochondria and Plastids Fractions from the Diatom Thalassiosira pseudonana , 2019, Plant & cell physiology.

[5]  Søren Brunak,et al.  A Brief History of Protein Sorting Prediction , 2019, The Protein Journal.

[6]  M. Oborník,et al.  Subcellular Compartments Interplay for Carbon and Nitrogen Allocation in Chromera velia and Vitrella brassicaformis , 2019, bioRxiv.

[7]  H. Verbruggen,et al.  The golden paradox – a new heterokont lineage with chloroplasts surrounded by two membranes , 2019, Journal of phycology.

[8]  Konstantinos D. Tsirigos,et al.  SignalP 5.0 improves signal peptide predictions using deep neural networks , 2019, Nature Biotechnology.

[9]  F. Daboussi,et al.  Genome editing in diatoms: achievements and goals , 2018, Plant Cell Reports.

[10]  A. Hehl,et al.  Mitochondrial Glycolysis in a Major Lineage of Eukaryotes , 2018, Genome biology and evolution.

[11]  I. Haferkamp,et al.  Reduced vacuolar β-1,3-glucan synthesis affects carbohydrate metabolism as well as plastid homeostasis and structure in Phaeodactylum tricornutum , 2018, Proceedings of the National Academy of Sciences.

[12]  D. Moog In Silico Tools for the Prediction of Protein Import into Secondary Plastids. , 2018, Methods in molecular biology.

[13]  Ansgar Gruber,et al.  Intracellular metabolic pathway distribution in diatoms and tools for genome-enabled experimental diatom research , 2017, Philosophical Transactions of the Royal Society B: Biological Sciences.

[14]  Hanhua Hu,et al.  The peculiar carbon metabolism in diatoms , 2017, Philosophical Transactions of the Royal Society B: Biological Sciences.

[15]  U. Maier,et al.  The Central Vacuole of the Diatom Phaeodactylum tricornutum: Identification of New Vacuolar Membrane Proteins and of a Functional Di-leucine-based Targeting Motif. , 2017, Protist.

[16]  D. Richter,et al.  Chimeric origins of ochrophytes and haptophytes revealed through an ancient plastid proteome , 2017, eLife.

[17]  Henrik Nielsen,et al.  Predicting Secretory Proteins with SignalP. , 2017, Methods in molecular biology.

[18]  P. Kroth,et al.  Diatom Vacuolar 1,6‐β‐Transglycosylases can Functionally Complement the Respective Yeast Mutants , 2016, The Journal of eukaryotic microbiology.

[19]  T. Mock,et al.  Plastid proteome prediction for diatoms and other algae with secondary plastids of the red lineage , 2015, The Plant journal : for cell and molecular biology.

[20]  Ansgar Gruber,et al.  Deducing intracellular distributions of metabolic pathways from genomic data. , 2014, Methods in molecular biology.

[21]  Pitter F. Huesgen,et al.  Proteomic Amino-Termini Profiling Reveals Targeting Information for Protein Import into Complex Plastids , 2013, PloS one.

[22]  A. Salamov,et al.  Algal genomes reveal evolutionary mosaicism and the fate of nucleomorphs , 2012, Nature.

[23]  S. Brunak,et al.  SignalP 4.0: discriminating signal peptides from transmembrane regions , 2011, Nature Methods.

[24]  Uwe G Maier,et al.  The physical and functional borders of transit peptide-like sequences in secondary endosymbionts , 2010, BMC Plant Biology.

[25]  E. Virginia Armbrust,et al.  The life of diatoms in the world's oceans , 2009, Nature.

[26]  Yann Guermeur,et al.  HECTAR: A method to predict subcellular targeting in heterokonts , 2008, BMC Bioinformatics.

[27]  G. McFadden,et al.  Plastid evolution. , 2008, Annual review of plant biology.

[28]  N. Patron,et al.  Transit peptide diversity and divergence: A global analysis of plastid targeting signals. , 2007, BioEssays : news and reviews in molecular, cellular and developmental biology.

[29]  Ansgar Gruber,et al.  Protein targeting into complex diatom plastids: functional characterisation of a specific targeting motif , 2007, Plant Molecular Biology.

[30]  S. Brunak,et al.  Locating proteins in the cell using TargetP, SignalP and related tools , 2007, Nature Protocols.

[31]  P. Kroth,et al.  Identification and characterization of a new conserved motif within the presequence of proteins targeted into complex diatom plastids. , 2004, The Plant journal : for cell and molecular biology.

[32]  Nicholas H. Putnam,et al.  The Genome of the Diatom Thalassiosira Pseudonana: Ecology, Evolution, and Metabolism , 2004, Science.

[33]  S. Brunak,et al.  Improved prediction of signal peptides: SignalP 3.0. , 2004, Journal of molecular biology.

[34]  D. Bhaya,et al.  Targeting proteins to diatom plastids involves transport through an endoplasmic reticulum , 1991, Molecular and General Genetics MGG.

[35]  A. Grossman,et al.  In vivo characterization of diatom multipartite plastid targeting signals , 2002, Journal of Cell Science.

[36]  P. Kroth Protein transport into secondary plastids and the evolution of primary and secondary plastids. , 2002, International review of cytology.

[37]  S. Brunak,et al.  Predicting subcellular localization of proteins based on their N-terminal amino acid sequence. , 2000, Journal of molecular biology.

[38]  Pierre Baldi,et al.  Assessing the accuracy of prediction algorithms for classification: an overview , 2000, Bioinform..

[39]  P. Kroth,et al.  Protein Transport into “Complex” Diatom Plastids Utilizes Two Different Targeting Signals* , 1998, The Journal of Biological Chemistry.

[40]  Anders Krogh,et al.  Prediction of Signal Peptides and Signal Anchors by a Hidden Markov Model , 1998, ISMB.

[41]  S. Brunak,et al.  SHORT COMMUNICATION Identification of prokaryotic and eukaryotic signal peptides and prediction of their cleavage sites , 1997 .

[42]  H. Strotmann,et al.  Structure of the nuclear encoded γ subunit of CF0CF1 of the diatom Odontella sinensis including its presequence , 1993, FEBS letters.