P4P: a peptidome-based strain-level genome comparison web tool

Abstract Peptidome similarity analysis enables researchers to gain insights into differential peptide profiles, providing a robust tool to discriminate strain-specific peptides, true intra-species differences among biological replicates or even microorganism-phenotype variations. However, no in silico peptide fingerprinting software existed to facilitate such phylogeny inference. Hence, we developed the Peptidomes for Phylogenies (P4P) web tool, which enables the survey of similarities between microbial proteomes and simplifies the process of obtaining new biological insights into their phylogeny. P4P can be used to analyze different peptide datasets, i.e. bacteria, viruses, eukaryotic species or even metaproteomes. Also, it is able to work with whole proteome datasets and experimental mass-to-charge lists originated from mass spectrometers. The ultimate aim is to generate a valid and manageable list of peptides that have phylogenetic signal and are potentially sample-specific. Sample-to-sample comparison is based on a consensus peak set matrix, which can be further submitted to phylogenetic analysis. P4P holds great potential for improving phylogenetic analyses in challenging taxonomic groups, biomarker identification or epidemiologic studies. Notably, P4P can be of interest for applications handling large proteomic datasets, which it is able to reduce to small matrices while maintaining high phylogenetic resolution. The web server is available at http://sing-group.org/p4p.

[1]  Aleksey Y. Ogurtsov,et al.  Identification of Microorganisms by High Resolution Tandem Mass Spectrometry with Accurate Statistical Significance , 2015, Journal of The American Society for Mass Spectrometry.

[2]  Markus Ringnér,et al.  SPECLUST: a web tool for clustering of mass spectra , 2007 .

[3]  J. Peter Gogarten,et al.  Bioinformatic Genome Comparisons for Taxonomic and Phylogenetic Assignments Using Aeromonas as a Test Case , 2014, mBio.

[4]  Ramon Rosselló-Móra,et al.  Towards a taxonomy of Bacteria and Archaea based on interactive and cumulative data repositories. , 2012, Environmental microbiology.

[5]  John P. Huelsenbeck,et al.  MRBAYES: Bayesian inference of phylogenetic trees , 2001, Bioinform..

[6]  Hans-Peter Klenk,et al.  Standard operating procedure for calculating genome-to-genome distances based on high-scoring segment pairs , 2010, Standards in genomic sciences.

[7]  Alexander F. Auch,et al.  Genome sequence-based species delimitation with confidence intervals and improved distance functions , 2013, BMC Bioinformatics.

[8]  Anália Lourenço,et al.  Improving Phylogeny Reconstruction at the Strain Level Using Peptidome Datasets , 2016, PLoS Comput. Biol..

[9]  D. Maddison,et al.  NEXUS: an extensible file format for systematic information. , 1997, Systematic biology.

[10]  Erik Kristiansson,et al.  Proteotyping: Proteomic characterization, classification and identification of microorganisms--A prospectus. , 2015, Systematic and applied microbiology.

[11]  Oliver Horlacher,et al.  MzJava: An open source library for mass spectrometry data processing. , 2015, Journal of proteomics.

[12]  M. Ringnér,et al.  Detection and identification of protein isoforms using cluster analysis of MALDI-MS mass spectra. , 2006, Journal of proteome research.

[13]  Martin Ester,et al.  PSORTb 3.0: improved protein subcellular localization prediction with refined localization subcategories and predictive capabilities for all prokaryotes , 2010, Bioinform..

[14]  Baltasar Mayo,et al.  Viability and diversity of probiotic Lactobacillus and Bifidobacterium populations included in commercial fermented milks , 2004 .

[15]  Anália Lourenço,et al.  A peptidome-based phylogeny pipeline reveals differential peptides at the strain level within Bifidobacterium animalis subsp. lactis. , 2016, Food microbiology.

[16]  Rodolphe Barrangou,et al.  Strain-Specific Genotyping of Bifidobacterium animalis subsp. lactis by Using Single-Nucleotide Polymorphisms, Insertions, and Deletions , 2009, Applied and Environmental Microbiology.

[17]  Hans-Peter Klenk,et al.  Digital DNA-DNA hybridization for microbial species delineation by means of genome-to-genome sequence comparison , 2010, Standards in genomic sciences.

[18]  M R Adams,et al.  Determination of survival, identity and stress resistance of probiotic bifidobacteria in bio‐yoghurts , 2006, Letters in Applied Microbiology.

[19]  Glenn R. Gibson,et al.  The International Scientific Association for Probiotics and Prebiotics consensus statement on the scope and appropriate use of the term probiotic , 2014 .

[20]  C. Allen,et al.  Genomic and proteomic evidence supporting the division of the plant pathogen Ralstonia solanacearum into three species , 2016, BMC Genomics.

[21]  R. Roberts,et al.  Development of a rapid SNP-typing assay to differentiate Bifidobacterium animalis ssp. lactis strains used in probiotic-supplemented dairy products. , 2015, Journal of dairy science.

[22]  N. Singhal,et al.  MALDI-TOF mass spectrometry: an emerging technology for microbial identification and diagnosis , 2015, Front. Microbiol..