Spectra library assisted de novo peptide sequencing for HCD and ETD spectra pairs

Background: De novo peptide sequencing via tandem mass spectrometry (MS/MS) has developed rapidly in recent years. Using pairs of spectra acquired from the same peptide under different fragmentation modes greatly improves de novo sequencing performance. With large numbers of spectra now sequenced every day, spectra libraries containing tens of thousands of annotated experimental MS/MS spectra have become available. These libraries provide information about spectral properties and therefore have the potential to improve de novo sequencing.

Results: In this study, an improved de novo sequencing method assisted by spectra libraries is proposed. It uses spectra libraries as training datasets and introduces significant scores for the features used in our previous de novo sequencing method for HCD and ETD spectra pairs. Two pairs of HCD and ETD spectral datasets were used to compare the performance of the proposed method with that of our previous method. The results show that the proposed method achieves better sequencing accuracy, ranks correct sequences higher, and requires less computational time.

Conclusions: This paper proposes an advanced de novo sequencing method for HCD and ETD spectra pairs that uses information from spectra libraries and significantly improves on previous similar methods.
