论文信息 - MS2DB: A Mass-Based Hashing Algorithm for the Identification of Disulfide Linkage Patterns in Protein Utilizing Mass Spectrometric Data

MS2DB: A Mass-Based Hashing Algorithm for the Identification of Disulfide Linkage Patterns in Protein Utilizing Mass Spectrometric Data

The tertiary structure and biological function of a protein can be better understood given knowledge of the number and location of its disulfide bonds. By utilizing mass spectrometric (MS) experimental procedures that produce spectra of the protein's peptides joined by a disulfide bond, we can make initial identifications of these bonded cysteine pairings. The algorithmic problem then becomes how to match a theoretical mass space of all possible bonded peptides against the MS data. Our solution, MSHashID, utilizes the expected amino acid mass in combination with a hash structure to improve the time complexity of making an identification from worse than O(n2) to approximately O(n), where n is the size of the mass space. We have developed a software package, MS2DB, which includes an implementation of this algorithm. Experiments using published data show that the MSHashID algorithm efficiently makes the correct initial identifications, which can then be confirmed using tandem mass spectrometry (MS/MS).

Rahul Singh | Timothy Lee | Ten-Yang Yen | Bruce Macher

[1] Ten-Yang Yen,et al. Determination of glycosylation sites and disulfide bond structures using LC/ESI-MS/MS analysis. , 2006, Methods in enzymology.

[2] R M Knegtel,et al. Neighboring cysteine residues in human fucosyltransferase VII are engaged in disulfide bridges, forming small loop structures. , 2001, Glycobiology.

[3] R C Beavis,et al. Implementation of an algorithm for modeling disulfide bond patterns using mass spectrometry. , 2003, Journal of proteome research.

[4] R. K. Shyamasundar,et al. Introduction to algorithms , 1996 .

[5] Ten-Yang Yen,et al. Highly Conserved Cysteines of Mouse Core 2 β1,6-N-Acetylglucosaminyltransferase I Form a Network of Disulfide Bonds and Include a Thiol That Affects Enzyme Activity* , 2003, Journal of Biological Chemistry.

[6] Paolo Frasconi,et al. Disulfide connectivity prediction using recursive neural networks and evolutionary information , 2004, Bioinform..

[7] D. Smith,et al. Strategies for locating disulfide bonds in proteins. , 1990, Methods in enzymology.

[8] A. El-Battari,et al. Unique Disulfide Bond Structures Found in ST8Sia IV Polysialyltransferase Are Required for Its Activity* , 2001, The Journal of Biological Chemistry.

[9] Rahul Singh,et al. MS2DB: An Algorithmic Approach to Determine Disulfide Linkage Patterns , 2006, 19th IEEE Symposium on Computer-Based Medical Systems (CBMS'06).