Reducing Mass Degeneracy in SAR by MS by Stable Isotopic Labeling

Mass spectrometry (MS) promises to be an invaluable tool for functional genomics by supporting low-cost, high-throughput experiments. However, large-scale MS faces the potential problem of mass degeneracy: multiple biopolymer fragments (e.g., from a limited proteolytic digest) can have indistinguishable masses. This paper studies the tasks of planning and interpreting MS experiments that use selective isotopic labeling to substantially reduce potential mass degeneracy. Our algorithms support an experimental-computational protocol called Structure-Activity Relation by Mass Spectrometry (SAR by MS), for elucidating the function of protein-DNA and protein-protein complexes. SAR by MS enzymatically cleaves a crosslinked complex and analyzes the resulting mass spectrum for mass peaks of hypothesized fragments. Depending on the binding mode, some cleavage sites will be shielded; the absence of an anticipated peak implicates the corresponding fragment as either part of the interaction region or inaccessible due to conformational change upon binding. Different mass spectra thus provide evidence for different structure-activity relations. We address combinatorial and algorithmic questions in two areas: data analysis (constraining the binding mode based on the mass signature) and experiment planning (determining an isotopic labeling strategy that reduces mass degeneracy and aids data analysis). We explore the computational complexity of these problems, obtaining upper and lower bounds, and we report experimental results from implementations of our algorithms.
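
To make the notion of mass degeneracy concrete, the following minimal Python sketch shows how two pairs of toy peptide fragments collide in mass and how selectively labeling one amino acid type can separate them. It is an illustration under stated assumptions, not the algorithm developed in the paper: the fragment strings, the function names `fragment_mass` and `degenerate_pairs`, the 0.01 Da tolerance, and the flat 1.0 Da shift per labeled residue are all hypothetical choices. In practice the shift depends on the isotope used and on how many labeled atoms each residue carries.

```python
from itertools import combinations

# Approximate monoisotopic residue masses in Da (small subset, for illustration).
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203,
    "T": 101.04768, "N": 114.04293, "K": 128.09496,
}
WATER = 18.01056  # one water per free peptide (terminal H and OH)


def fragment_mass(seq, labeled=frozenset(), shift=1.0):
    """Return the mass of a fragment.

    Each residue whose type is in `labeled` gains `shift` Da.
    The flat per-residue shift is a simplifying assumption.
    """
    base = sum(RESIDUE_MASS[r] for r in seq) + WATER
    return base + shift * sum(1 for r in seq if r in labeled)


def degenerate_pairs(fragments, labeled=frozenset(), tol=0.01):
    """Return all fragment pairs whose masses differ by at most `tol` Da."""
    masses = {f: fragment_mass(f, labeled) for f in fragments}
    return [
        (a, b)
        for a, b in combinations(fragments, 2)
        if abs(masses[a] - masses[b]) <= tol
    ]


# Hypothetical fragments: G+G weighs the same as N, and A+S the same as G+T.
fragments = ["GGK", "NK", "ASK", "GTK"]

unlabeled = degenerate_pairs(fragments)
label_g = degenerate_pairs(fragments, labeled=frozenset("G"))
label_k = degenerate_pairs(fragments, labeled=frozenset("K"))

print("unlabeled:", unlabeled)
print("label G:  ", label_g)
print("label K:  ", label_k)
```

In this toy setting, labeling glycine separates both colliding pairs because the fragments in each pair contain different numbers of glycine residues, whereas labeling lysine shifts every fragment equally and resolves nothing. Choosing which residue types to label is the kind of experiment-planning question the abstract refers to.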
