Utility of accurate mass tags for proteome-wide protein identification.

An enabling capability for proteomics would be the ability to study protein expression on a global scale. While several different separation and analysis options are being investigated to advance the practice of proteomics, mass spectrometry (MS) is rapidly becoming the core instrumental technology used to characterize the large number of proteins that constitute a proteome. To be most effective, proteomic measurements must be high-throughput, ideally allowing thousands of proteins to be identified on a time scale of hours. Most strategies of identification by MS rely on the analysis of enzymatically produced peptides originating from an isolated protein followed by either peptide mapping or tandem MS (MS/MS) to obtain sequence information for a single peptide. In the case of peptide mapping, several peptide masses are needed to unambiguously identify a protein with the typically achieved mass measurement accuracies (MMA). The ability to identify proteins based on the mass of a single peptide (i.e., an accurate mass tag; AMT) is proposed and is largely dependent on the MMA that can be achieved. To determine the MMA necessary to enable the use of AMTs for proteome-wide protein identification, we analyzed the predicted proteins and their tryptic fragments from Saccharomyces cerevisiae and Caenorhabditis elegans. The results show that low ppm (i.e., approximately 1 ppm) level measurements have practical utility for analysis of small proteomes. Additionally, up to 85% of the peptides predicted from these organisms can function as AMTs at sub-ppm MMA levels attainable using Fourier transform ion cyclotron resonance MS. Additional information, such as sequence constraints, should enable even more complex proteomes to be studied at more modest mass measurement accuracies. Once AMTs are established, subsequent high-throughput measurements of proteomes (e.g., after perturbations) will be greatly facilitated.