Predictions of peptides' retention times in reversed‐phase liquid chromatography as a new supportive tool to improve protein identification in proteomics

One of the initial steps of proteomic analysis is peptide separation. However, little information from RP‐HPLC, employed for peptides separation, is utilized in proteomics. Meanwhile, prediction of the retention time for a given peptide, combined with routine MS/MS data analysis, could help to improve the confidence of peptide identifications. Recently, a number of models has been proposed to characterize quantitatively the structure of a peptide and to predict its gradient RP‐HPLC retention at given separation conditions. The chromatographic behavior of peptides has usually been related to their amino acid composition. However, different values of retention coefficients of the same amino acid in different peptides at different neighborhoods were commonly observed. Therefore, specific retention coefficients were derived by regression analysis or by artificial neural networks (ANNs) with the use of a set of peptides retention. In the review, various approaches for peptide elution time prediction in RP‐HPLC are presented and critically discussed. The contribution of sequence dependent parameters (e.g., amphipathicity or peptide sequence) and peptide physicochemical descriptors (e.g., hydrophobicity or peptide length) that have been shown to affect the peptide retention time in LC are considered and analyzed. The predictive capability of the retention time prediction models based on quantitative structure–retention relationships (QSRRs) are discussed in details. Advantages and limitations of various retention prediction strategies are identified. It is concluded that proper processing of chromatographic data by statistical learning techniques can result in information of direct use for proteomics, which is otherwise wasted.

[1]  Knut Reinert,et al.  OpenMS – An open-source software framework for mass spectrometry , 2008, BMC Bioinformatics.

[2]  R. Taft,et al.  Study of retention processes in reversed-phase high-performance liquid chromatography by the use of the solvatochromic comparison method. , 1985, Analytical chemistry.

[3]  P. Jurs,et al.  Prediction of gas and liquid chromatographic retention indexes of polyhalogenated biphenyls , 1990 .

[4]  Masaru Tomita,et al.  Prediction of liquid chromatographic retention times of peptides generated by protease digestion of the Escherichia coli proteome using artificial neural networks. , 2006, Journal of proteome research.

[5]  R. Kaliszan Chapter 11 Recent advances in quantitative structure-retention relationships (QSRR) , 2000 .

[6]  C. Hansch,et al.  p-σ-π Analysis. A Method for the Correlation of Biological Activity and Chemical Structure , 1964 .

[7]  J. Meek,et al.  Factors affecting retention and resolution of peptides in high-performance liquid chromatography , 1981 .

[8]  Magnus Palmblad,et al.  Prediction of chromatographic retention and protein identification in liquid chromatography/mass spectrometry. , 2002, Analytical chemistry.

[9]  Brian Tripet,et al.  Requirements for prediction of peptide retention time in reversed-phase high-performance liquid chromatography: hydrophilicity/hydrophobicity of side-chains at the N- and C-termini of peptides are dramatically affected by the end-groups and location. , 2007, Journal of chromatography. A.

[10]  D. Martire,et al.  Unified molecular theory of chromatography and its application to supercritical fluid mobile phases. 1. Fluid-liquid (absorption) chromatography , 1987 .

[11]  A. Görg,et al.  The current state of two‐dimensional electrophoresis with immobilized pH gradients , 2000, Electrophoresis.

[12]  John P Cortens,et al.  Use of peptide retention time prediction for protein identification by off-line reversed-phase HPLC-MALDI MS/MS. , 2006, Analytical chemistry.

[13]  R. Kearney,et al.  Quantification of uncertainty of peptide retention time predictions from a sequence-based model in LC-MS/MS proteomics experiments , 2007, 2007 29th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[14]  P. Martín-Alvarez,et al.  Comparative prediction of the retention behaviour of small peptides in several reversed-phase high-performance liquid chromatography columns by using partial least squares and multiple linear regression , 1996 .

[15]  J. Yates,et al.  Large-scale analysis of the yeast proteome by multidimensional protein identification technology , 2001, Nature Biotechnology.

[16]  Tomasz Ba¸czek,et al.  Influence of Acetyl and Amide Groups on Peptides RP-LC Retention Behavior , 2008 .

[17]  R. Anderegg,et al.  Two-dimensional SEC/RPLC coupled to mass spectrometry for the analysis of peptides. , 1997, Analytical chemistry.

[18]  T. Rabilloud Two‐dimensional gel electrophoresis in proteomics: Old, old fashioned, but it still climbs up the mountains , 2002, Proteomics.

[19]  R. Kaliszan Quantitative structure-retention relationships , 1992 .

[20]  S. Gygi,et al.  Quantitative analysis of complex protein mixtures using isotope-coded affinity tags , 1999, Nature Biotechnology.

[21]  Ronald J. Moore,et al.  Phosphopeptide elution times in reversed-phase liquid chromatography. , 2007, Journal of chromatography. A.

[22]  C. Mant,et al.  Context-dependent effects on the hydrophilicity/hydrophobicity of side-chains during reversed-phase high-performance liquid chromatography: Implications for prediction of peptide retention behaviour. , 2006, Journal of chromatography. A.

[23]  A. Katritzky,et al.  QSPR correlation and predictions of GC retention indexes for methyl-branched hydrocarbons produced by insects. , 2000, Analytical chemistry.

[24]  Sarka Beranova-Giorgianni,et al.  Proteome analysis by two-dimensional gel electrophoresis and mass spectrometry: strengths and limitations , 2003 .

[25]  R. Kaliszan,et al.  Comparative evaluation of high-performance liquid chromatography stationary phases used for the separation of peptides in terms of quantitative structure-retention relationships. , 2007, Journal of chromatography. A.

[26]  Yvan Vander Heyden,et al.  The evaluation of two‐step multivariate adaptive regression splines for chromatographic retention prediction of peptides , 2007, Proteomics.

[27]  Ruedi Aebersold,et al.  Proteome analysis of low-abundance proteins using multidimensional chromatography and isotope-coded affinity tags. , 2002, Journal of proteome research.

[28]  T. Bączek Chemometric evaluation of relationships between retention and physicochemical parameters in terms of multidimensional liquid chromatography of peptides. , 2006, Journal of separation science.

[29]  Zbigniew Grzonka,et al.  Prediction of high‐performance liquid chromatography retention of peptides with the use of quantitative structure‐retention relationships , 2005, Proteomics.

[30]  Knut Reinert,et al.  TOPP - the OpenMS proteomics pipeline , 2007, Bioinform..

[31]  Joshua E. Elias,et al.  Evaluation of multidimensional chromatography coupled with tandem mass spectrometry (LC/LC-MS/MS) for large-scale protein analysis: the yeast proteome. , 2003, Journal of proteome research.

[32]  R. Taft,et al.  Study of temperature and mobile-phase effects in reversed-phase high-performance liquid chromatography by the use of the solvatochromic comparison method. , 1986, Analytical chemistry.

[33]  C. Mant,et al.  Correlation of protein retention times in reversed-phase chromatography with polypeptide chain length and hydrophobicity. , 1989, Journal of chromatography.

[34]  Magnus Palmblad,et al.  Protein identification by liquid chromatography-mass spectrometry using retention time prediction. , 2004, Journal of chromatography. B, Analytical technologies in the biomedical and life sciences.

[35]  R. Anderegg,et al.  Comprehensive on-line LC/LC/MS of proteins. , 1997, Analytical chemistry.

[36]  T. Hancock,et al.  A performance comparison of modern statistical techniques for molecular descriptor selection and retention prediction in chromatographic QSRR studies , 2005 .

[37]  S Solomon,et al.  The isolation of peptides by high-performance liquid chromatography using predicted elution positions. , 1982, Analytical biochemistry.

[38]  Richard D. Smith,et al.  Application of peptide LC retention time information in a discriminant function for peptide identification by tandem mass spectrometry. , 2004, Journal of proteome research.

[39]  R. Bischoff,et al.  An automated on-line multidimensional HPLC system for protein and peptide mapping with integrated sample preparation. , 2002, Analytical chemistry.

[40]  O. Krokhin,et al.  Sequence-specific retention calculator. Algorithm for peptide retention prediction in ion-pair RP-HPLC: application to 300- and 100-A pore size C18 sorbents. , 2006, Analytical chemistry.

[41]  C. Mant,et al.  Prediction of peptide retention times in reversed-phase high-performance liquid chromatography II. Correlation of observed and predicted peptide retention times factors and influencing the retention times of peptides , 1986 .

[42]  W. Ens,et al.  Sequence-specific retention calculator. A family of peptide retention time prediction algorithms in reversed-phase HPLC: applicability to various chromatographic conditions and columns. , 2007, Analytical chemistry.

[43]  J. Meek Prediction of peptide retention times in high-pressure liquid chromatography on the basis of amino acid composition. , 1980, Proceedings of the National Academy of Sciences of the United States of America.

[44]  Oliver Kohlbacher,et al.  Statistical learning of peptide retention behavior in chromatographic separations: a new kernel-based approach for computational proteomics , 2007, BMC Bioinformatics.

[45]  Ying Xu,et al.  Improved peptide elution time prediction for reversed-phase liquid chromatography-MS by incorporating peptide sequence information. , 2006, Analytical chemistry.

[46]  Mikhail V Gorshkov,et al.  Liquid chromatography at critical conditions: comprehensive approach to sequence-dependent retention time prediction. , 2006, Analytical chemistry.

[47]  R A Houghten,et al.  Effect of positional environmental domains on the variation of high-performance liquid chromatographic peptide retention coefficients. , 1987, Journal of chromatography.

[48]  Xiang Zhang,et al.  Neural network prediction of peptide separation in strong anion exchange chromatography , 2007, Bioinform..

[49]  J. Yates,et al.  Direct analysis of protein complexes using mass spectrometry , 1999, Nature Biotechnology.

[50]  C. Hansch,et al.  SUBSTITUENT CONSTANTS FOR ALIPHATIC FUNCTIONS OBTAINED FROM PARTITION COEFFICIENTS. , 1965, Journal of medicinal chemistry.

[51]  Michael J MacCoss,et al.  Improving tandem mass spectrum identification using peptide retention time prediction across diverse chromatography conditions. , 2007, Analytical chemistry.

[52]  Yvan Vander Heyden,et al.  Prediction of peptide retention at different HPLC conditions from multiple linear regression models. , 2005, Journal of proteome research.

[53]  M Daszykowski,et al.  Retention prediction of peptides based on uninformative variable elimination by partial least squares. , 2006, Journal of proteome research.

[54]  IDENTIFICATION OF PEPTIDES IN PROTEOMICS SUPPORTED BY PREDICTION OF PEPTIDE RETENTION BY MEANS OF QUANTITATIVE STRUCTURE-RETENTION RELATIONSHIPS , 2007 .

[55]  Alan R. Katritzky,et al.  A New Efficient Approach for Variable Selection Based on Multiregression: Prediction of Gas Chromatographic Retention Times and Response Factors , 1999, J. Chem. Inf. Comput. Sci..

[57]  C. Mant,et al.  Effect of preferred binding domains on peptide retention behavior in reversed-phase chromatography: amphipathic alpha-helices. , 1990, Peptide research.

[58]  R. Beavis,et al.  An Improved Model for Prediction of Retention Times of Tryptic Peptides in Ion Pair Reversed-phase HPLC , 2004, Molecular & Cellular Proteomics.

[59]  Roman Kaliszan,et al.  Quantitative structure-chromatographic retention relationships , 1987 .

[60]  Zhide Hu,et al.  Prediction of retention times of peptides in RPLC by using radial basis function neural networks and projection pursuit regression , 2008 .

[61]  G. Opiteck,et al.  Comprehensive on-line RPLC-CZE-MS of peptides , 1997 .

[62]  S. Patterson,et al.  Automated LC-LC-MS-MS platform using binary ion-exchange and gradient reversed-phase chromatography for improved proteomic analyses. , 2001, Journal of Chromatography B: Biomedical Sciences and Applications.

[63]  C. Mant,et al.  Prediction of peptide retention times in reversed-phase high-performance liquid chromatography I. Determination of retention coefficients of amino acid residues of model synthetic peptides , 1986 .

[64]  Toshihide Nishimura,et al.  Protein identification from product ion spectra of peptides validated by correlation between measured and predicted elution times in liquid chromatography/mass spectrometry , 2005, Proteomics.

[65]  Gordon A Anderson,et al.  Use of artificial neural networks for the accurate prediction of peptide liquid chromatography elution times in proteome analyses. , 2003, Analytical chemistry.