Chemometric Multivariate Tools for Candidate Biomarker Identification: LDA, PLS-DA, SIMCA, Ranking-PCA.

Two-dimensional (2-D) gel electrophoresis usually yields complex maps with low reproducibility, which hampers the use of spot volume data for identifying reliable biomarkers. Under these circumstances, effective and robust methods for comparing and classifying 2-D maps are essential for identifying an exhaustive panel of candidate biomarkers. Multivariate methods are the most suitable because they take into account the relationships between variables, i.e., synergistic and antagonistic effects between spots. Here, the multivariate methods most commonly used to analyze spot volume datasets are presented: LDA, PLS-DA, SIMCA, and Ranking-PCA. The methods are applied to a sample dataset to demonstrate their effectiveness.
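The point about relationships between variables can be illustrated with a minimal sketch. It is not the procedure used in the chapter: the spot volumes below are invented for illustration, the helper names (`class_mean`, `scatter`, `accuracy`) are hypothetical, and a plain two-variable Fisher linear discriminant stands in for the LDA, PLS-DA, SIMCA and Ranking-PCA models named above. The two spots co-vary strongly from gel to gel, so neither one separates the classes alone, while their combination does.

```python
from statistics import mean

# Hypothetical spot volumes (arbitrary units): (spot 1, spot 2) per gel.
# Both spots rise and fall together across gels; the classes differ only
# in the relationship between the two spots.
control = [(1.0, 1.1), (2.0, 2.1), (3.0, 2.9), (4.0, 4.1), (5.0, 4.9), (6.0, 6.0)]
diseased = [(1.5, 0.6), (2.5, 1.4), (3.5, 2.6), (4.5, 3.4), (5.5, 4.6), (6.5, 5.5)]


def class_mean(samples):
    """Mean volume of each of the two spots within one class."""
    return [mean(s[j] for s in samples) for j in range(2)]


def scatter(samples, centre):
    """2x2 scatter matrix of one class around its own mean."""
    s = [[0.0, 0.0], [0.0, 0.0]]
    for x in samples:
        d = [x[0] - centre[0], x[1] - centre[1]]
        for i in range(2):
            for j in range(2):
                s[i][j] += d[i] * d[j]
    return s


def accuracy(scores_a, scores_b):
    """Fraction of samples on the correct side of the midpoint
    between the two class-mean scores."""
    ma, mb = mean(scores_a), mean(scores_b)
    cut = (ma + mb) / 2
    sign = 1 if mb > ma else -1
    correct = sum(sign * (s - cut) < 0 for s in scores_a)
    correct += sum(sign * (s - cut) > 0 for s in scores_b)
    return correct / (len(scores_a) + len(scores_b))


# Univariate view: classify on each spot separately.
acc_spot1 = accuracy([s[0] for s in control], [s[0] for s in diseased])
acc_spot2 = accuracy([s[1] for s in control], [s[1] for s in diseased])

# Multivariate view: Fisher discriminant direction w = Sw^-1 (m_b - m_a),
# where Sw is the pooled within-class scatter matrix.
m_a, m_b = class_mean(control), class_mean(diseased)
s_a, s_b = scatter(control, m_a), scatter(diseased, m_b)
sw = [[s_a[i][j] + s_b[i][j] for j in range(2)] for i in range(2)]
det = sw[0][0] * sw[1][1] - sw[0][1] * sw[1][0]
diff = [m_b[0] - m_a[0], m_b[1] - m_a[1]]
w = [
    (sw[1][1] * diff[0] - sw[0][1] * diff[1]) / det,
    (-sw[1][0] * diff[0] + sw[0][0] * diff[1]) / det,
]


def project(samples):
    """Discriminant score of each sample along w."""
    return [w[0] * x[0] + w[1] * x[1] for x in samples]


acc_lda = accuracy(project(control), project(diseased))

print("spot 1 alone:", acc_spot1)
print("spot 2 alone:", acc_spot2)
print("both spots (Fisher LDA):", acc_lda)
```

On this toy data each spot taken alone classifies only half of the gels correctly, whereas the discriminant score built from both spots separates the two classes completely. Real spot volume datasets contain far more spots than samples, which is one reason methods that first reduce dimensionality, such as PCA-based approaches and PLS-DA, are commonly used in that setting.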
