Application of Multivariate Data Analysis for the Classification of Two Dimensional Gel Images in Neuroproteomics

Two-dimensional gel electrophoresis (2DE) still plays a key role in proteomics for exploring the protein content of complex biological mixtures. However, the development of fully automatic strategies in extracting interpretable information from gel images is still a challenging task. In this work, we present a computational strategy aiming at an automatic classification of the discriminant patterns emerging from separation images intended as fingerprints of the correspondent biological conditions. The method was applied to gel images acquired in a study on motor neuron diseases: 33 2DE maps generated from samples of cerebrospinal fluid were processed (26 pathologic and 7 control subjects). Quantitative image descriptors were extracted and fitted to a partial least squares-discriminant analysis (PLSDA) assessing the chance to classify the samples. Moreover, the model was able to identify gel areas that most differ through the clinical categories. Combining multivariate statistical techniques with 2DE may represent a valid tool to extract informative protein patterns. This kind of approach can contribute to the development of a system of screening to discriminate different clinical conditions on the basis of the overall patterns emerging from the maps, representing a useful complementary analysis in the routine of a proteomic laboratory.

[1]  Guang-Zhong Yang,et al.  The role of bioinformatics in two‐dimensional gel electrophoresis , 2003, Proteomics.

[2]  Stefano Cappa,et al.  Differential expression of ceruloplasmin isoforms in the cerebrospinal fluid of amyotrophic lateral sclerosis patients , 2008, Proteomics. Clinical applications.

[3]  T. Rabilloud Two‐dimensional gel electrophoresis in proteomics: Old, old fashioned, but it still climbs up the mountains , 2002, Proteomics.

[4]  Tero Aittokallio,et al.  Comparison of PDQuest and Progenesis software packages in the analysis of two‐dimensional electrophoresis gels , 2003, Proteomics.

[5]  K. Blennow,et al.  Proteomics for drug target discovery , 2004 .

[6]  M. Tenenhaus,et al.  Prediction of clinical outcome with microarray data: a partial least squares discriminant analysis (PLS-DA) approach , 2003, Human Genetics.

[7]  Francesca Antonucci,et al.  Numerical approaches for quantitative analysis of two‐dimensional maps: A review of commercial software and home‐made systems , 2005, Proteomics.

[8]  S. Wold,et al.  PLS-regression: a basic tool of chemometrics , 2001 .

[9]  J Gottfries,et al.  Diagnosis of dementias using partial least squares discriminant analysis. , 1995, Dementia.

[10]  A. Ludolph,et al.  Amyotrophic lateral sclerosis. , 2012, Current opinion in neurology.

[11]  M. Barker,et al.  Partial least squares for discrimination , 2003 .

[12]  Peter Mose Larsen,et al.  2D or not 2D , 2001 .

[13]  Danh V. Nguyen,et al.  Multi-class cancer classification via partial least squares with gene expression profiles , 2002, Bioinform..

[14]  S. Fey,et al.  2D or not 2D. Two-dimensional gel electrophoresis. , 2001, Current opinion in chemical biology.

[15]  D. Speicher,et al.  Two‐Dimensional Gel Electrophoresis , 1999, Current protocols in cell biology.

[16]  Bo Jørgensen,et al.  Extractin information from two‐dimensional electrophoresis gels by partial least squares regression , 2002 .

[17]  D. Baunsgaard,et al.  Mechanisms of hydrazine toxicity in rat liver investigated by proteomics and multivariate data analysis , 2004, Proteomics.

[18]  Paola Gramatica,et al.  Methods for reliability and uncertainty assessment and for applicability evaluations of classification- and regression-based QSARs. , 2003, Environmental health perspectives.

[19]  Erik Johansson,et al.  Detection of ovarian cancer using chemometric analysis of proteomic profiles , 2006 .

[20]  Flemming Jessen,et al.  Combination of statistical approaches for analysis of 2-DE data gives complementary results. , 2008, Journal of proteome research.

[21]  Anne-Laure Boulesteix,et al.  Partial least squares: a versatile tool for the analysis of high-dimensional genomic data , 2006, Briefings Bioinform..

[22]  Anil K. Jain,et al.  Statistical Pattern Recognition: A Review , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Flemming Jessen,et al.  Extracting information from two-dimensional electrophoresis gels by partial least squares regression. , 2002, Proteomics.

[24]  Danh V. Nguyen,et al.  Tumor classification by partial least squares using microarray gene expression data , 2002, Bioinform..

[25]  Xiwu Lin,et al.  Megavariate data analysis of mass spectrometric proteomics data using latent variable projection method , 2003, Proteomics.

[26]  Margaret A. Nemeth,et al.  Multi- and Megavariate Data Analysis , 2003, Technometrics.

[27]  N. Karp,et al.  Application of partial least squares discriminant analysis to two‐dimensional difference gel studies in expression proteomics , 2005, Proteomics.

[28]  Sergio Cerutti,et al.  An Integrated Strategy in Two-Dimensional Electrophoresis Analysis Able to Identify Discriminants Between Different Clinical Conditions , 2008, Experimental biology and medicine.

[29]  S. Bijlsma,et al.  In search of secreted protein biomarkers for the anti-inflammatory effect of beta2-adrenergic receptor agonists: application of DIGE technology in combination with multivariate and univariate data analysis tools. , 2005, Journal of proteome research.