Application of multivariate curve resolution to the analysis of yeast genome-wide screens

In this work, the application of Multivariate Curve Resolution to the analysis of yeast genome-wide screens obtained by means of DNA microarray technology is shown. In order to perform the analysis of this type of data, two algorithms based on Alternating Least Squares (MCR-ALS) and on its maximum likelihood weighted projection (MCR-WALS) variant are compared. The utilization of the modified weighted alternating least (WALS) squares algorithm is motivated by the rather poor quality, uncertainties and experimental noise associated to DNA microarray data. Moreover, a large number of missing values are usually present in these data sets and the weighted WALS approach allowed circumventing this problem. Two different experimental datasets were used for this comparison. In the first dataset, gene expression values in budding yeast were monitored in-response to glucose limitation. In the second dataset, the changes in the gene expression caused by the daunorubicin drug were monitored as a function of time. Results obtained by application of Multivariate Curve Resolution in the two cases allowed a good recovery of the evolving gene expression profiles and the identification of metabolic pathways and individual genes involved in these gene expression changes.

[1]  R. Tauler Multivariate curve resolution applied to second order data , 1995 .

[2]  Raj Chari,et al.  Recent advances in array comparative genomic hybridization technologies and their applications in human genetics , 2006, European Journal of Human Genetics.

[3]  P. Wentzell,et al.  DNA microarrays: is there a role for analytical chemistry? , 2005, The Analyst.

[4]  Peter D. Wentzell,et al.  Comparison of the results obtained by four receptor modelling methods in aerosol source apportionment studies , 2009 .

[5]  I. Androulakis,et al.  Analysis of time-series gene expression data: methods, challenges, and opportunities. , 2007, Annual review of biomedical engineering.

[6]  Erik Johansson,et al.  Using chemometrics for navigating in the large data sets of genomics, proteomics, and metabonomics (gpm) , 2004, Analytical and bioanalytical chemistry.

[7]  Kara Dolinski,et al.  Homeostatic adjustment and metabolic remodeling in glucose-limited yeast cultures. , 2005, Molecular biology of the cell.

[8]  John Quackenbush,et al.  Computational genetics: Computational analysis of microarray data , 2001, Nature Reviews Genetics.

[9]  Ka Yee Yeung,et al.  Principal component analysis for clustering gene expression data , 2001, Bioinform..

[10]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[11]  J. Tukey,et al.  Variations of Box Plots , 1978 .

[12]  Ronald W. Davis,et al.  Quantitative Monitoring of Gene Expression Patterns with a Complementary DNA Microarray , 1995, Science.

[13]  Romà Tauler,et al.  Multivariate Curve Resolution (MCR) from 2000: Progress in Concepts and Applications , 2006 .

[14]  Ziv Bar-Joseph,et al.  Analyzing time series gene expression data , 2004, Bioinform..

[15]  David J. C. MacKay,et al.  Reproducibility Assessment of Independent Component Analysis of Expression Ratios From DNA Microarrays , 2003, Comparative and functional genomics.

[16]  Gene H. Golub,et al.  Singular value decomposition and least squares solutions , 1970, Milestones in Matrix Computation.

[17]  B. Piña,et al.  Selective inhibition of yeast regulons by daunorubicin: A transcriptome-wide analysis , 2008, BMC Genomics.

[18]  Romà Tauler,et al.  Assessment of new constraints applied to the alternating least squares method , 1997 .

[19]  Peter D Wentzell,et al.  Bootstrap method for the estimation of measurement uncertainty in spotted dual-color DNA microarrays , 2007, Analytical and bioanalytical chemistry.

[20]  Bart J A Mertens,et al.  Microarrays, pattern recognition and exploratory data analysis , 2003, Statistics in medicine.

[21]  Paul Geladi,et al.  Principal Component Analysis , 1987, Comprehensive Chemometrics.

[22]  Raimundo Gargallo,et al.  Application of multivariate resolution methods to the study of biochemical and biophysical processes. , 2004, Analytical biochemistry.

[23]  Margaret Werner-Washburne,et al.  BMC Bioinformatics BioMed Central Methodology article Multivariate curve resolution of time course microarray data , 2006 .

[24]  Michael Ruogu Zhang,et al.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. , 1998, Molecular biology of the cell.

[25]  A. Brazma,et al.  Gene expression data analysis. , 2001, FEBS letters.

[26]  Romà Tauler,et al.  Exploratory data analysis of DNA microarrays by multivariate curve resolution. , 2006, Analytical biochemistry.

[27]  S. Van Huffel,et al.  On the equivalence between total least squares and maximum likelihood PCA , 2005 .

[28]  P. Brown,et al.  DNA arrays for analysis of gene expression. , 1999, Methods in enzymology.

[29]  W. Windig,et al.  Interactive self-modeling mixture analysis , 1991 .

[30]  B. Kowalski,et al.  Selectivity, local rank, three‐way data analysis and ambiguity in multivariate curve resolution , 1995 .

[31]  John Quackenbush Microarray data normalization and transformation , 2002, Nature Genetics.

[32]  E. Kok,et al.  The application of DNA microarrays in gene expression analysis. , 2000, Journal of biotechnology.

[33]  Romà Tauler,et al.  A graphical user-friendly interface for MCR-ALS: a new tool for multivariate curve resolution in MATLAB , 2005 .

[34]  David M. Haaland,et al.  Multivariate curve resolution for hyperspectral image analysis: applications to microarray technology , 2003, SPIE BiOS.