Integrated analysis of transcript, protein and metabolite data to study lignin biosynthesis in hybrid aspen.

Tree biotechnology will soon reach a mature state where it will influence the overall supply of fiber, energy and wood products. We are now ready to make the transition from identifying candidate genes, controlling important biological processes, to discovering the detailed molecular function of these genes on a broader, more holistic, systems biology level. In this paper, a strategy is outlined for informative data generation and integrated modeling of systematic changes in transcript, protein and metabolite profiles measured from hybrid aspen samples. The aim is to study characteristics of common changes in relation to genotype-specific perturbations affecting the lignin biosynthesis and growth. We show that a considerable part of the systematic effects in the system can be tracked across all platforms and that the approach has a high potential value in functional characterization of candidate genes.

[1]  Matej Oresic,et al.  Integrative biological analysis of the APOE*3-leiden transgenic mouse. , 2004, Omics : a journal of integrative biology.

[2]  M. Hirai,et al.  Functional genomics by integrated analysis of metabolome and transcriptome of Arabidopsis plants over-expressing an MYB transcription factor. , 2005, The Plant journal : for cell and molecular biology.

[3]  Paul Geladi,et al.  Principal Component Analysis , 1987, Comprehensive Chemometrics.

[4]  G. Wingsle,et al.  Purification of a plant mediator from Arabidopsis thaliana identifies PFT1 as the Med25 subunit. , 2007, Molecular cell.

[5]  Eric K. Neumann,et al.  Phenotype Characterisation Using Integrated Gene Transcript, Protein and Metabolite Profiling , 2004, Applied bioinformatics.

[6]  Daniel Eriksson,et al.  MASQOT-GUI: spot quality assessment for the two-channel microarray platform , 2006, Bioinform..

[7]  D. Baunsgaard,et al.  Combination of ‘omics’ data to investigate the mechanism(s) of hydrazine-induced hepatotoxicity in Rats and to identify potential biomarkers , 2004, Biomarkers : biochemical indicators of exposure, response, and susceptibility to chemicals.

[8]  F. Sterky,et al.  MYB transcription factors are differentially expressed and regulated during secondary vascular tissue development in hybrid aspen , 2004, Plant Molecular Biology.

[9]  S. Wold Cross-Validatory Estimation of the Number of Components in Factor and Principal Components Models , 1978 .

[10]  G. Wasteneys Progress in understanding the role of microtubules in plant cells. , 2004, Current opinion in plant biology.

[11]  M. Hirai,et al.  Integration of transcriptomics and metabolomics for understanding of global responses to nutritional stresses in Arabidopsis thaliana. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[12]  F. Loewus,et al.  myo-Inositol metabolism in plants , 2000 .

[13]  Ronald W. Davis,et al.  Quantitative Monitoring of Gene Expression Patterns with a Complementary DNA Microarray , 1995, Science.

[14]  Matej Oresic,et al.  Gene-to-metabolite networks for terpenoid indole alkaloid biosynthesis in Catharanthus roseus cells. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[15]  M. Sjöström,et al.  Design of experiments: an efficient strategy to identify factors influencing extraction and derivatization of Arabidopsis thaliana samples in metabolomic studies with gas chromatography/mass spectrometry. , 2004, Analytical biochemistry.

[16]  Andrew Hayes,et al.  Integrative investigation of metabolic and transcriptomic data , 2006, BMC Bioinformatics.

[17]  S. Gygi,et al.  Correlation between Protein and mRNA Abundance in Yeast , 1999, Molecular and Cellular Biology.

[18]  Jan Karlsson,et al.  The response of the poplar transcriptome to wounding and subsequent infection by a viral pathogen. , 2004, The New phytologist.

[19]  H. Lehrach,et al.  Extraction of proteins from plant tissues for two‐dimensional electrophoresis analysis , 2003, Electrophoresis.

[20]  B. D. Kohorn,et al.  A Cell Wall-associated, Receptor-like Protein Kinase* , 1996, The Journal of Biological Chemistry.

[21]  Joost T. van Dongen,et al.  Combined Transcript and Metabolite Profiling of Arabidopsis Leaves Reveals Fundamental Effects of the Thiol-Disulfide Status on Plant Metabolism1[W][OA] , 2006, Plant Physiology.

[22]  Daniel Eriksson,et al.  MASQOT: a method for cDNA microarray spot quality control , 2005, BMC Bioinformatics.

[23]  Bernard Henrissat,et al.  Biosynthesis of cellulose-enriched tension wood in Populus: global analysis of transcripts and metabolites identifies biochemical and developmental regulators in secondary wall biosynthesis. , 2006, The Plant journal : for cell and molecular biology.

[24]  J. Trygg,et al.  Extraction and GC/MS analysis of the human blood plasma metabolome. , 2005, Analytical chemistry.

[25]  J. S. Hunter,et al.  Statistics for Experimenters: An Introduction to Design, Data Analysis, and Model Building. , 1979 .

[26]  Johan Trygg,et al.  O2‐PLS, a two‐block (X–Y) latent variable regression (LVR) method with an integral OSC filter , 2003 .

[27]  Patrik Rydén,et al.  UPSC-BASE--Populus transcriptomics online. , 2006, The Plant journal : for cell and molecular biology.

[28]  Ian T. Jolliffe,et al.  Principal Component Analysis , 2002, International Encyclopedia of Statistical Science.

[29]  Daniel Eriksson,et al.  Data integration in plant biology: the O2PLS method for combined modeling of transcript and metabolite data. , 2007, The Plant journal : for cell and molecular biology.

[30]  Johan Trygg,et al.  High-throughput data analysis for detecting and identifying differences between samples in GC/MS-based metabolomic analyses. , 2005, Analytical chemistry.

[31]  B. Palsson,et al.  The model organism as a system: integrating 'omics' data sets , 2006, Nature Reviews Molecular Cell Biology.

[32]  Jan Karlsson,et al.  Modular gene expression in Poplar: a multilayer network approach. , 2009, The New phytologist.

[33]  M. Hirai,et al.  Elucidation of Gene-to-Gene and Metabolite-to-Gene Networks in Arabidopsis by Integration of Metabolomics and Transcriptomics* , 2005, Journal of Biological Chemistry.

[34]  M. Zanor,et al.  Integrated Analysis of Metabolite and Transcript Levels Reveals the Metabolic Shifts That Underlie Tomato Fruit Development and Highlight Regulatory Aspects of Metabolic Network Behavior1[W] , 2006, Plant Physiology.

[35]  J. Shao Linear Model Selection by Cross-validation , 1993 .

[36]  David R Goodlett,et al.  Genetic basis of proteome variation in yeast , 2007, Nature Genetics.

[37]  M. Gribskov,et al.  The Genome of Black Cottonwood, Populus trichocarpa (Torr. & Gray) , 2006, Science.

[38]  T. Lundstedt,et al.  Experimental design and optimization , 1998 .

[39]  Bernd Schmidt,et al.  OPLS methodology for analysis of pre-processing effects on spectroscopic data , 2006 .

[40]  M. Rantalainen,et al.  Statistically integrated metabonomic-proteomic studies on a human prostate cancer xenograft model in mice. , 2006, Journal of proteome research.

[41]  M. S. Khots,et al.  D-optimal designs , 1995 .

[42]  Bernd Schmidt,et al.  The OPLS methodology for analysis of multi‐block batch process data , 2006 .

[43]  N. Sherman,et al.  Protein Sequencing and Identification Using Tandem Mass Spectrometry: Kinter/Tandem Mass Spectrometry , 2000 .

[44]  Roberto Todeschini,et al.  A new algorithm for optimal, distance based, experimental design , 1992 .

[45]  B. Sundberg,et al.  A Populus EST resource for plant functional genomics. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[46]  Daniel Eriksson,et al.  Orthogonal projections to latent structures as a strategy for microarray data normalization , 2007, BMC Bioinformatics.

[47]  J. Trygg O2‐PLS for qualitative and quantitative analysis in multivariate calibration , 2002 .