Novel Multivariate Methods for Integration of Genomics and Proteomics Data: Applications in a Kidney Transplant Rejection Study

Abstract Multi-omics research is a key ingredient of data-intensive life sciences research, permitting measurement of biological molecules at different functional levels in the same individual. For a complete picture at the biological systems level, however, appropriate statistical techniques must be developed to integrate different ‘omics’ data sets (e.g., genomics and proteomics). We report here multivariate projection-based approaches to the analysis of genomics and proteomics data sets, using as a case study observations from kidney transplant patients who experienced an acute rejection event (n=20) versus non-rejecting controls (n=20). In these data sets, we show how these novel methodologies might serve as promising tools for dimension reduction and selection of relevant features for different analytical frameworks. Unsupervised analyses highlighted the importance of post-transplant time-of-rejection, while supervised analyses identified gene and protein signatures that together predicted...
