Meta-analysis of gene expression data: a predictor-based approach

MOTIVATION With the increasing availability of cancer microarray data sets there is a growing need for integrative computational methods that evaluate multiple independent microarray data sets investigating a common theme or disorder. Meta-analysis techniques are designed to overcome the low sample size typical to microarray experiments and yield more valid and informative results than each experiment separately. RESULTS We propose a new meta-analysis technique that aims at finding a set of classifying genes, whose expression level may be used to answering the classification question in hand. Specifically, we apply our method to two independent lung cancer microarray data sets and identify a joint core subset of genes which putatively play an important role in tumor genesis of the lung. The robustness of the identified joint core set is demonstrated on a third unseen lung cancer data set, where it leads to successful classification using very few top-ranked genes. Identifying such a set of genes is of significant importance when searching for biologically meaningful biomarkers. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

[1]  Yudong D. He,et al.  Gene expression profiling predicts clinical outcome of breast cancer , 2002, Nature.

[2]  Danh V. Nguyen,et al.  Tumor classification by partial least squares using microarray gene expression data , 2002, Bioinform..

[3]  Todd,et al.  Diffuse large B-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning , 2002, Nature Medicine.

[4]  David E. Misek,et al.  Gene-expression profiles predict survival of patients with lung adenocarcinoma , 2002, Nature Medicine.

[5]  Eytan Domany,et al.  Outcome signature genes in breast cancer: is there a unique set? , 2004, Breast Cancer Research.

[6]  D. Hanahan,et al.  The Hallmarks of Cancer , 2000, Cell.

[7]  Pan‐Chyr Yang,et al.  Up-regulated caveolin-1 accentuates the metastasis capability of lung adenocarcinoma by inducing filopodia formation. , 2002, The American journal of pathology.

[8]  G. Kundu,et al.  Osteopontin: role in cell signaling and cancer progression. , 2006, Trends in cell biology.

[9]  Stefan Michiels,et al.  Prediction of cancer outcome with microarrays: a multiple random validation strategy , 2005, The Lancet.

[10]  P. Schraml,et al.  Differential messenger RNA and protein expression of the receptor for advanced glycosylated end products in normal lung and non-small cell lung carcinoma. , 1997, Cancer research.

[11]  Michael A. Arbib,et al.  The handbook of brain theory and neural networks , 1995, A Bradford book.

[12]  Bruce A Luxon,et al.  Genomic profiling identifies alterations in TGFβ signaling through loss of TGFβ receptor expression in human renal cell carcinogenesis and progression , 2003, Oncogene.

[13]  P. Woll,et al.  Studies on the expression of endothelin, its receptor subtypes, and converting enzymes in lung cancer and in human bronchial epithelium. , 2000, American journal of respiratory cell and molecular biology.

[14]  J. Minna,et al.  Lung cancer • 9: Molecular biology of lung cancer: clinical implications , 2003, Thorax.

[15]  J. Dopazo,et al.  Methods and approaches in the analysis of gene expression data. , 2001, Journal of immunological methods.

[16]  J. Foekens,et al.  Gene-expression profiles to predict distant metastasis of lymph-node-negative primary breast cancer , 2005, The Lancet.

[17]  S. Donnini,et al.  An angiogenic role for the neurokines midkine and pleiotrophin in tumorigenesis. , 1997, Cancer research.

[18]  Sangsoo Kim,et al.  Combining multiple microarray studies and modeling interstudy variation , 2003, ISMB.

[19]  L. Ein-Dor,et al.  Thousands of samples are needed to generate a robust gene list for predicting outcome in cancer. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[20]  Jun Chen,et al.  Joint analysis of two microarray gene-expression data sets to select lung adenocarcinoma marker genes , 2004, BMC Bioinformatics.

[21]  Lucila Ohno-Machado,et al.  Analysis of matched mRNA measurements from two different microarray technologies , 2002, Bioinform..

[22]  Y. Kotake,et al.  Enhanced expression of Tie2, its ligand angiopoietin-1, vascular endothelial growth factor, and CD31 in human non-small cell lung carcinomas. , 1999, Clinical cancer research : an official journal of the American Association for Cancer Research.

[23]  Brad T. Sherman,et al.  DAVID: Database for Annotation, Visualization, and Integrated Discovery , 2003, Genome Biology.

[24]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[25]  Nikunj C. Oza,et al.  Online Ensemble Learning , 2000, AAAI/IAAI.

[26]  Richard Baumgartner,et al.  Class prediction and discovery using gene microarray and proteomics mass spectroscopy data: curses, caveats, cautions , 2003, Bioinform..

[27]  T. Kislinger,et al.  Blockade of RAGE–amphoterin signalling suppresses tumour growth and metastases , 2000, Nature.

[28]  A. Chinnaiyan,et al.  Integrative analysis of the cancer transcriptome , 2005, Nature Genetics.

[29]  A. Bresnick,et al.  The S100A4 metastasis factor regulates cellular motility via a direct interaction with myosin-IIA. , 2006, Cancer research.

[30]  L. Zanetta,et al.  Monoclonal antibody to vascular endothelial-cadherin is a potent inhibitor of angiogenesis, tumor growth, and metastasis. , 2000, Cancer research.

[31]  Y. Saga,et al.  Bikunin Target Genes in Ovarian Cancer Cells Identified by Microarray Analysis* , 2003, The Journal of Biological Chemistry.

[32]  T. Barrette,et al.  Meta-analysis of microarrays: interstudy validation of gene expression profiles reveals pathway dysregulation in prostate cancer. , 2002, Cancer research.

[33]  Roland Eils,et al.  Cross-platform analysis of cancer microarray data improves gene expression based classification of phenotypes , 2005, BMC Bioinformatics.

[34]  M. Schummer,et al.  Selecting Differentially Expressed Genes from Microarray Experiments , 2003, Biometrics.

[35]  Ash A. Alizadeh,et al.  Prediction of survival in diffuse large-B-cell lymphoma based on the expression of six genes. , 2004, The New England journal of medicine.

[36]  D. Botstein,et al.  Diversity of gene expression in adenocarcinoma of the lung , 2001, Proceedings of the National Academy of Sciences of the United States of America.